Terminal Automation
Section titled “Terminal Automation”Your terminal is a programmable screen. Read what’s on it, press keys into it, wait for it to settle — all through HTTP endpoints that understand the terminal’s actual rendering state.
Traditional terminal automation sends raw bytes and hopes for the best. hoody-terminal takes a different approach: it maintains a server-side VT parser (libvterm) that mirrors the exact screen state a human would see. Every automation endpoint operates on this parsed screen, giving you deterministic control over full-screen TUI applications like vim, htop, tmux, and any interactive program.
Inspired by tui-use, but built server-side in C with libvterm. No client-side dependencies. No browser needed. Just HTTP.
Why Terminal Automation?
Section titled “Why Terminal Automation?”Traditional terminal scripting (/api/v1/terminal/execute) works great for commands that produce output and exit. But what about programs that take over the screen?
- vim — you need to navigate, type, save, quit
- htop — you need to select processes, sort columns, send signals
- python3 — you need to feed expressions and read results from a REPL
- ssh — you need to wait for a password prompt, type credentials, then wait for the shell
- tmux — you need to create panes, switch windows, send commands to specific panes
These programs don’t write to stdout/stderr. They draw on the terminal screen using escape sequences. The automation endpoints let you interact with them the way a human would: by reading the screen and pressing keys.
Throughout this page, all examples use a $TERMINAL variable for the base URL:
TERMINAL="https://$PROJECT-$CONTAINER-terminal-1.$SERVER.containers.hoody.icu"Set this once and every curl/fetch example below works as-is.
Quick Start
Section titled “Quick Start”Here’s a complete workflow: launch Python, run a calculation, read the result.
Step 1: Start a Python REPL
# Start python3 (don't wait -- it takes over the screen)curl -X POST "$TERMINAL/api/v1/terminal/execute" \ -H "Content-Type: application/json" \ -d '{"command": "python3", "wait": false}'const TERMINAL = `https://${PROJECT}-${CONTAINER}-terminal-1.${SERVER}.containers.hoody.icu`;
// Start python3 (don't wait -- it takes over the screen)await fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'python3', wait: false })});Step 2: Wait for the Python prompt
# Wait until ">>>" appears on screencurl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": ">>> $", "timeout_ms": 5000}'// Wait for the ">>>" promptawait fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'regex', pattern: '>>> $', timeout_ms: 5000 })});Step 3: Type a calculation and press Enter
# Paste the expressioncurl -X POST "$TERMINAL/api/v1/terminal/paste?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"text": "2 ** 256"}'
# Press Entercurl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"key": "enter"}'// Paste the expressionawait fetch(`${TERMINAL}/api/v1/terminal/paste?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ text: '2 ** 256' })});
// Press Enterawait fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ key: 'enter' })});Step 4: Wait for the result and read the screen
# Wait for the next prompt (means the result has been printed)curl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": ">>> $", "timeout_ms": 5000}'
# Read the screencurl "$TERMINAL/api/v1/terminal/snapshot?terminal_id=1"// Wait for the next promptconst waitResult = await fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'regex', pattern: '>>> $', timeout_ms: 5000 })});const { snapshot } = await waitResult.json();
// The snapshot is included in the wait response -- no separate call needed!console.log(snapshot.lines);// [">>> 2 ** 256", "115792089237316195423570985008687907853269984665640564039457584007913129639936", ">>> "]Endpoints Reference Summary
Section titled “Endpoints Reference Summary”All automation endpoints live under /api/v1/terminal/ and require a terminal_id parameter (query string or URL path).
| Endpoint | Method | Description |
|---|---|---|
/api/v1/terminal/snapshot | GET | Rendered viewport: lines, cursor, title, fullscreen state, highlights, sequence counter |
/api/v1/terminal/find | GET | PCRE2 regex search on rendered screen with cell-coordinate hits |
/api/v1/terminal/press | POST | Send named key presses (mode-aware: respects DECCKM/DECKPAM) |
/api/v1/terminal/write | POST | Raw byte injection — escape hatch when /press and /paste don’t fit |
/api/v1/terminal/paste | POST | Bracketed paste with full UTF-8 support |
/api/v1/terminal/wait | POST | Async wait until stable/regex-match/either, returns atomic snapshot |
Full API reference: Terminal API Reference —> for complete OpenAPI docs with all parameters, response schemas, and error codes.
Snapshot
Section titled “Snapshot”GET /api/v1/terminal/snapshot
Returns the terminal screen exactly as a human would see it: a grid of text lines, the cursor position, the window title, and whether the program is in fullscreen (alt-screen) mode.
Parameters
Section titled “Parameters”| Parameter | Type | Default | Description |
|---|---|---|---|
terminal_id | string | required | Terminal session ID (1-65535) |
include_colors | boolean | false | Include ANSI SGR colored_lines array alongside plain text |
include_highlights | boolean | true | Include reverse-video highlight spans |
scroll_offset | integer | 0 | Lines into scrollback (0 = live viewport) |
Example
Section titled “Example”# Basic snapshotcurl "$TERMINAL/api/v1/terminal/snapshot?terminal_id=1"
# With colors and scrollbackcurl "$TERMINAL/api/v1/terminal/snapshot?terminal_id=1&include_colors=true&scroll_offset=10"const res = await fetch(`${TERMINAL}/api/v1/terminal/snapshot?terminal_id=1`);const snap = await res.json();
console.log(snap.lines); // ["$ ls -la", "total 16", "drwxr-xr-x 3 user user 4096 .", "..."]console.log(snap.cursor); // { row: 2, col: 0, visible: true }console.log(snap.title); // "bash"console.log(snap.is_fullscreen); // false (true when vim, htop, etc. are running)console.log(snap.seq); // 42 (monotonic counter -- changes on every screen update)Response Structure
Section titled “Response Structure”{ "terminal_id": "1", "cols": 80, "rows": 24, "lines": [ "$ ls -la", "total 16", "drwxr-xr-x 3 user user 4096 .", "" ], "cursor": { "row": 2, "col": 0, "visible": true }, "title": "bash", "is_fullscreen": false, "scroll_offset": 0, "seq": 42, "highlights": [ { "row": 0, "col": 2, "length": 5 } ]}Key fields:
lines— Array of strings, one per visible row. Trailing whitespace is trimmed. Empty rows appear as empty strings.cursor— Row/col position (0-indexed) and visibility. Programs like vim move the cursor; shell prompts park it at the input position.is_fullscreen—truewhen the program has switched to the alternate screen buffer (vim, htop, less, tmux). Useful for knowing whether you’re in a TUI or at a shell prompt.seq— Monotonic sequence counter. Increments on every screen update. Use this to detect whether the screen has changed between two snapshots without comparing all lines.highlights— Reverse-video spans (used by search highlights, selection, etc.). Each entry hasrow,col,length.
Scrollback
Section titled “Scrollback”Set scroll_offset to read lines that have scrolled off the top of the screen. A value of 10 means “show me what was on screen 10 lines ago.” The viewport is rows lines tall, so scroll_offset=24 on an 80x24 terminal shows the previous full page.
The scrollback buffer holds up to 500 lines by default (configurable with --vterm-scrollback-lines, max 10000).
GET /api/v1/terminal/find
Search the rendered terminal screen for a PCRE2 regular expression. Returns cell-coordinate hits with matched text — useful for locating specific output, error messages, or UI elements on the screen.
Parameters
Section titled “Parameters”| Parameter | Type | Default | Description |
|---|---|---|---|
terminal_id | string | required | Terminal session ID |
pattern | string | required | PCRE2 regex pattern (max 1024 bytes) |
scope | string | "screen" | Where to search: screen, scrollback, or all |
limit | integer | 100 | Max hits to return (max 1000) |
case_insensitive | boolean | false | Case-insensitive matching |
scroll_offset | integer | 0 | Scrollback offset for screen scope (0 = live viewport). |
Example
Section titled “Example”# Find all error messages on screencurl "$TERMINAL/api/v1/terminal/find?terminal_id=1&pattern=error&case_insensitive=true"
# Find IP addresses in scrollback# (backslashes doubled because the URL is in double quotes -- shell would otherwise eat single backslashes)curl "$TERMINAL/api/v1/terminal/find?terminal_id=1&pattern=\\d{1,3}\\.\\d{1,3}\\.\\d{1,3}\\.\\d{1,3}&scope=all"
# Find the shell promptcurl "$TERMINAL/api/v1/terminal/find?terminal_id=1&pattern=%24%20%24"# (URL-encoded: "$ $" -- dollar, space, end of line)// Find all lines containing "ERROR" or "error"const res = await fetch( `${TERMINAL}/api/v1/terminal/find?terminal_id=1` + `&pattern=error&case_insensitive=true`);const result = await res.json();
for (const hit of result.hits) { console.log(`Found "${hit.text}" at row ${hit.row}, col ${hit.col}`);}// Found "error" at row 5, col 12// Found "Error" at row 8, col 0Response Structure
Section titled “Response Structure”{ "pattern": "error", "scope": "screen", "hits": [ { "row": 5, "col": 12, "length": 5, "text": "error" }, { "row": 8, "col": 0, "length": 5, "text": "Error" } ], "total": 2, "truncated": false}Key fields:
hits— Array of matches with cell coordinates (0-indexedrow/col),lengthin characters, and matchedtext.truncated—trueif the number of hits reached thelimit. Increaselimitor narrow your pattern.scope— Echoes back which scope was searched.
Search Scopes
Section titled “Search Scopes”| Scope | Description |
|---|---|
screen | Visible viewport only (default). Fast, covers what a user would see. |
scrollback | Only the scrollback buffer (lines that scrolled off the top). |
all | Both screen and scrollback. Use when you’re not sure where the match is. |
POST /api/v1/terminal/press
Send named key presses to the terminal. Keys are encoded through libvterm’s keyboard API, which respects the terminal’s current mode (DECCKM for cursor keys, DECKPAM for keypad). This means arrow keys, function keys, and ctrl sequences automatically generate the correct byte sequences for whatever program is running.
Request Body
Section titled “Request Body”{ "key": "enter"}Or send multiple keys in sequence:
{ "keys": ["escape", ":", "w", "q", "enter"]}Parameters
Section titled “Parameters”| Parameter | Type | Description |
|---|---|---|
terminal_id | string (query) | Terminal session ID (required) |
key | string (body) | Single key name |
keys | string[] (body) | Array of key names to press in sequence (max 256) |
Example
Section titled “Example”# Press Entercurl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"key": "enter"}'
# Press Ctrl+C (interrupt)curl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"key": "ctrl+c"}'
# Type ":wq" and press Enter (save and quit vim)curl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"keys": ["escape", ":", "w", "q", "enter"]}'
# Navigate with arrow keyscurl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"keys": ["arrow_down", "arrow_down", "arrow_down", "enter"]}'// Press Ctrl+Cawait fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ key: 'ctrl+c' })});
// Type ":wq" and press Enter in vimawait fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ keys: ['escape', ':', 'w', 'q', 'enter'] })});Response
Section titled “Response”{ "status": "ok", "bytes_written": 7}Supported Keys
Section titled “Supported Keys”All key names are case-insensitive. Many keys have aliases for convenience.
Special Keys:
| Key | Aliases | Description |
|---|---|---|
enter | return, cr, ctrl+m, c-m | Enter/Return key |
tab | ctrl+i, c-i | Tab key |
escape | esc, ctrl+[, c-[ | Escape key |
backspace | bs, ctrl+h, c-h | Backspace key |
space | Space bar | |
backtab | shift+tab, s-tab | Shift+Tab (reverse tab) |
Arrow Keys:
| Key | Aliases | Description |
|---|---|---|
arrow_up | up | Up arrow |
arrow_down | down | Down arrow |
arrow_left | left | Left arrow |
arrow_right | right | Right arrow |
Navigation:
| Key | Aliases | Description |
|---|---|---|
home | Home key | |
end | End key | |
page_up | pgup, pageup | Page Up |
page_down | pgdn, pagedown | Page Down |
insert | ins | Insert key |
delete | del | Delete key |
Function Keys:
| Key | Description |
|---|---|
f1 through f12 | Function keys F1-F12 |
Ctrl Combinations:
| Key | Aliases | Byte | Description |
|---|---|---|---|
ctrl+a through ctrl+z | c-a through c-z | 0x01-0x1A | Ctrl+letter. ctrl+h -> backspace, ctrl+i -> tab, ctrl+m -> enter |
ctrl+space | c-space, ctrl+@, c-@ | 0x00 | NUL byte |
ctrl+j | c-j | 0x0A | Raw line feed (LF) — distinct from enter which may send CR |
ctrl+\\ | c-\\ | 0x1C | SIGQUIT in shell |
ctrl+] | c-] | 0x1D | Ctrl+Right bracket |
ctrl+^ | c-^ | 0x1E | Ctrl+Caret |
ctrl+_ | c-_ | 0x1F | Ctrl+Underscore |
ctrl+? | c-? | 0x7F | DEL character |
Modified Keys:
| Key | Description |
|---|---|
shift+arrow_up/down/left/right | Shift+Arrow (text selection in some programs) |
ctrl+arrow_up/down/left/right | Ctrl+Arrow (word navigation in some programs) |
alt+enter | Alt+Enter |
alt+backspace | Alt+Backspace (delete word in zsh/bash) |
Single Characters:
Any single printable ASCII character (! through ~, plus space) can be used as a key name. For example, {"key": "a"} presses the letter “a”, {"key": "!"} presses exclamation mark.
POST /api/v1/terminal/paste
Paste text into the terminal with optional bracketed paste mode. This is the preferred way to send multi-character text (commands, code snippets, file content) into the terminal.
Request Body
Section titled “Request Body”{ "text": "echo 'Hello, World!'", "bracketed": true}Parameters
Section titled “Parameters”| Parameter | Type | Default | Description |
|---|---|---|---|
terminal_id | string (query) | required | Terminal session ID |
text | string (body) | required | Text to paste (UTF-8) |
bracketed | boolean (body) | true | Use bracketed paste mode if the program supports it |
Example
Section titled “Example”# Paste a command (bracketed paste protects against auto-indent)curl -X POST "$TERMINAL/api/v1/terminal/paste?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"text": "echo Hello, World!"}'
# Paste multi-line code into vimcurl -X POST "$TERMINAL/api/v1/terminal/paste?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"text": "def hello():\n print(\"Hello!\")\n\nhello()"}'
# Paste without bracketed mode (raw keystrokes)curl -X POST "$TERMINAL/api/v1/terminal/paste?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"text": "ls -la", "bracketed": false}'// Paste a multi-line Python scriptawait fetch(`${TERMINAL}/api/v1/terminal/paste?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ text: 'for i in range(10):\n print(f"Item {i}")\n', bracketed: true })});Response
Section titled “Response”{ "status": "ok", "bytes_written": 42, "bracketed_active": true}Key fields:
bytes_written— Number of bytes written to the terminal PTY.bracketed_active—trueif the program had DECSET 2004 enabled and bracketed paste markers were actually sent.falseif the program doesn’t support bracketed paste (the text was still sent, just without markers).
Paste vs Press
Section titled “Paste vs Press”| Use | Paste | Press |
|---|---|---|
| Multi-character text | Preferred — single HTTP call, bracketed paste protection | Works but requires one key per character |
| Special keys | Cannot send Enter, Escape, Ctrl+C, etc. | Designed for this |
| Code with newlines | Handles \n correctly with bracketed paste | Each line would need separate Enter presses |
| UTF-8 / emoji / CJK | Full support | Single printable ASCII only |
| Speed | Fast — single write | Sequential key-by-key |
Bracketed Paste Mode
Section titled “Bracketed Paste Mode”When bracketed is true (default), the text is wrapped in escape sequences (\e[200~ … \e[201~) if the running program has opted in via DECSET 2004. Most modern programs support this:
- zsh, bash (readline), fish — Yes. Prevents auto-execution of pasted newlines.
- vim, neovim — Yes. Prevents auto-indent mangling of pasted code.
- python REPL — No (unless using IPython). Text is pasted as raw keystrokes.
- htop, top — No. These aren’t text input programs.
When bracketed is false, the text is sent as raw keystrokes regardless of the program’s paste mode setting.
Write (Raw Bytes)
Section titled “Write (Raw Bytes)”POST /api/v1/terminal/write
/write is the raw-byte escape hatch for terminal automation. It injects bytes directly into the session’s PTY master fd, exactly as if typed at a physical keyboard. Use it when you need to send escape sequences the /press key table doesn’t cover, or when you want byte-level control over what hits the shell.
Request Body
Section titled “Request Body”{ "input": "y", "enter": true}Parameters
Section titled “Parameters”| Parameter | Type | Default | Description |
|---|---|---|---|
terminal_id | string (query) | required | Terminal session ID |
input | string (body) | required | Text to type (UTF-8). Empty string is valid — sends just an Enter if enter=true. |
enter | boolean (body) | true | Auto-append a newline after input. Set to false for raw-keystroke input. |
Example
Section titled “Example”# Answer a y/n promptcurl -X POST "$TERMINAL/api/v1/terminal/write?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"input": "y"}'
# Send raw bytes without an auto-Entercurl -X POST "$TERMINAL/api/v1/terminal/write?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"input": "[6~", "enter": false}'
# Just press Entercurl -X POST "$TERMINAL/api/v1/terminal/write?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"input": ""}'// Answer a y/n promptawait fetch(`${TERMINAL}/api/v1/terminal/write?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ input: 'y' })});
// Send raw bytes without an auto-Enterawait fetch(`${TERMINAL}/api/v1/terminal/write?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ input: '[6~', enter: false })});
// Just press Enterawait fetch(`${TERMINAL}/api/v1/terminal/write?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ input: '' })});Response
Section titled “Response”{ "success": true, "terminal_id": "1", "bytes_written": 2 }POST /api/v1/terminal/wait
Block until a condition is met, then return an atomic snapshot of the screen at the exact moment of resolution. This is the key endpoint that eliminates sleep-polling from terminal automation scripts.
The word “atomic” matters here: the snapshot is captured at the same instant the condition resolves. If you called /wait and /snapshot as two separate requests, the screen could change between them (a TOCTOU race). With /wait, the snapshot in the response is guaranteed to reflect the state that matched your condition.
Request Body
Section titled “Request Body”{ "mode": "regex", "pattern": "\\$ $", "timeout_ms": 10000, "debounce_ms": 100}Parameters
Section titled “Parameters”| Parameter | Type | Default | Description |
|---|---|---|---|
terminal_id | string (query) | required | Terminal session ID |
mode | string (body) | "stable" | Wait mode: stable, regex, or either |
pattern | string (body) | — | PCRE2 regex (required for regex and either modes, max 1024 bytes) |
timeout_ms | integer (body) | 5000 | Hard deadline in ms (10-300000) |
debounce_ms | integer (body) | 100 | Stable mode debounce in ms (10-60000) |
search_scope | string (body) | "screen" | Where to search: screen, scrollback, or all |
include_colors | boolean (body) | false | Include colored_lines in snapshot |
include_highlights | boolean (body) | true | Include highlights in snapshot |
Wait Modes
Section titled “Wait Modes”stable — Wait until the screen stops changing. The endpoint watches the terminal’s sequence counter (seq) and resolves when no screen updates arrive for debounce_ms consecutive milliseconds. Think of debounce_ms as “how long must the screen be quiet before I consider it settled.” Use this when you don’t know what the output will look like, but you know the program will eventually stop printing.
# Wait until output settles (500ms of quiet)curl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "stable", "debounce_ms": 500, "timeout_ms": 30000}'const result = await fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'stable', debounce_ms: 500, timeout_ms: 30000 })});const { status, snapshot } = await result.json();// status: "stable" or "timeout"regex — Wait until a PCRE2 pattern matches on the screen. Resolves the instant the match appears, returning the match coordinates alongside the snapshot.
# Wait for shell promptcurl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": "\\$ $", "timeout_ms": 10000}'
# Wait for a specific error messagecurl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": "BUILD (SUCCESS|FAILED)", "timeout_ms": 60000}'const result = await fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'regex', pattern: '\\$ $', timeout_ms: 10000 })});const { status, match, snapshot } = await result.json();// status: "matched" or "timeout"// match: { row: 10, col: 2, length: 2, text: "$ " }either — First condition wins: resolves on regex match OR stability, whichever comes first. Useful when you’re not sure if the program will produce a specific prompt or just stop outputting.
# Wait for either a prompt or 2 seconds of quietcurl -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "either", "pattern": "[\\$#>] $", "debounce_ms": 2000, "timeout_ms": 30000}'const result = await fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'either', pattern: '[\\$#>] $', debounce_ms: 2000, timeout_ms: 30000 })});const { status, snapshot } = await result.json();// status: "matched", "stable", or "timeout"Response Structure
Section titled “Response Structure”{ "status": "matched", "elapsed_ms": 423, "match": { "row": 10, "col": 2, "length": 2, "text": "$ " }, "snapshot": { "terminal_id": "1", "cols": 80, "rows": 24, "lines": ["..."], "cursor": { "row": 10, "col": 4, "visible": true }, "title": "bash", "is_fullscreen": false, "seq": 42 }}Status values:
| Status | Meaning |
|---|---|
matched | Regex pattern matched on screen |
stable | Screen was stable for debounce_ms (no regex match in either mode) |
timeout | Neither condition met before timeout_ms |
exited | Underlying process died mid-wait. Includes snapshot. |
vterm_reinit | VT parser was torn down and re-initialized mid-wait (memory-cap resize). Client should retry; no match or snapshot returned. |
Always check all five statuses; treating only matched/stable as success and ignoring exited/vterm_reinit can cause silent failures.
Error Handling
Section titled “Error Handling”Every automation endpoint returns standard HTTP status codes. Handle these in your scripts to build robust automation.
Status Codes
Section titled “Status Codes”| Code | Meaning | When It Happens |
|---|---|---|
| 200 | Success | Request completed normally |
| 400 | Bad request | Invalid key name, malformed regex, missing required parameter, body too large |
| 404 | Session not found | The terminal_id doesn’t exist or the session has been terminated |
| 429 | Too many waiters | More than 16 concurrent /wait requests on the same session |
| 503 | Resource exhausted | libvterm memory cap exceeded — too many concurrent automation sessions |
Error Response Format
Section titled “Error Response Format”All errors return a JSON body with an error field and a human-readable message:
{ "error": "invalid_key", "message": "Unknown key name 'crtl+c'. Did you mean 'ctrl+c'?", "supported_keys": ["enter", "tab", "escape", "..."]}Handling Errors in Practice
Section titled “Handling Errors in Practice”# Check HTTP status code with -wHTTP_CODE=$(curl -s -o /tmp/resp.json -w '%{http_code}' \ -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"key": "enter"}')
case "$HTTP_CODE" in 200) echo "Key sent" ;; 400) echo "Bad request: $(cat /tmp/resp.json)" ;; 404) echo "Session does not exist" ;; 429) echo "Too many waiters -- wait for existing ones to resolve" ;; 503) echo "Server overloaded -- back off and retry" ;;esacasync function pressKey(key) { const res = await fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ key }) });
if (res.ok) return res.json();
const err = await res.json(); switch (res.status) { case 400: throw new Error(`Bad request: ${err.message}`); case 404: throw new Error('Terminal session not found'); case 429: throw new Error('Too many concurrent waiters'); case 503: throw new Error('Server overloaded, retry later'); default: throw new Error(`Unexpected ${res.status}: ${err.message}`); }}Common Patterns
Section titled “Common Patterns”Reusable building blocks that show up in most automation scripts. Copy these into your projects.
Type a String (Key by Key)
Section titled “Type a String (Key by Key)”If you need to type text character-by-character instead of pasting (some TUI programs don’t support paste), split the string into individual key presses:
# Type "hello" key by keycurl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"keys": ["h", "e", "l", "l", "o"]}'// Helper: type a string as individual key pressesasync function typeString(text) { const keys = [...text]; // split into characters await fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ keys }) });}
await typeString(':wq');Run a Command and Wait for the Result
Section titled “Run a Command and Wait for the Result”The most common pattern: paste a command, press Enter, wait for the prompt, read the output.
# Paste command, press Enter, wait for prompt, read screencurl -X POST "$TERMINAL/api/v1/terminal/paste?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"text": "uname -a"}'
curl -X POST "$TERMINAL/api/v1/terminal/press?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"key": "enter"}'
RESULT=$(curl -s -X POST "$TERMINAL/api/v1/terminal/wait?terminal_id=1" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": "\\$ $", "timeout_ms": 10000}')
echo "$RESULT" | jq -r '.snapshot.lines[]'// Helper: run a command and return the screen after completionasync function runCommand(cmd, promptPattern = '\\$ $', timeoutMs = 10000) { await fetch(`${TERMINAL}/api/v1/terminal/paste?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ text: cmd }) }); await fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ key: 'enter' }) }); const res = await fetch(`${TERMINAL}/api/v1/terminal/wait?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ mode: 'regex', pattern: promptPattern, timeout_ms: timeoutMs }) }); return res.json();}
const { status, snapshot } = await runCommand('uname -a');if (status === 'matched') { console.log(snapshot.lines);}Check if a TUI Is Running
Section titled “Check if a TUI Is Running”Use the is_fullscreen field from a snapshot to detect whether a full-screen program (vim, htop, less, tmux) is currently active:
# Check if we're in a TUI or at a shell promptIS_TUI=$(curl -s "$TERMINAL/api/v1/terminal/snapshot?terminal_id=1" | jq '.is_fullscreen')
if [ "$IS_TUI" = "true" ]; then echo "A full-screen program is running -- press 'q' or Ctrl+C to exit first"else echo "At shell prompt -- safe to run commands"fiasync function isTuiRunning() { const res = await fetch(`${TERMINAL}/api/v1/terminal/snapshot?terminal_id=1`); const snap = await res.json(); return snap.is_fullscreen;}
if (await isTuiRunning()) { // Exit the TUI first await fetch(`${TERMINAL}/api/v1/terminal/press?terminal_id=1`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ key: 'q' }) });}Wait-Then-Act Loop
Section titled “Wait-Then-Act Loop”For multi-step interactions (wizards, installers, interactive prompts), use a loop that waits for a condition, inspects the screen, and decides the next action. This example assumes wait, paste, typeString, and press helpers like those defined above:
// Drive an interactive installer step by stepconst steps = [ { wait: 'Accept license\\?', action: async () => { await typeString('yes'); await press('enter'); } }, { wait: 'Install directory', action: async () => { await paste('/opt/app'); await press('enter'); } }, { wait: 'Confirm\\?', action: async () => press('enter') }, { wait: '\\$ $', action: null } // done -- back at shell];
for (const step of steps) { const { status } = await wait({ mode: 'regex', pattern: step.wait, timeout_ms: 30000 }); if (status === 'timeout') throw new Error(`Timed out waiting for: ${step.wait}`); if (step.action) await step.action();}Practical Workflows
Section titled “Practical Workflows”1. Drive Vim: Open, Edit, Save, Quit
Section titled “1. Drive Vim: Open, Edit, Save, Quit”TID="terminal_id=1"
# Open a file in vimcurl -X POST "$TERMINAL/api/v1/terminal/execute" \ -H "Content-Type: application/json" \ -d '{"command": "vim /tmp/hello.py", "wait": false}'
# Wait for vim to load (alt-screen active)curl -X POST "$TERMINAL/api/v1/terminal/wait?$TID" \ -H "Content-Type: application/json" \ -d '{"mode": "stable", "debounce_ms": 300, "timeout_ms": 5000}'
# Enter insert modecurl -X POST "$TERMINAL/api/v1/terminal/press?$TID" \ -H "Content-Type: application/json" \ -d '{"key": "i"}'
# Type some codecurl -X POST "$TERMINAL/api/v1/terminal/paste?$TID" \ -H "Content-Type: application/json" \ -d '{"text": "#!/usr/bin/env python3\nprint(\"Hello from vim automation!\")\n"}'
# Exit insert mode, save, and quitcurl -X POST "$TERMINAL/api/v1/terminal/press?$TID" \ -H "Content-Type: application/json" \ -d '{"keys": ["escape", ":", "w", "q", "enter"]}'
# Wait for vim to close (back to shell prompt)curl -X POST "$TERMINAL/api/v1/terminal/wait?$TID" \ -H "Content-Type: application/json" \ -d '{"mode": "regex", "pattern": "\\$ $", "timeout_ms": 5000}'const TID = 'terminal_id=1';
const press = (keys) => fetch(`${TERMINAL}/api/v1/terminal/press?${TID}`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify(Array.isArray(keys) ? { keys } : { key: keys })});const paste = (text) => fetch(`${TERMINAL}/api/v1/terminal/paste?${TID}`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ text })});const wait = (opts) => fetch(`${TERMINAL}/api/v1/terminal/wait?${TID}`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify(opts)}).then(r => r.json());
// Open vimawait fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'vim /tmp/hello.py', wait: false })});
// Wait for vim to loadawait wait({ mode: 'stable', debounce_ms: 300, timeout_ms: 5000 });
// Enter insert mode, type code, save and quitawait press('i');await paste('#!/usr/bin/env python3\nprint("Hello from vim automation!")\n');await press(['escape', ':', 'w', 'q', 'enter']);
// Wait for shell promptconst { snapshot } = await wait({ mode: 'regex', pattern: '\\$ $', timeout_ms: 5000 });console.log('Back at shell:', snapshot.lines[snapshot.cursor.row]);2. Navigate htop: Filter and Kill a Process
Section titled “2. Navigate htop: Filter and Kill a Process”// Launch htop and wait for it to renderawait fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'htop', wait: false })});await wait({ mode: 'stable', debounce_ms: 500, timeout_ms: 5000 });
// Filter for "node" processes (F4 opens htop's filter)await press('f4');await paste('node');await wait({ mode: 'stable', debounce_ms: 300, timeout_ms: 3000 });
// Read the filtered listconst { snapshot } = await wait({ mode: 'stable', debounce_ms: 200, timeout_ms: 2000 });console.log('Filtered:', snapshot.lines.filter(l => l.includes('node')));
// Send SIGTERM to selected process (F9), then quitawait press(['f9', 'enter']);await press('q');3. Python REPL: Define a Function and Call It
Section titled “3. Python REPL: Define a Function and Call It”This extends the Quick Start pattern with a multi-line function definition. The key trick: paste with bracketed: false for REPLs that don’t support bracketed paste, and press Enter twice to end an indented block.
// Start Python and wait for prompt (see Quick Start for the curl equivalent)await fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'python3', wait: false })});await wait({ mode: 'regex', pattern: '>>> $', timeout_ms: 5000 });
// Paste a multi-line function (bracketed: false for Python's REPL)await paste('def fib(n):\n a, b = 0, 1\n for _ in range(n):\n a, b = b, a + b\n return a\n');await press(['enter', 'enter']); // two Enters to close the blockawait wait({ mode: 'regex', pattern: '>>> $', timeout_ms: 5000 });
// Call the function and read the resultawait paste('fib(100)');await press('enter');const { snapshot } = await wait({ mode: 'regex', pattern: '>>> $', timeout_ms: 5000 });
// Find the line containing the big numberconst resultLine = snapshot.lines.find(l => /^\d{10,}$/.test(l.trim()));console.log('fib(100) =', resultLine?.trim());// fib(100) = 354224848179261915075
// Exit Pythonawait press('ctrl+d');4. Drive an SSH Connection
Section titled “4. Drive an SSH Connection”SSH is a classic multi-step interactive flow: wait for host key confirmation or password prompt, respond, then wait for the remote shell.
// Start SSH (don't wait -- it takes over the screen)await fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'ssh user@remote-server.example.com', wait: false })});
// Wait for password prompt or host key confirmationconst { match } = await wait({ mode: 'regex', pattern: '(password:|yes/no)', timeout_ms: 15000});
if (match?.text.includes('yes/no')) { await paste('yes'); await press('enter'); await wait({ mode: 'regex', pattern: 'password:', timeout_ms: 10000 });}
// Type password and wait for remote shell promptawait paste('my-password');await press('enter');const result = await wait({ mode: 'regex', pattern: '[\\$#] $', timeout_ms: 15000 });console.log('Connected!', result.snapshot.lines[result.snapshot.cursor.row]);5. Run Command and Extract Output with /find
Section titled “5. Run Command and Extract Output with /find”Combines the “run command and wait” pattern from Common Patterns with /find to extract structured data:
// Run df -h and wait for completionawait paste('df -h /');await press('enter');const { snapshot } = await wait({ mode: 'regex', pattern: '\\$ $', timeout_ms: 5000 });
// Use /find to extract the disk usage percentage from the screenconst findRes = await fetch( `${TERMINAL}/api/v1/terminal/find?${TID}&pattern=\\d+%25` // %25 = URL-encoded %).then(r => r.json());
console.log('Disk usage:', findRes.hits[0]?.text); // "42%"
// Or parse directly from the snapshot linesconst dfLine = snapshot.lines.find(l => l.includes('/dev/'));6. Interact with tmux
Section titled “6. Interact with tmux”tmux uses a prefix key (Ctrl+B by default) followed by a command key. The /press endpoint handles this naturally since it sends keys sequentially:
// Start tmuxawait fetch(`${TERMINAL}/api/v1/terminal/execute`, { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ command: 'tmux new-session -s auto', wait: false })});await wait({ mode: 'stable', debounce_ms: 500, timeout_ms: 5000 });
// Split horizontally (Ctrl+B, then %)await press(['ctrl+b', '%']);await wait({ mode: 'stable', debounce_ms: 300, timeout_ms: 3000 });
// Run a command in the new paneawait paste('watch -n1 date');await press('enter');
// Switch back to first paneawait press(['ctrl+b', 'arrow_left']);7. Monitor a Long-Running Build
Section titled “7. Monitor a Long-Running Build”The "either" mode shines here: match on a known success/failure message OR fall back to stability if the output is unexpected. Set a generous timeout_ms — the wait returns instantly when the condition is met.
// Start buildawait paste('npm run build 2>&1');await press('enter');
// Wait up to 5 minutes for completionconst { status, match } = await wait({ mode: 'either', pattern: '(Build succeeded|ERROR|FAILED|\\$ $)', debounce_ms: 3000, timeout_ms: 300000});
if (status === 'matched' && match.text.includes('ERROR')) { // Use /find to collect all error lines (including scrollback) const errors = await fetch( `${TERMINAL}/api/v1/terminal/find?${TID}&pattern=ERROR.*&scope=all` ).then(r => r.json()); console.error('Build failed with', errors.total, 'errors');} else { console.log('Build succeeded!');}Mode-Aware Key Encoding
Section titled “Mode-Aware Key Encoding”One of the most important features of /press is that it generates correct escape sequences for the terminal’s current mode. This is why it uses libvterm internally instead of sending raw bytes.
The Problem with Raw Bytes
Section titled “The Problem with Raw Bytes”When you send arrow keys to a terminal, the correct byte sequence depends on the terminal’s mode:
| Key | Normal Mode (DECCKM off) | Application Mode (DECCKM on) |
|---|---|---|
| Arrow Up | \e[A | \eOA |
| Arrow Down | \e[B | \eOB |
| Arrow Left | \e[D | \eOD |
| Arrow Right | \e[C | \eOC |
| Home | \e[H | \eOH |
| End | \e[F | \eOF |
Programs like vim, htop, and less enable application cursor mode (DECCKM). If you send the wrong byte sequence, the program ignores the key or does something unexpected.
Similarly, the numeric keypad has two modes: numeric (sends digits) and application (DECKPAM, sends \eO sequences). Programs use this for cursor navigation in TUI menus.
How /press Solves This
Section titled “How /press Solves This”When you call /press, the endpoint:
- Looks up the key name in the key table
- Calls libvterm’s keyboard API (
vterm_keyboard_keyorvterm_keyboard_unichar) with the appropriateVTermKeyenum and modifiers - libvterm checks the terminal’s current DECCKM/DECKPAM/DECNKM state and generates the correct byte sequence
- The generated bytes are drained from libvterm’s output buffer and written to the terminal PTY
This means the same /press call generates different byte sequences depending on what program is running. You don’t need to know or care about terminal modes — just send arrow_up and it works in vim, htop, bash, tmux, or any other program.
Memory and Performance
Section titled “Memory and Performance”The automation subsystem maintains a server-side libvterm instance for each terminal session that uses automation endpoints.
Lazy Initialization and Replay
Section titled “Lazy Initialization and Replay”libvterm instances are created on demand when the first automation endpoint is called for a session. Sessions that never use automation have zero memory overhead.
On first use, the session’s output buffer is replayed through libvterm to reconstruct the full terminal state (screen content, cursor position, modes, colors). This is how /press knows the correct byte sequences — libvterm tracks the terminal’s mode from the replayed output. The replay is fast (native C code) but does consume CPU proportional to the output buffer size. Subsequent calls use the already-initialized instance, kept in sync by feeding new output as it arrives.
Memory Cap
Section titled “Memory Cap”All libvterm instances share a global memory cap (default: 512 MB, configurable with --vterm-memory-cap-mb). Each instance consumes memory proportional to cols * rows * cell_size + scrollback_lines * cols * cell_size.
For a typical 80x24 terminal with 500 scrollback lines:
- Per-instance: approximately 1-2 MB
- 512 MB cap supports roughly 250-500 concurrent automation sessions
If the memory cap is exceeded, new automation requests return HTTP 503 with a vterm_memory_cap error including current and maximum usage.
Idle Eviction
Section titled “Idle Eviction”libvterm instances are evicted after 10 minutes of inactivity (configurable with --vterm-idle-ttl-sec). “Inactivity” means no automation endpoint has been called for that session.
Evicted instances are transparently re-initialized on the next automation call. The replay overhead is typically negligible (a few milliseconds for normal sessions, up to 100ms for sessions with very large output buffers).
Scrollback Buffer
Section titled “Scrollback Buffer”Each libvterm instance maintains its own scrollback buffer (default: 500 lines, configurable with --vterm-scrollback-lines, max 10000). This is separate from the terminal session’s raw output buffer.
The scrollback stores rendered cells (text + attributes), making /find and /snapshot with scroll_offset fast — no re-parsing needed.
Configuration Flags
Section titled “Configuration Flags”| Flag | Default | Description |
|---|---|---|
--vterm-memory-cap-mb | 512 | Global memory cap for all libvterm instances |
--vterm-scrollback-lines | 500 | Scrollback lines per instance (0-10000) |
--vterm-idle-ttl-sec | 600 | Seconds of idle before eviction |
--wait-max-waiters-per-session | 16 | Max concurrent /wait requests per session |
Best Practices
Section titled “Best Practices”-
Use
/waitinstead of polling/snapshotin a loop. The wait endpoint is event-driven and returns the instant the condition is met. Polling wastes HTTP round-trips and can miss transient states. -
Use
"mode": "either"when you’re not sure about the exact prompt. Combine a regex for the expected prompt with a stability debounce as a fallback. This handles both normal and error cases. -
Use
/pastefor text,/pressfor actions. Paste your command text, then press Enter. Paste your code, then press Escape. This is faster and more reliable than pressing keys one by one. -
Check
is_fullscreenin snapshots to know if you’re in a TUI program or at a shell prompt. This helps your automation script adapt to unexpected states. -
Use the
seqcounter to detect screen changes without comparing all lines. Ifseqhasn’t changed between two snapshots, the screen is identical. -
Keep
timeout_msgenerous for operations that might take time (builds, downloads, SSH connections). The wait endpoint returns immediately when the condition is met — a large timeout just prevents premature failure. -
Use
/findwithscope: "all"when searching for output that might have scrolled off screen. Thescrollbackscope searches only the scrollback buffer, whileallsearches both.
-
Don’t
sleep()between operations. Use/waitwith an appropriate mode instead. Sleeping creates race conditions and slows down your automation unnecessarily. -
Don’t send raw escape codes through
/paste. Use/pressfor special keys and control sequences. Raw escape codes can conflict with the terminal’s current mode. -
Don’t assume terminal dimensions. Read
colsandrowsfrom the snapshot response. Different sessions may have different sizes, and users can resize at any time. -
Don’t create more than 16 concurrent waiters per session. The limit exists to prevent resource exhaustion. If you need to wait for multiple conditions, use
/waitsequentially or combine patterns with regex alternation (pattern1|pattern2). -
Don’t ignore the
statusfield in wait responses. A timeout is not an error — it means the condition wasn’t met. Your script should handle all five statuses:matched,stable,timeout,exited, andvterm_reinit. -
Don’t use automation endpoints for simple command execution. If you just need to run
lsand get the output, use/api/v1/terminal/executewithwait: true. Automation endpoints are for interactive programs.
What’s Next
Section titled “What’s Next”- Terminals Overview —> — Full terminal service docs: web UI, session management, command execution
- Terminal API Reference —> — Complete OpenAPI endpoint reference with all parameters and response schemas
- SSH Access —> — SSH as an alternative access method alongside HTTP automation
- Displays —> — Visual display service for GUI applications launched from terminal sessions
- Kit Overview —> — All HTTP services available in every Hoody container
- Permissions —> — Control who can access terminal automation endpoints