Files

T

Pulse 22d9f5b21d feat: skill expansion — browser, security, SQL, files (16 skills total)

Novas skills instaladas:
- openclaw-agent-browser   v1.0.0  CLI Chromium — navegação, login, screenshots, state
- skill-security-audit    v1.0.0  SAST scanning, prompt injection, secrets audit
- sql-toolkit             v1.0.0  PostgreSQL/MySQL/SQLite — schema, query, otimização
- file                    v1.0.0  Organização de arquivos por contexto
- file-summary            v1.0.0  Extração e resumo de PDFs, Word, Excel

Workspace expandido:
- TOOLS.md: +Browser automation, Security audit, SQL, File management
- AGENTS.md: +Linux Analyst section (comandos, logs, rede, scripts) + Full-stack strategy
- MEMORY.md: 16 skills indexadas, stack map, comandos Linux ref
- SESSION-STATE.md: atualizado com contexto completo
- lock.json: sincronizado com 16 skills instaladas

2026-05-19 20:51:05 -03:00

4.8 KiB

Raw Permalink Blame History

agent-browser Command Reference

agent-browser open <url>              # Navigate (aliases: goto, navigate)
agent-browser close                   # Close browser (aliases: quit, exit)

Snapshot (Primary Way to See the Page)

agent-browser snapshot -i             # Interactive elements with refs (@e1, @e2...)
agent-browser snapshot -i -C          # Include cursor-interactive elements (onclick divs)
agent-browser snapshot -s "#selector" # Scope to CSS selector
agent-browser snapshot -i --json      # JSON output for parsing

Interaction (Use @refs from snapshot)

agent-browser click @e1               # Click element
agent-browser click @e1 --new-tab     # Click and open in new tab
agent-browser dblclick @e1            # Double-click
agent-browser fill @e2 "text"         # Clear and type text
agent-browser type @e2 "text"         # Type without clearing
agent-browser select @e1 "option"     # Select dropdown option
agent-browser check @e1               # Check checkbox
agent-browser uncheck @e1             # Uncheck checkbox
agent-browser press Enter             # Press key
agent-browser keyboard type "text"    # Type at current focus (no selector)
agent-browser scroll down 500         # Scroll page
agent-browser scroll down 500 --selector "div.content"  # Scroll within container
agent-browser drag @e1 @e2            # Drag and drop
agent-browser upload @e1 file.pdf     # Upload files
agent-browser hover @e1               # Hover element

Get Information

agent-browser get text @e1            # Get element text
agent-browser get text body > page.txt # Get all page text
agent-browser get html @e1            # Get innerHTML
agent-browser get url                 # Get current URL
agent-browser get title               # Get page title
agent-browser get text @e1 --json     # JSON output

Wait

agent-browser wait @e1                # Wait for element
agent-browser wait --load networkidle # Wait for network idle
agent-browser wait --url "**/page"    # Wait for URL pattern
agent-browser wait --fn "document.readyState === 'complete'"  # JS condition
agent-browser wait 2000               # Wait milliseconds

Downloads

agent-browser download @e1 ./file.pdf          # Click to trigger download
agent-browser wait --download ./output.zip     # Wait for download
agent-browser --download-path ./downloads open <url>  # Set download dir

Capture

agent-browser screenshot              # Screenshot to temp dir
agent-browser screenshot page.png     # Screenshot to path
agent-browser screenshot --full       # Full page screenshot
agent-browser screenshot --annotate   # Annotated with numbered labels
agent-browser pdf output.pdf          # Save as PDF

Diff (Compare Page States)

agent-browser diff snapshot                          # Current vs last snapshot
agent-browser diff snapshot --baseline before.txt    # Current vs saved file
agent-browser diff screenshot --baseline before.png  # Visual pixel diff
agent-browser diff url <url1> <url2>                 # Compare two pages

Sessions

agent-browser --session site1 open https://site-a.com  # Named session
agent-browser session list                              # List active sessions
agent-browser --session site1 close                     # Close specific session

Auth Vault

echo "pass" | agent-browser auth save github --url https://github.com/login --username user --password-stdin
agent-browser auth login github       # Login using saved profile
agent-browser auth list               # List profiles
agent-browser auth delete github      # Delete profile

State Persistence

agent-browser state save auth.json    # Save cookies/localStorage
agent-browser state load auth.json    # Restore state
agent-browser --session-name myapp open <url>  # Auto-save/restore
agent-browser state list              # List saved states
agent-browser state clean --older-than 7  # Cleanup old states

Security

# Content boundaries (recommended for AI agents)
export AGENT_BROWSER_CONTENT_BOUNDARIES=1

# Domain allowlist
export AGENT_BROWSER_ALLOWED_DOMAINS="example.com,*.example.com"

# Action policy
export AGENT_BROWSER_ACTION_POLICY=./policy.json

# Output limits
export AGENT_BROWSER_MAX_OUTPUT=50000

Debugging

agent-browser --headed open <url>     # Visual browser
agent-browser highlight @e1           # Highlight element
agent-browser record start demo.webm  # Record session
agent-browser eval "document.title"   # Run JavaScript

Connect to Existing Chrome

agent-browser --auto-connect open <url>  # Auto-discover Chrome
agent-browser --cdp 9222 snapshot        # Explicit CDP port

Local Files

agent-browser --allow-file-access open file:///path/to/doc.pdf

4.8 KiB Raw Permalink Blame History

agent-browser Command Reference

Navigation

Snapshot (Primary Way to See the Page)

Interaction (Use @refs from snapshot)

Get Information

Wait

Downloads

Capture

Diff (Compare Page States)

Sessions

Auth Vault

State Persistence

Security

Debugging

Connect to Existing Chrome

Local Files

4.8 KiB

Raw Permalink Blame History