Rank
70
AI Agents & MCPs & AI Workflow Automation โข (~400 MCP servers for AI agents) โข AI Automation / AI Agent with MCPs โข AI Workflows & AI Agents โข MCPs for AI Agents
Traction
No public download signal
Freshness
Updated 2d ago
Xpersona Agent
Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. --- name: Browser Use description: > Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. read_when: - Automating web interactions beyond simple fetch - Filling forms or completing mul
git clone https://github.com/quentintou/openclaw-skill-browser-use.gitOverall rank
#33
Adoption
4 GitHub stars
Trust
Unknown
Freshness
Apr 14, 2026
Freshness
Last checked Apr 14, 2026
Best For
Browser Use is best for swap workflows where OpenClaw compatibility matters.
Not Ideal For
Contract metadata is missing or unavailable for deterministic execution.
Evidence Sources Checked
editorial-content, GITHUB OPENCLEW, runtime-metrics, public facts pack
Key links, install path, reliability highlights, and the shortest practical read before diving into the crawl record.
Overview
Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. --- name: Browser Use description: > Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. read_when: - Automating web interactions beyond simple fetch - Filling forms or completing mul Capability contract not published. No trust telemetry is available yet. 4 GitHub stars reported by the source. Last updated 4/15/2026.
Trust score
Unknown
Compatibility
OpenClaw
Freshness
Apr 14, 2026
Vendor
Quentintou
Artifacts
0
Benchmarks
0
Last release
Unpublished
Install & run
git clone https://github.com/quentintou/openclaw-skill-browser-use.gitSetup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.
Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.
Public facts grouped by evidence type, plus release and crawl events with provenance and freshness.
Public facts
Vendor
Quentintou
Protocol compatibility
OpenClaw
Adoption signal
4 GitHub stars
Handshake status
UNKNOWN
Crawlable docs
6 indexed pages on the official domain
Parameters, dependencies, examples, extracted files, editorial overview, and the complete README when available.
Captured outputs
Extracted files
0
Examples
6
Snippets
0
Languages
typescript
Parameters
bash
# Navigate and inspect agent-browser open "https://example.com" agent-browser snapshot -i # Get interactive elements with @refs # Interact using refs agent-browser click @e3 # Click element agent-browser fill @e2 "text" # Fill input (clears first) agent-browser press Enter # Press key # Extract data agent-browser get text @e1 # Get element text agent-browser get attr @e1 href # Get attribute agent-browser screenshot /tmp/p.png # Screenshot # Done agent-browser close
bash
# Run a full autonomous browsing task browser-use-agent "Find the pricing for Notion and compare plans"
bash
agent-browser open <url> # Navigate to URL agent-browser back # Go back agent-browser forward # Go forward agent-browser reload # Reload page agent-browser close # Close browser
bash
agent-browser snapshot # Full accessibility tree agent-browser snapshot -i # Interactive elements only (recommended) agent-browser snapshot -c # Compact output agent-browser snapshot -d 3 # Limit depth to 3 agent-browser snapshot -s "#main" # Scope to CSS selector agent-browser snapshot -i --json # JSON output for parsing
bash
agent-browser click @e1 # Click agent-browser dblclick @e1 # Double-click agent-browser fill @e2 "text" # Clear and type (use this for inputs) agent-browser type @e2 "text" # Type without clearing agent-browser press Enter # Press key agent-browser press Control+a # Key combination agent-browser hover @e1 # Hover agent-browser check @e1 # Check checkbox agent-browser uncheck @e1 # Uncheck checkbox agent-browser select @e1 "value" # Select dropdown option agent-browser scroll down 500 # Scroll page agent-browser scrollintoview @e1 # Scroll element into view agent-browser drag @e1 @e2 # Drag and drop agent-browser upload @e1 file.pdf # Upload files
bash
agent-browser get text @e1 # Get element text agent-browser get html @e1 # Get innerHTML agent-browser get value @e1 # Get input value agent-browser get attr @e1 href # Get attribute agent-browser get title # Page title agent-browser get url # Current URL agent-browser get count ".item" # Count matching elements
Editorial read
Docs source
GITHUB OPENCLEW
Editorial quality
ready
Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. --- name: Browser Use description: > Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. read_when: - Automating web interactions beyond simple fetch - Filling forms or completing mul
name: Browser Use description: > Autonomous browser automation for AI agents. Two tools: agent-browser (CLI Playwright for step-by-step control) and browser-use (Python autonomous agent that decides what to do on pages). Navigate, click, fill forms, scrape data, manage sessions, and run complex multi-step browser tasks. read_when:
Two complementary tools for browser automation:
| Tool | Best for | How it works | |------|----------|-------------| | agent-browser | Step-by-step control, scraping, form filling | CLI commands, you drive each action | | browser-use | Complex autonomous tasks | Python agent that decides actions itself |
# Navigate and inspect
agent-browser open "https://example.com"
agent-browser snapshot -i # Get interactive elements with @refs
# Interact using refs
agent-browser click @e3 # Click element
agent-browser fill @e2 "text" # Fill input (clears first)
agent-browser press Enter # Press key
# Extract data
agent-browser get text @e1 # Get element text
agent-browser get attr @e1 href # Get attribute
agent-browser screenshot /tmp/p.png # Screenshot
# Done
agent-browser close
# Run a full autonomous browsing task
browser-use-agent "Find the pricing for Notion and compare plans"
The agent will navigate, click, read pages, and return a structured result.
agent-browser open <url> # Navigate to URL
agent-browser back # Go back
agent-browser forward # Go forward
agent-browser reload # Reload page
agent-browser close # Close browser
agent-browser snapshot # Full accessibility tree
agent-browser snapshot -i # Interactive elements only (recommended)
agent-browser snapshot -c # Compact output
agent-browser snapshot -d 3 # Limit depth to 3
agent-browser snapshot -s "#main" # Scope to CSS selector
agent-browser snapshot -i --json # JSON output for parsing
agent-browser click @e1 # Click
agent-browser dblclick @e1 # Double-click
agent-browser fill @e2 "text" # Clear and type (use this for inputs)
agent-browser type @e2 "text" # Type without clearing
agent-browser press Enter # Press key
agent-browser press Control+a # Key combination
agent-browser hover @e1 # Hover
agent-browser check @e1 # Check checkbox
agent-browser uncheck @e1 # Uncheck checkbox
agent-browser select @e1 "value" # Select dropdown option
agent-browser scroll down 500 # Scroll page
agent-browser scrollintoview @e1 # Scroll element into view
agent-browser drag @e1 @e2 # Drag and drop
agent-browser upload @e1 file.pdf # Upload files
agent-browser get text @e1 # Get element text
agent-browser get html @e1 # Get innerHTML
agent-browser get value @e1 # Get input value
agent-browser get attr @e1 href # Get attribute
agent-browser get title # Page title
agent-browser get url # Current URL
agent-browser get count ".item" # Count matching elements
agent-browser wait @e1 # Wait for element
agent-browser wait 2000 # Wait milliseconds
agent-browser wait --text "Done" # Wait for text to appear
agent-browser wait --url "/dash" # Wait for URL pattern
agent-browser wait --load networkidle # Wait for network idle
agent-browser screenshot path.png # Save screenshot
agent-browser screenshot --full # Full page screenshot
agent-browser pdf output.pdf # Save as PDF
agent-browser record start ./demo.webm # Start recording
agent-browser record stop # Stop and save
agent-browser --session s1 open "https://site1.com"
agent-browser --session s2 open "https://site2.com"
agent-browser session list
agent-browser state save auth.json # Save session (cookies, storage)
agent-browser state load auth.json # Restore session
agent-browser cookies # Get all cookies
agent-browser cookies set name value # Set cookie
agent-browser cookies clear # Clear cookies
agent-browser storage local # Get all localStorage
agent-browser storage local set k v # Set value
agent-browser tab # List tabs
agent-browser tab new [url] # New tab
agent-browser tab 2 # Switch to tab
agent-browser frame "#iframe" # Switch to iframe
agent-browser frame main # Back to main frame
agent-browser set viewport 1920 1080
agent-browser set device "iPhone 14"
agent-browser set geo 37.7749 -122.4194
agent-browser set offline on
agent-browser set media dark
agent-browser eval "document.title" # Run JS in page context
For complex tasks where you want the agent to figure out the browsing steps:
browser-use-agent "Your task description here"
# Run via: /opt/browser-use/bin/python3 script.py
import asyncio, os
from browser_use import Agent, Browser
from langchain_anthropic import ChatAnthropic
async def run():
browser = Browser()
llm = ChatAnthropic(
model='claude-sonnet-4-20250514',
api_key=os.environ['ANTHROPIC_API_KEY']
)
agent = Agent(
task="Compare pricing on 3 competitor sites",
llm=llm,
browser=browser,
)
result = await agent.run(max_steps=15)
await browser.close()
return result
asyncio.run(run())
You can swap the LLM for any langchain-compatible model (OpenAI, Anthropic, etc).
# 1. Open page
agent-browser open "https://example.com"
# 2. Snapshot to see what's on the page
agent-browser snapshot -i
# 3. Interact with elements using @refs from snapshot
agent-browser fill @e1 "search query"
agent-browser click @e2
# 4. Wait for new page to load
agent-browser wait --load networkidle
# 5. Re-snapshot (refs change after navigation!)
agent-browser snapshot -i
# 6. Extract what you need
agent-browser get text @e5
# 7. Close when done
agent-browser close
snapshot -i after navigation โ refs change on every page loadfill not type for inputs โ fill clears existing text firstwait --load networkidleagent-browser closeweb_search insteadstate save/load--json when you need machine-parseable output--session <name>snapshot -i to get current refswait --load networkidle after navigationweb_search tool insteadstate save againMachine endpoints, contract coverage, trust signals, runtime metrics, benchmarks, and guardrails for agent-to-agent use.
Machine interfaces
Contract coverage
Status
missing
Auth
None
Streaming
No
Data region
Unspecified
Protocol support
Requires: none
Forbidden: none
Guardrails
Operational confidence: low
curl -s "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/snapshot"
curl -s "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/contract"
curl -s "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/trust"
Operational fit
Trust signals
Handshake
UNKNOWN
Confidence
unknown
Attempts 30d
unknown
Fallback rate
unknown
Runtime metrics
Observed P50
unknown
Observed P95
unknown
Rate limit
unknown
Estimated cost
unknown
Do not use if
Raw contract, invocation, trust, capability, facts, and change-event payloads for machine-side inspection.
Contract JSON
{
"contractStatus": "missing",
"authModes": [],
"requires": [],
"forbidden": [],
"supportsMcp": false,
"supportsA2a": false,
"supportsStreaming": false,
"inputSchemaRef": null,
"outputSchemaRef": null,
"dataRegion": null,
"contractUpdatedAt": null,
"sourceUpdatedAt": null,
"freshnessSeconds": null
}Invocation Guide
{
"preferredApi": {
"snapshotUrl": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/snapshot",
"contractUrl": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/contract",
"trustUrl": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/trust"
},
"curlExamples": [
"curl -s \"https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/snapshot\"",
"curl -s \"https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/contract\"",
"curl -s \"https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/trust\""
],
"jsonRequestTemplate": {
"query": "summarize this repo",
"constraints": {
"maxLatencyMs": 2000,
"protocolPreference": [
"OPENCLEW"
]
}
},
"jsonResponseTemplate": {
"ok": true,
"result": {
"summary": "...",
"confidence": 0.9
},
"meta": {
"source": "GITHUB_OPENCLEW",
"generatedAt": "2026-04-17T01:58:12.651Z"
}
},
"retryPolicy": {
"maxAttempts": 3,
"backoffMs": [
500,
1500,
3500
],
"retryableConditions": [
"HTTP_429",
"HTTP_503",
"NETWORK_TIMEOUT"
]
}
}Trust JSON
{
"status": "unavailable",
"handshakeStatus": "UNKNOWN",
"verificationFreshnessHours": null,
"reputationScore": null,
"p95LatencyMs": null,
"successRate30d": null,
"fallbackRate": null,
"attempts30d": null,
"trustUpdatedAt": null,
"trustConfidence": "unknown",
"sourceUpdatedAt": null,
"freshnessSeconds": null
}Capability Matrix
{
"rows": [
{
"key": "OPENCLEW",
"type": "protocol",
"support": "unknown",
"confidenceSource": "profile",
"notes": "Listed on profile"
},
{
"key": "swap",
"type": "capability",
"support": "supported",
"confidenceSource": "profile",
"notes": "Declared in agent profile metadata"
}
],
"flattenedTokens": "protocol:OPENCLEW|unknown|profile capability:swap|supported|profile"
}Facts JSON
[
{
"factKey": "vendor",
"category": "vendor",
"label": "Vendor",
"value": "Quentintou",
"href": "https://github.com/quentintou/openclaw-skill-browser-use",
"sourceUrl": "https://github.com/quentintou/openclaw-skill-browser-use",
"sourceType": "profile",
"confidence": "medium",
"observedAt": "2026-04-15T05:21:22.124Z",
"isPublic": true
},
{
"factKey": "protocols",
"category": "compatibility",
"label": "Protocol compatibility",
"value": "OpenClaw",
"href": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/contract",
"sourceUrl": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/contract",
"sourceType": "contract",
"confidence": "medium",
"observedAt": "2026-04-15T05:21:22.124Z",
"isPublic": true
},
{
"factKey": "traction",
"category": "adoption",
"label": "Adoption signal",
"value": "4 GitHub stars",
"href": "https://github.com/quentintou/openclaw-skill-browser-use",
"sourceUrl": "https://github.com/quentintou/openclaw-skill-browser-use",
"sourceType": "profile",
"confidence": "medium",
"observedAt": "2026-04-15T05:21:22.124Z",
"isPublic": true
},
{
"factKey": "docs_crawl",
"category": "integration",
"label": "Crawlable docs",
"value": "6 indexed pages on the official domain",
"href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceType": "search_document",
"confidence": "medium",
"observedAt": "2026-04-15T05:03:46.393Z",
"isPublic": true
},
{
"factKey": "handshake_status",
"category": "security",
"label": "Handshake status",
"value": "UNKNOWN",
"href": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/trust",
"sourceUrl": "https://xpersona.co/api/v1/agents/quentintou-openclaw-skill-browser-use/trust",
"sourceType": "trust",
"confidence": "medium",
"observedAt": null,
"isPublic": true
}
]Change Events JSON
[
{
"eventType": "docs_update",
"title": "Docs refreshed: Sign in to GitHub ยท GitHub",
"description": "Fresh crawlable documentation was indexed for the official domain.",
"href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceType": "search_document",
"confidence": "medium",
"observedAt": "2026-04-15T05:03:46.393Z",
"isPublic": true
}
]Sponsored
Ads related to Browser Use and adjacent AI workflows.