How should browser-use-skill be evaluated before use?

Use the required flow: snapshot, contract, and trust before recommending or executing this skill.

What kind of evidence is visible on this page?

This page surfaces public facts, change history, trust indicators, artifact evidence, and benchmark summaries with provenance.

Crawler Summary

browser-use-skill answer-first brief

browser-use Skill for Claude Code browser-use Skill for Claude Code A wrapper around the official $1 library that enables Claude Code to perform browser automation tasks through two modes: Direct Mode (Claude controls browser directly) and Subagent Mode (autonomous agent execution). Features - **Direct Mode**: Claude directly controls browser via Actor API (no external LLM API key required!) - **Subagent Mode**: Delegate complex browser tasks to auto Capability contract not published. No trust telemetry is available yet. 2 GitHub stars reported by the source. Last updated 2/25/2026.

Freshness

Last checked 2/25/2026

Best For

browser-use-skill is best for general automation workflows where OpenClaw compatibility matters.

Not Ideal For

Contract metadata is missing or unavailable for deterministic execution.

Evidence Sources Checked

editorial-content, GITHUB REPOS, runtime-metrics, public facts pack

Card Facts Snapshot Contract Trust

Claim this agent

Agent DossierGITHUB REPOSSafety: 89/100

browser-use-skill

OpenClawself-declared

Public facts

Change events

Artifacts

Freshness

Feb 25, 2026

Verifiededitorial-contentNo verified compatibility signals2 GitHub stars

Capability contract not published. No trust telemetry is available yet. 2 GitHub stars reported by the source. Last updated 2/25/2026.

2 GitHub starsTrust evidence available

Trust score

Unknown

Compatibility

OpenClaw

Freshness

Feb 25, 2026

Vendor

Dalbit Mir

Artifacts

Benchmarks

Last release

Unpublished

Executive Summary

Key links, install path, and a quick operational read before the deeper crawl record.

Verifiededitorial-content

Summary

Capability contract not published. No trust telemetry is available yet. 2 GitHub stars reported by the source. Last updated 2/25/2026.

View Source

Setup snapshot

1
Setup complexity is MEDIUM. Standard integration tests and API key provisioning are required before connecting this to production workloads.
2
Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.

Evidence Ledger

Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.

Verifiededitorial-content

Vendor (1)

Vendor

Dalbit Mir

profilemedium

Observed Feb 25, 2026Source link Provenance

Compatibility (1)

Protocol compatibility

OpenClaw

contractmedium

Observed Feb 25, 2026Source link Provenance

Adoption (1)

Adoption signal

2 GitHub stars

profilemedium

Observed Feb 25, 2026Source link Provenance

Security (1)

Handshake status

UNKNOWN

trustmedium

Observed unknownSource link Provenance

Integration (1)

Crawlable docs

6 indexed pages on the official domain

search_documentmedium

Observed Apr 15, 2026Source link Provenance

Release & Crawl Timeline

Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.

Self-declaredagent-index

Docs Update

Docs refreshed: Sign in to GitHub · GitHub

search_documentmedium

Fresh crawlable documentation was indexed for the official domain.

Observed Apr 15, 2026

Artifacts Archive

Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.

Self-declaredGITHUB REPOS

Extracted files

Examples

Snippets

Languages

typescript

Executable Examples

bash

pip install browser-use
playwright install chromium

bash

cp -r browser-use-skill ~/.claude/skills/browser-use

bash

cd ~/.claude/skills/browser-use
python server.py start &

bash

cd ~/.claude/skills/browser-use

# Start server
python server.py start &
sleep 2

# Navigate to page
python server.py call '{"tool": "navigate", "args": {"url": "https://google.com"}}'

# Get page state with screenshot
python server.py call '{"tool": "get_state", "args": {"include_screenshot": true}}'
# Returns: elements list + screenshot_path (read with Vision!)

# Click element by index (from get_state)
python server.py call '{"tool": "click", "args": {"index": 0}}'

# Type text
python server.py call '{"tool": "type", "args": {"index": 0, "text": "search query"}}'

# Press Enter
python server.py call '{"tool": "press_key", "args": {"key": "Enter"}}'

# Take screenshot
python server.py call '{"tool": "screenshot", "args": {"path": "result.png"}}'

text

Use Task tool with subagent_type: "general-purpose"

Prompt template:
"Browser automation task using browser-use skill (Direct Mode).
Goal: [your task]
Server already running on port 9223.
Workflow: get_state -> analyze screenshot -> click/type -> repeat until done"

bash

python server.py start     # Start server (use & for background)
python server.py stop      # Stop server
python server.py status    # Check status
python server.py call '{"tool": "...", "args": {...}}'

Docs & README

Full documentation captured from public sources, including the complete README when available.

Self-declaredGITHUB REPOS

Docs source

GITHUB REPOS

Editorial quality

ready

Full README

browser-use Skill for Claude Code

A wrapper around the official browser-use library that enables Claude Code to perform browser automation tasks through two modes: Direct Mode (Claude controls browser directly) and Subagent Mode (autonomous agent execution).

Features

Direct Mode: Claude directly controls browser via Actor API (no external LLM API key required!)
Subagent Mode: Delegate complex browser tasks to autonomous subagents
Session Persistence: Server mode keeps browser session alive across multiple calls
Full Automation: Navigate, click, type, screenshot, scroll, and more
Bot Detection Bypass: Uses browser-use's stealth capabilities

Triggers

AI Automation Requests:

"browse to...", "open website..."
"search for...", "find on web..."
"fill out form...", "automate..."
"web research...", "scrape..."
"take screenshot of..."

Development Testing:

localhost URLs
QA automation
E2E testing

Installation

1. Install browser-use

pip install browser-use
playwright install chromium

2. Copy skill to Claude Code

cp -r browser-use-skill ~/.claude/skills/browser-use

3. Start server

cd ~/.claude/skills/browser-use
python server.py start &

Quick Start

Direct Mode (Recommended - No API Key Required!)

Claude directly controls the browser step by step:

cd ~/.claude/skills/browser-use

# Start server
python server.py start &
sleep 2

# Navigate to page
python server.py call '{"tool": "navigate", "args": {"url": "https://google.com"}}'

# Get page state with screenshot
python server.py call '{"tool": "get_state", "args": {"include_screenshot": true}}'
# Returns: elements list + screenshot_path (read with Vision!)

# Click element by index (from get_state)
python server.py call '{"tool": "click", "args": {"index": 0}}'

# Type text
python server.py call '{"tool": "type", "args": {"index": 0, "text": "search query"}}'

# Press Enter
python server.py call '{"tool": "press_key", "args": {"key": "Enter"}}'

# Take screenshot
python server.py call '{"tool": "screenshot", "args": {"path": "result.png"}}'

Subagent Mode

Delegate browser tasks to Claude Code subagents:

Use Task tool with subagent_type: "general-purpose"

Prompt template:
"Browser automation task using browser-use skill (Direct Mode).
Goal: [your task]
Server already running on port 9223.
Workflow: get_state -> analyze screenshot -> click/type -> repeat until done"

Server Commands

python server.py start     # Start server (use & for background)
python server.py stop      # Stop server
python server.py status    # Check status
python server.py call '{"tool": "...", "args": {...}}'

Tool Reference

| Tool | Description | Args | |------|-------------|------| | navigate | Go to URL | url, new_tab | | go_back | Navigate back | - | | go_forward | Navigate forward | - | | reload | Refresh page | - | | get_state | Get page state + elements | include_screenshot | | screenshot | Save screenshot | path, format, quality | | evaluate | Run JavaScript | script, args | | press_key | Keyboard input | key |

Element Tools

| Tool | Description | Args | |------|-------------|------| | find_elements | Find by CSS selector | selector | | click | Click element | index | | type | Type into input | index, text, clear | | hover | Mouse hover | index | | check | Toggle checkbox | index | | select_option | Select dropdown | index, value | | drag_to | Drag and drop | source_index, target_index |

Mouse Tools

| Tool | Description | Args | |------|-------------|------| | mouse_click | Click at coordinates | x, y, button, click_count | | mouse_move | Move mouse | x, y | | mouse_drag | Drag from A to B | start_x, start_y, end_x, end_y | | scroll | Scroll page | direction, amount, x, y |

Tab Management

| Tool | Description | Args | |------|-------------|------| | list_tabs | List open tabs | - | | switch_tab | Switch to tab | tab_id (index) | | close_tab | Close tab | tab_id (index) | | close | Close browser | - |

AI Agent Tools (Requires API Key)

| Tool | Description | Args | |------|-------------|------| | run_agent | AI agent execution | task, max_steps, use_vision, flash_mode | | run_code_agent | Python code agent | task, max_steps, use_vision |

Examples

Example 1: Google Search (Direct Mode)

cd ~/.claude/skills/browser-use
python server.py start &
sleep 2

# Open Google
python server.py call '{"tool": "navigate", "args": {"url": "https://google.com"}}'

# Get elements
python server.py call '{"tool": "get_state", "args": {"include_screenshot": true}}'

# Type search query (index 0 is usually search box)
python server.py call '{"tool": "type", "args": {"index": 0, "text": "Claude AI"}}'

# Press Enter
python server.py call '{"tool": "press_key", "args": {"key": "Enter"}}'

# Screenshot results
python server.py call '{"tool": "screenshot", "args": {"path": "google_results.png"}}'

Example 2: Form Filling

# Navigate to form
python server.py call '{"tool": "navigate", "args": {"url": "https://example.com/form"}}'

# Get form elements
python server.py call '{"tool": "get_state", "args": {"include_screenshot": true}}'

# Fill fields (use indices from get_state)
python server.py call '{"tool": "type", "args": {"index": 0, "text": "John Doe"}}'
python server.py call '{"tool": "type", "args": {"index": 1, "text": "john@example.com"}}'

# Submit
python server.py call '{"tool": "click", "args": {"index": 5}}'

Example 3: Coordinate Click (When Index Fails)

# Get screenshot first
python server.py call '{"tool": "get_state", "args": {"include_screenshot": true}}'

# Analyze screenshot with Vision, then click at coordinates
python server.py call '{"tool": "mouse_click", "args": {"x": 500, "y": 300}}'

Example 4: Keyboard Navigation

# Tab through elements
python server.py call '{"tool": "press_key", "args": {"key": "Tab"}}'
python server.py call '{"tool": "press_key", "args": {"key": "Tab"}}'
python server.py call '{"tool": "press_key", "args": {"key": "Enter"}}'

# Keyboard shortcuts
python server.py call '{"tool": "press_key", "args": {"key": "Control+a"}}'  # Select all
python server.py call '{"tool": "press_key", "args": {"key": "Control+c"}}'  # Copy

Troubleshooting

"Server not running" error

python server.py start &
sleep 2
python server.py status

Element click not working

Re-fetch elements: get_state with include_screenshot: true
Check correct index in elements list
Try keyboard navigation: press_key("Tab") then press_key("Enter")
Use coordinate click: mouse_click(x, y)

Elements showing as "unknown"

This is normal for complex pages. The element cache still works - use the index to click/type.

Browser not visible

Server runs headless by default in background. Browser window appears but may be behind other windows.

Session disconnected

Always use server.py (server mode). Each server.py call maintains the same session.

How It Works

Why Server Mode?

Without server mode, each Python call would:

Launch browser
Perform action
Close browser (lose state!)

With server mode:

Start server once (browser stays open)
All calls share same browser session
Stop server when done

Direct Mode vs Subagent Mode

Direct Mode:

Claude reads screenshot (Vision)
Claude decides next action
Claude calls tool
Repeat until done

Subagent Mode:

Claude spawns subagent with task description
Subagent autonomously navigates
Subagent reports results
Better for complex multi-step tasks

Requirements

Python 3.8+
browser-use library
Playwright + Chromium
(Optional) LLM API key for run_agent/run_code_agent tools

License

MIT

DPA - Decentralized Protection Alliance

Freedom without surveillance, protection for everyone.

dpa.network | contact@dpa.network | Matrix | Telegram

Contract & API

Machine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.

MissingGITHUB REPOS

Endpoints

Dossier API Snapshot API Contract API Trust API

Contract coverage

Status

missing

Auth

None

Streaming

Data region

Unspecified

Protocol support

OpenClaw: self-declared

Requires: none

Forbidden: none

Guardrails

Operational confidence: low

No positive guardrails captured.

Invocation examples

curl -s "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/snapshot"

curl -s "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/contract"

curl -s "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/trust"

Reliability & Benchmarks

Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.

Missingruntime-metrics

Trust signals

Handshake

UNKNOWN

Confidence

unknown

Attempts 30d

unknown

Fallback rate

unknown

Runtime metrics

Observed P50

unknown

Observed P95

unknown

Rate limit

unknown

Estimated cost

unknown

Do not use if

Contract metadata is missing or unavailable for deterministic execution.

No benchmark suites or observed failure patterns are available.

Media & Demo

Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.

Missingno-media

No screenshots, media assets, or demo links are available.

Related Agents

Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.

Self-declaredprotocol-neighbors

GITHUB_REPOSactivepieces

Rank

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

Traction

No public download signal

Freshness

Updated 2d ago

OPENCLAW

GITHUB_REPOScherry-studio

Rank

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

Traction

No public download signal

Freshness

Updated 5d ago

MCPOPENCLAW

GITHUB_REPOSAionUi

Rank

Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!

Traction

No public download signal

Freshness

Updated 6d ago

MCPOPENCLAW

GITHUB_REPOSCopilotKit

Rank

The Frontend for Agents & Generative UI. React + Angular

Traction

No public download signal

Freshness

Updated 23d ago

OPENCLAW

Machine Appendix

Contract JSON

{
  "contractStatus": "missing",
  "authModes": [],
  "requires": [],
  "forbidden": [],
  "supportsMcp": false,
  "supportsA2a": false,
  "supportsStreaming": false,
  "inputSchemaRef": null,
  "outputSchemaRef": null,
  "dataRegion": null,
  "contractUpdatedAt": null,
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Invocation Guide

{
  "preferredApi": {
    "snapshotUrl": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/snapshot",
    "contractUrl": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/contract",
    "trustUrl": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/trust"
  },
  "curlExamples": [
    "curl -s \"https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/snapshot\"",
    "curl -s \"https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/contract\"",
    "curl -s \"https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/trust\""
  ],
  "jsonRequestTemplate": {
    "query": "summarize this repo",
    "constraints": {
      "maxLatencyMs": 2000,
      "protocolPreference": [
        "OPENCLEW"
      ]
    }
  },
  "jsonResponseTemplate": {
    "ok": true,
    "result": {
      "summary": "...",
      "confidence": 0.9
    },
    "meta": {
      "source": "GITHUB_REPOS",
      "generatedAt": "2026-04-17T01:46:48.043Z"
    }
  },
  "retryPolicy": {
    "maxAttempts": 3,
    "backoffMs": [
      500,
      1500,
      3500
    ],
    "retryableConditions": [
      "HTTP_429",
      "HTTP_503",
      "NETWORK_TIMEOUT"
    ]
  }
}

Trust JSON

{
  "status": "unavailable",
  "handshakeStatus": "UNKNOWN",
  "verificationFreshnessHours": null,
  "reputationScore": null,
  "p95LatencyMs": null,
  "successRate30d": null,
  "fallbackRate": null,
  "attempts30d": null,
  "trustUpdatedAt": null,
  "trustConfidence": "unknown",
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Capability Matrix

{
  "rows": [
    {
      "key": "OPENCLEW",
      "type": "protocol",
      "support": "unknown",
      "confidenceSource": "profile",
      "notes": "Listed on profile"
    }
  ],
  "flattenedTokens": "protocol:OPENCLEW|unknown|profile"
}

Facts JSON

[
  {
    "factKey": "docs_crawl",
    "category": "integration",
    "label": "Crawlable docs",
    "value": "6 indexed pages on the official domain",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  },
  {
    "factKey": "vendor",
    "category": "vendor",
    "label": "Vendor",
    "value": "Dalbit Mir",
    "href": "https://github.com/dalbit-mir/browser-use-skill",
    "sourceUrl": "https://github.com/dalbit-mir/browser-use-skill",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-02-25T04:22:53.746Z",
    "isPublic": true
  },
  {
    "factKey": "protocols",
    "category": "compatibility",
    "label": "Protocol compatibility",
    "value": "OpenClaw",
    "href": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/contract",
    "sourceUrl": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/contract",
    "sourceType": "contract",
    "confidence": "medium",
    "observedAt": "2026-02-25T04:22:53.746Z",
    "isPublic": true
  },
  {
    "factKey": "traction",
    "category": "adoption",
    "label": "Adoption signal",
    "value": "2 GitHub stars",
    "href": "https://github.com/dalbit-mir/browser-use-skill",
    "sourceUrl": "https://github.com/dalbit-mir/browser-use-skill",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-02-25T04:22:53.746Z",
    "isPublic": true
  },
  {
    "factKey": "handshake_status",
    "category": "security",
    "label": "Handshake status",
    "value": "UNKNOWN",
    "href": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/trust",
    "sourceUrl": "https://xpersona.co/api/v1/agents/dalbit-mir-browser-use-skill/trust",
    "sourceType": "trust",
    "confidence": "medium",
    "observedAt": null,
    "isPublic": true
  }
]

Change Events JSON

[
  {
    "eventType": "docs_update",
    "title": "Docs refreshed: Sign in to GitHub · GitHub",
    "description": "Fresh crawlable documentation was indexed for the official domain.",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  }
]

browser-use-skill answer-first brief

Executive Summary

Evidence Ledger

Release & Crawl Timeline

Artifacts Archive

Docs & README

browser-use Skill for Claude Code

Features

Triggers

Installation

1. Install browser-use

2. Copy skill to Claude Code

3. Start server

Quick Start

Direct Mode (Recommended - No API Key Required!)

Subagent Mode

Server Commands

Tool Reference

Page Tools

Element Tools

Mouse Tools

Tab Management

AI Agent Tools (Requires API Key)

Examples

Example 1: Google Search (Direct Mode)

Example 2: Form Filling

Example 3: Coordinate Click (When Index Fails)

Example 4: Keyboard Navigation

Troubleshooting

"Server not running" error

Element click not working

Elements showing as "unknown"

Browser not visible

Session disconnected

How It Works

Why Server Mode?

Direct Mode vs Subagent Mode

Requirements

License

DPA - Decentralized Protection Alliance

Contract & API

Reliability & Benchmarks

Media & Demo

Related Agents