Rank
70
AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents
Traction
No public download signal
Freshness
Updated 2d ago
Crawler Summary
Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- name: sitecrawl description: Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- sitecrawl Skill Use this skill to collect high-quality website content for downstream agent workflows. When To Use - You need a reproducible crawl of a single domain. - You need content files Published capability contract available. No trust telemetry is available yet. Last updated 3/1/2026.
Freshness
Last checked 3/1/2026
Best For
Contract is available with explicit auth and schema references.
Not Ideal For
sitecrawl is not ideal for teams that need stronger public trust telemetry, lower setup complexity, or more explicit contract coverage before production rollout.
Evidence Sources Checked
editorial-content, capability-contract, runtime-metrics, public facts pack
Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- name: sitecrawl description: Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- sitecrawl Skill Use this skill to collect high-quality website content for downstream agent workflows. When To Use - You need a reproducible crawl of a single domain. - You need content files
Public facts
6
Change events
1
Artifacts
0
Freshness
Mar 1, 2026
Published capability contract available. No trust telemetry is available yet. Last updated 3/1/2026.
Trust score
Unknown
Compatibility
OpenClaw
Freshness
Mar 1, 2026
Vendor
Sbstnerhrdt
Artifacts
0
Benchmarks
0
Last release
Unpublished
Key links, install path, and a quick operational read before the deeper crawl record.
Summary
Published capability contract available. No trust telemetry is available yet. Last updated 3/1/2026.
Setup snapshot
git clone https://github.com/SbstnErhrdt/sitecrawl.gitSetup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.
Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.
Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.
Vendor
Sbstnerhrdt
Protocol compatibility
OpenClaw
Auth modes
api_key
Machine-readable schemas
OpenAPI or schema references published
Handshake status
UNKNOWN
Crawlable docs
6 indexed pages on the official domain
Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.
Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.
Extracted files
0
Examples
2
Snippets
0
Languages
typescript
Parameters
sh
sitecrawl crawl --domain <domain> --format md --out <out_dir> --strategy pagerank --max-pages 100
sh
./scripts/run_crawl.sh --domain <domain> --format md --out <out_dir> --strategy pagerank --max-pages 100
Full documentation captured from public sources, including the complete README when available.
Docs source
GITHUB OPENCLEW
Editorial quality
ready
Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- name: sitecrawl description: Use this skill when you need to crawl a public website domain and produce agent-ready content files plus a structured report with URL/title/description metadata and optional PageRank scoring. --- sitecrawl Skill Use this skill to collect high-quality website content for downstream agent workflows. When To Use - You need a reproducible crawl of a single domain. - You need content files
Use this skill to collect high-quality website content for downstream agent workflows.
md, html, or json) plus a machine-readable report.url, title, description) for triage, ranking, and routing.<domain> + www.<domain>.Required:
domain (example: example.com)out directoryformat (md, html, json)Optional:
strategy (pagerank, limit, depth)max-pagesmax-depthcleanheadfuldelay-mspage-timeoutuser-agentlogsitecrawl crawl --domain <domain> --format md --out <out_dir> --strategy pagerank --max-pages 100
Or through bundled helper script:
./scripts/run_crawl.sh --domain <domain> --format md --out <out_dir> --strategy pagerank --max-pages 100
report.json as the primary index for downstream steps.status=ok and highest score (pagerank strategy).For architecture and release details, read:
docs/ARCHITECTURE.mddocs/RELEASES.mdExpect these outputs in <out_dir>:
*.md, *.html, or *.json)report.jsonreport.json contains:
domain, strategy, timings, options)urltitledescriptionfinal_urlstatusout_pathlinks_countscore (if strategy is pagerank)visited, errors, skipped counters)After crawl:
errors in report.json are acceptablepages[]url, title, description)max-pages or markdown format for cleaner contentMachine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.
Contract coverage
Status
ready
Auth
api_key
Streaming
Yes
Data region
global
Protocol support
Requires: openclew, lang:typescript, streaming
Forbidden: none
Guardrails
Operational confidence: medium
curl -s "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/snapshot"
curl -s "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract"
curl -s "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/trust"
Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.
Trust signals
Handshake
UNKNOWN
Confidence
unknown
Attempts 30d
unknown
Fallback rate
unknown
Runtime metrics
Observed P50
unknown
Observed P95
unknown
Rate limit
unknown
Estimated cost
unknown
Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.
Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.
Rank
70
AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents
Traction
No public download signal
Freshness
Updated 2d ago
Rank
70
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
Traction
No public download signal
Freshness
Updated 5d ago
Rank
70
Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!
Traction
No public download signal
Freshness
Updated 6d ago
Rank
70
The Frontend for Agents & Generative UI. React + Angular
Traction
No public download signal
Freshness
Updated 23d ago
Contract JSON
{
"contractStatus": "ready",
"authModes": [
"api_key"
],
"requires": [
"openclew",
"lang:typescript",
"streaming"
],
"forbidden": [],
"supportsMcp": false,
"supportsA2a": false,
"supportsStreaming": true,
"inputSchemaRef": "https://github.com/SbstnErhrdt/sitecrawl#input",
"outputSchemaRef": "https://github.com/SbstnErhrdt/sitecrawl#output",
"dataRegion": "global",
"contractUpdatedAt": "2026-02-24T19:41:43.736Z",
"sourceUpdatedAt": "2026-02-24T19:41:43.736Z",
"freshnessSeconds": 4425187
}Invocation Guide
{
"preferredApi": {
"snapshotUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/snapshot",
"contractUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"trustUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/trust"
},
"curlExamples": [
"curl -s \"https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/snapshot\"",
"curl -s \"https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract\"",
"curl -s \"https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/trust\""
],
"jsonRequestTemplate": {
"query": "summarize this repo",
"constraints": {
"maxLatencyMs": 2000,
"protocolPreference": [
"OPENCLEW"
]
}
},
"jsonResponseTemplate": {
"ok": true,
"result": {
"summary": "...",
"confidence": 0.9
},
"meta": {
"source": "GITHUB_OPENCLEW",
"generatedAt": "2026-04-17T00:54:51.285Z"
}
},
"retryPolicy": {
"maxAttempts": 3,
"backoffMs": [
500,
1500,
3500
],
"retryableConditions": [
"HTTP_429",
"HTTP_503",
"NETWORK_TIMEOUT"
]
}
}Trust JSON
{
"status": "unavailable",
"handshakeStatus": "UNKNOWN",
"verificationFreshnessHours": null,
"reputationScore": null,
"p95LatencyMs": null,
"successRate30d": null,
"fallbackRate": null,
"attempts30d": null,
"trustUpdatedAt": null,
"trustConfidence": "unknown",
"sourceUpdatedAt": null,
"freshnessSeconds": null
}Capability Matrix
{
"rows": [
{
"key": "OPENCLEW",
"type": "protocol",
"support": "unknown",
"confidenceSource": "profile",
"notes": "Listed on profile"
}
],
"flattenedTokens": "protocol:OPENCLEW|unknown|profile"
}Facts JSON
[
{
"factKey": "docs_crawl",
"category": "integration",
"label": "Crawlable docs",
"value": "6 indexed pages on the official domain",
"href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceType": "search_document",
"confidence": "medium",
"observedAt": "2026-04-15T05:03:46.393Z",
"isPublic": true
},
{
"factKey": "vendor",
"category": "vendor",
"label": "Vendor",
"value": "Sbstnerhrdt",
"href": "https://github.com/SbstnErhrdt/sitecrawl",
"sourceUrl": "https://github.com/SbstnErhrdt/sitecrawl",
"sourceType": "profile",
"confidence": "medium",
"observedAt": "2026-03-01T06:02:26.566Z",
"isPublic": true
},
{
"factKey": "protocols",
"category": "compatibility",
"label": "Protocol compatibility",
"value": "OpenClaw",
"href": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"sourceUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"sourceType": "contract",
"confidence": "medium",
"observedAt": "2026-02-24T19:41:43.736Z",
"isPublic": true
},
{
"factKey": "auth_modes",
"category": "compatibility",
"label": "Auth modes",
"value": "api_key",
"href": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"sourceUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"sourceType": "contract",
"confidence": "high",
"observedAt": "2026-02-24T19:41:43.736Z",
"isPublic": true
},
{
"factKey": "schema_refs",
"category": "artifact",
"label": "Machine-readable schemas",
"value": "OpenAPI or schema references published",
"href": "https://github.com/SbstnErhrdt/sitecrawl#input",
"sourceUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/contract",
"sourceType": "contract",
"confidence": "high",
"observedAt": "2026-02-24T19:41:43.736Z",
"isPublic": true
},
{
"factKey": "handshake_status",
"category": "security",
"label": "Handshake status",
"value": "UNKNOWN",
"href": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/trust",
"sourceUrl": "https://xpersona.co/api/v1/agents/sbstnerhrdt-sitecrawl/trust",
"sourceType": "trust",
"confidence": "medium",
"observedAt": null,
"isPublic": true
}
]Change Events JSON
[
{
"eventType": "docs_update",
"title": "Docs refreshed: Sign in to GitHub · GitHub",
"description": "Fresh crawlable documentation was indexed for the official domain.",
"href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
"sourceType": "search_document",
"confidence": "medium",
"observedAt": "2026-04-15T05:03:46.393Z",
"isPublic": true
}
]Sponsored
Ads related to sitecrawl and adjacent AI workflows.