Crawler Summary

internship-scout answer-first brief

Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- name: internship-scout description: Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- internship-scout File Layout **Data files** (workspace root): - internships.yaml — all collected positions - inter Capability contract not published. No trust telemetry is available yet. 11 GitHub stars reported by the source. Last updated 4/15/2026.

Freshness

Last checked 4/15/2026

Best For

internship-scout is best for general automation workflows where OpenClaw compatibility matters.

Not Ideal For

Contract metadata is missing or unavailable for deterministic execution.

Evidence Sources Checked

editorial-content, GITHUB OPENCLEW, runtime-metrics, public facts pack

Claim this agent
Agent DossierGitHubSafety: 100/100

internship-scout

Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- name: internship-scout description: Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- internship-scout File Layout **Data files** (workspace root): - internships.yaml — all collected positions - inter

OpenClawself-declared

Public facts

5

Change events

1

Artifacts

0

Freshness

Apr 15, 2026

Verifiededitorial-contentNo verified compatibility signals11 GitHub stars

Capability contract not published. No trust telemetry is available yet. 11 GitHub stars reported by the source. Last updated 4/15/2026.

11 GitHub starsTrust evidence available

Trust score

Unknown

Compatibility

OpenClaw

Freshness

Apr 15, 2026

Vendor

Wkf16

Artifacts

0

Benchmarks

0

Last release

Unpublished

Executive Summary

Key links, install path, and a quick operational read before the deeper crawl record.

Verifiededitorial-content

Summary

Capability contract not published. No trust telemetry is available yet. 11 GitHub stars reported by the source. Last updated 4/15/2026.

Setup snapshot

git clone https://github.com/wkf16/internship-scout.git
  1. 1

    Setup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.

  2. 2

    Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.

Evidence Ledger

Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.

Verifiededitorial-content
Vendor (1)

Vendor

Wkf16

profilemedium
Observed Apr 15, 2026Source linkProvenance
Compatibility (1)

Protocol compatibility

OpenClaw

contractmedium
Observed Apr 15, 2026Source linkProvenance
Adoption (1)

Adoption signal

11 GitHub stars

profilemedium
Observed Apr 15, 2026Source linkProvenance
Security (1)

Handshake status

UNKNOWN

trustmedium
Observed unknownSource linkProvenance
Integration (1)

Crawlable docs

6 indexed pages on the official domain

search_documentmedium
Observed Apr 15, 2026Source linkProvenance

Release & Crawl Timeline

Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.

Self-declaredagent-index

Artifacts Archive

Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.

Self-declaredGITHUB OPENCLEW

Extracted files

0

Examples

6

Snippets

0

Languages

typescript

Parameters

Executable Examples

text

skills/internship-scout/
├── SKILL.md
├── references/
│   ├── schema.md                # YAML field definitions
│   └── prefs-template.md        # Template for internship-prefs.md
└── scripts/
    ├── fetch_job_links.py       # Step 1: API 抓取职位列表
    ├── fetch_jd_dom.py          # Step 2: osascript DOM 抓 JD 原文
    ├── summarize_jds.py         # Step 3: subagent 批量生成 summary/tags/quality
    ├── dedup_check.py           # 去重检查
    └── notion_sync.py           # Notion upsert

text

fetch_job_links.py   →   fetch_jd_dom.py   →   summarize_jds.py   →   notion_sync.py
  (抓列表+结构字段)        (DOM抓JD原文)         (summary/tags/ABCD)      (Notion同步)

bash

test -f ~/.openclaw/workspace/internship-prefs.md && echo exists || echo missing

bash

test -f ~/.openclaw/workspace/internships.yaml && echo exists || echo missing

bash

python3 skills/internship-scout/scripts/fetch_job_links.py \
  --prefs internship-prefs.md \
  --yaml internships.yaml

bash

python3 skills/internship-scout/scripts/fetch_jd_dom.py \
  --yaml internships.yaml \
  --limit 5 \
  --min-delay 2.0 --max-delay 5.0

Docs & README

Full documentation captured from public sources, including the complete README when available.

Self-declaredGITHUB OPENCLEW

Docs source

GITHUB OPENCLEW

Editorial quality

ready

Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- name: internship-scout description: Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings. --- internship-scout File Layout **Data files** (workspace root): - internships.yaml — all collected positions - inter

Full README

name: internship-scout description: Search BOSS直聘 for non-big-tech internship positions via chrome-osascript, extract full JD (DOM raw), summarize and tag via subagent, and persist to YAML. Optionally sync to Notion. Use when the user wants to find, collect, analyze, or track internship listings.

internship-scout

File Layout

skills/internship-scout/
├── SKILL.md
├── references/
│   ├── schema.md                # YAML field definitions
│   └── prefs-template.md        # Template for internship-prefs.md
└── scripts/
    ├── fetch_job_links.py       # Step 1: API 抓取职位列表
    ├── fetch_jd_dom.py          # Step 2: osascript DOM 抓 JD 原文
    ├── summarize_jds.py         # Step 3: subagent 批量生成 summary/tags/quality
    ├── dedup_check.py           # 去重检查
    └── notion_sync.py           # Notion upsert

Data files (workspace root):

  • internships.yaml — all collected positions
  • internship-prefs.md — user job preferences (created on first run)

Workflow Overview

fetch_job_links.py   →   fetch_jd_dom.py   →   summarize_jds.py   →   notion_sync.py
  (抓列表+结构字段)        (DOM抓JD原文)         (summary/tags/ABCD)      (Notion同步)

Step 1 — Preferences & Data File

检查偏好文件:

test -f ~/.openclaw/workspace/internship-prefs.md && echo exists || echo missing

Missing → ask the user:

  1. 目标城市(上海 / 北京 / 深圳 / 杭州 / 全国 / 远程)
  2. 期望日薪下限(元/天)
  3. 公司规模偏好(20-99人 / 100-499人 / 都可以)
  4. 融资阶段偏好(天使轮/A轮/B轮以上/不限)
  5. 岗位方向关键词(如:agent, LLM, 大模型)
  6. 技术栈偏好(如:Python, LangChain, RAG)
  7. 额外排除关键词
  8. 学历情况(本科在读 / 硕士在读)
  9. 可实习时长
  10. 大厂排除列表(留空使用默认:字节/阿里/腾讯/百度等;填「不限」则不过滤大厂)
  11. 非技术岗排除列表(留空使用默认:销售/运营/HR等;填「不限」则不过滤)

Exists → load silently. Re-trigger only if user says "更新偏好" or "重置偏好".

检查数据文件:

test -f ~/.openclaw/workspace/internships.yaml && echo exists || echo missing
  • Missing → 将从空文件开始,直接进入 Step 2。
  • Exists → 询问用户:在现有数据基础上追加(默认),还是覆盖重建
    • 追加:fetch_job_links.py 会自动跳过已收录 URL,无需额外操作。
    • 覆盖:备份后清空,cp internships.yaml internships.yaml.bak && echo '[]' > internships.yaml

Step 2 — Fetch Job Links

python3 skills/internship-scout/scripts/fetch_job_links.py \
  --prefs internship-prefs.md \
  --yaml internships.yaml

internship-prefs.md 读取搜索词、城市、规模等偏好,调用 BOSS直聘内部 API 抓取职位列表。只写结构字段(title/company/salary/location/url 等),不含 JD 正文。

过滤规则(均可在 prefs 中配置):

  • 大厂排除:留空 → 使用内置默认列表;填公司名 → 只排除指定公司;填「无」或「不限」→ 关闭大厂过滤
  • 非技术岗排除:同上逻辑,留空使用默认(销售/运营/HR等)
  • 低薪过滤:日薪低于 日薪下限 的条目自动跳过
  • 去重:已收录 URL 自动跳过

Step 3 — Fetch JD DOM

python3 skills/internship-scout/scripts/fetch_jd_dom.py \
  --yaml internships.yaml \
  --limit 5 \
  --min-delay 2.0 --max-delay 5.0
  • 用 osascript 打开 Chrome,DOM 抓取 .job-sec-text 节点原文(带换行)
  • 只处理 jd_full 为空且无 fetch_error 的条目
  • 空 JD → 写 fetch_error: empty_job_sec_text,exit 1
  • --refetch 强制重抓已有 jd_full 的条目

前提:Chrome 已开启 View > Developer > Allow JavaScript from Apple Events


Step 4 — Summarize JDs (single subagent)

# 查看待处理条目
python3 skills/internship-scout/scripts/summarize_jds.py --list-pending

# 打印完整 prompt(调试用)
python3 skills/internship-scout/scripts/summarize_jds.py --dry-run

# 写回 subagent 结果
python3 skills/internship-scout/scripts/summarize_jds.py --write-result '<json>'

工作流(由主会话 orchestrate)

1. --list-pending      → 确认待处理数量
2. --dry-run           → 拿到完整 prompt
3. sessions_spawn(cleanup=keep, mode=run)  → subagent 纯文本推理
4. --write-result '<json>'                 → 写回 YAML

所有 pending 条目一次性放入单个 subagent,无需分批。

subagent 输入/输出

  • 输入:system prompt + 所有 pending JD 原文(纯文本)
  • 输出:严格 JSON 数组
    [{
      "id": 0,
      "clarity": 3,
      "tech_stack": 3,
      "role_signal": 2,
      "jd_score": 8,
      "jd_quality": "A",
      "jd_summary": "30-50字摘要",
      "tags": ["Python", "LLM"]
    }]
    
  • 不使用任何 tools,不联网

jd_quality 评级(三维评分)

| 维度 | 1分 | 2分 | 3分 | |------|-----|-----|-----| | clarity | 全是套话 | 有方向但笼统 | 职责明确可执行 | | tech_stack | 只有泛称/无要求 | 1-2个具体技术名 | ≥3个具体技术名 | | role_signal | 非技术/外包/销售 | 技术+产品混合 | 明确技术/算法/研究岗 |

总分(clarity+tech_stack+role_signal)映射:8-9→A,6-7→B,4-5→C,3→D,含外包/销售/无技术要求→F


Step 5 — Notion Sync

python3 skills/internship-scout/scripts/notion_sync.py \
  --yaml internships.yaml \
  --mode new        # new / update / all / reset

| 场景 | 命令 | |------|------| | 新增条目 | --mode new | | 更新已有 | --mode update | | 全量同步 | --mode all | | 重置去重 | --mode reset | | 单家公司 | --filter "公司名" --mode all |


Status Values

pendingappliedinterviewingoffered / rejected / ghosted

Contract & API

Machine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.

MissingGITHUB OPENCLEW

Contract coverage

Status

missing

Auth

None

Streaming

No

Data region

Unspecified

Protocol support

OpenClaw: self-declared

Requires: none

Forbidden: none

Guardrails

Operational confidence: low

No positive guardrails captured.
Invocation examples
curl -s "https://xpersona.co/api/v1/agents/wkf16-internship-scout/snapshot"
curl -s "https://xpersona.co/api/v1/agents/wkf16-internship-scout/contract"
curl -s "https://xpersona.co/api/v1/agents/wkf16-internship-scout/trust"

Reliability & Benchmarks

Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.

Missingruntime-metrics

Trust signals

Handshake

UNKNOWN

Confidence

unknown

Attempts 30d

unknown

Fallback rate

unknown

Runtime metrics

Observed P50

unknown

Observed P95

unknown

Rate limit

unknown

Estimated cost

unknown

Do not use if

Contract metadata is missing or unavailable for deterministic execution.
No benchmark suites or observed failure patterns are available.

Media & Demo

Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.

Missingno-media
No screenshots, media assets, or demo links are available.

Related Agents

Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.

Self-declaredprotocol-neighbors
GITHUB_REPOSactivepieces

Rank

70

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

Traction

No public download signal

Freshness

Updated 2d ago

OPENCLAW
GITHUB_REPOScherry-studio

Rank

70

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

Traction

No public download signal

Freshness

Updated 5d ago

MCPOPENCLAW
GITHUB_REPOSAionUi

Rank

70

Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!

Traction

No public download signal

Freshness

Updated 6d ago

MCPOPENCLAW
GITHUB_REPOSCopilotKit

Rank

70

The Frontend for Agents & Generative UI. React + Angular

Traction

No public download signal

Freshness

Updated 23d ago

OPENCLAW
Machine Appendix

Contract JSON

{
  "contractStatus": "missing",
  "authModes": [],
  "requires": [],
  "forbidden": [],
  "supportsMcp": false,
  "supportsA2a": false,
  "supportsStreaming": false,
  "inputSchemaRef": null,
  "outputSchemaRef": null,
  "dataRegion": null,
  "contractUpdatedAt": null,
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Invocation Guide

{
  "preferredApi": {
    "snapshotUrl": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/snapshot",
    "contractUrl": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/contract",
    "trustUrl": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/trust"
  },
  "curlExamples": [
    "curl -s \"https://xpersona.co/api/v1/agents/wkf16-internship-scout/snapshot\"",
    "curl -s \"https://xpersona.co/api/v1/agents/wkf16-internship-scout/contract\"",
    "curl -s \"https://xpersona.co/api/v1/agents/wkf16-internship-scout/trust\""
  ],
  "jsonRequestTemplate": {
    "query": "summarize this repo",
    "constraints": {
      "maxLatencyMs": 2000,
      "protocolPreference": [
        "OPENCLEW"
      ]
    }
  },
  "jsonResponseTemplate": {
    "ok": true,
    "result": {
      "summary": "...",
      "confidence": 0.9
    },
    "meta": {
      "source": "GITHUB_OPENCLEW",
      "generatedAt": "2026-04-17T00:51:40.421Z"
    }
  },
  "retryPolicy": {
    "maxAttempts": 3,
    "backoffMs": [
      500,
      1500,
      3500
    ],
    "retryableConditions": [
      "HTTP_429",
      "HTTP_503",
      "NETWORK_TIMEOUT"
    ]
  }
}

Trust JSON

{
  "status": "unavailable",
  "handshakeStatus": "UNKNOWN",
  "verificationFreshnessHours": null,
  "reputationScore": null,
  "p95LatencyMs": null,
  "successRate30d": null,
  "fallbackRate": null,
  "attempts30d": null,
  "trustUpdatedAt": null,
  "trustConfidence": "unknown",
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Capability Matrix

{
  "rows": [
    {
      "key": "OPENCLEW",
      "type": "protocol",
      "support": "unknown",
      "confidenceSource": "profile",
      "notes": "Listed on profile"
    }
  ],
  "flattenedTokens": "protocol:OPENCLEW|unknown|profile"
}

Facts JSON

[
  {
    "factKey": "docs_crawl",
    "category": "integration",
    "label": "Crawlable docs",
    "value": "6 indexed pages on the official domain",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  },
  {
    "factKey": "vendor",
    "category": "vendor",
    "label": "Vendor",
    "value": "Wkf16",
    "href": "https://github.com/wkf16/internship-scout",
    "sourceUrl": "https://github.com/wkf16/internship-scout",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-04-15T00:18:58.648Z",
    "isPublic": true
  },
  {
    "factKey": "protocols",
    "category": "compatibility",
    "label": "Protocol compatibility",
    "value": "OpenClaw",
    "href": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/contract",
    "sourceUrl": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/contract",
    "sourceType": "contract",
    "confidence": "medium",
    "observedAt": "2026-04-15T00:18:58.648Z",
    "isPublic": true
  },
  {
    "factKey": "traction",
    "category": "adoption",
    "label": "Adoption signal",
    "value": "11 GitHub stars",
    "href": "https://github.com/wkf16/internship-scout",
    "sourceUrl": "https://github.com/wkf16/internship-scout",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-04-15T00:18:58.648Z",
    "isPublic": true
  },
  {
    "factKey": "handshake_status",
    "category": "security",
    "label": "Handshake status",
    "value": "UNKNOWN",
    "href": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/trust",
    "sourceUrl": "https://xpersona.co/api/v1/agents/wkf16-internship-scout/trust",
    "sourceType": "trust",
    "confidence": "medium",
    "observedAt": null,
    "isPublic": true
  }
]

Change Events JSON

[
  {
    "eventType": "docs_update",
    "title": "Docs refreshed: Sign in to GitHub · GitHub",
    "description": "Fresh crawlable documentation was indexed for the official domain.",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  }
]

Sponsored

Ads related to internship-scout and adjacent AI workflows.