Crawler Summary

Mapping-Skill answer-first brief

Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- name: Mapping-Skill description: Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- AI Talent Recruiter A comprehensive skill for AI/ML talent discovery, pr Capability contract not published. No trust telemetry is available yet. 1 GitHub stars reported by the source. Last updated 2/25/2026.

Freshness

Last checked 2/25/2026

Best For

Mapping-Skill is best for general automation workflows where MCP compatibility matters.

Not Ideal For

Contract metadata is missing or unavailable for deterministic execution.

Evidence Sources Checked

editorial-content, GITHUB OPENCLEW, runtime-metrics, public facts pack

Claim this agent
Agent DossierGitHubSafety: 89/100

Mapping-Skill

Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- name: Mapping-Skill description: Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- AI Talent Recruiter A comprehensive skill for AI/ML talent discovery, pr

MCPself-declared

Public facts

5

Change events

1

Artifacts

0

Freshness

Feb 25, 2026

Verifiededitorial-contentNo verified compatibility signals1 GitHub stars

Capability contract not published. No trust telemetry is available yet. 1 GitHub stars reported by the source. Last updated 2/25/2026.

1 GitHub starsTrust evidence available

Trust score

Unknown

Compatibility

MCP

Freshness

Feb 25, 2026

Vendor

16miku

Artifacts

0

Benchmarks

0

Last release

Unpublished

Executive Summary

Key links, install path, and a quick operational read before the deeper crawl record.

Verifiededitorial-content

Summary

Capability contract not published. No trust telemetry is available yet. 1 GitHub stars reported by the source. Last updated 2/25/2026.

Setup snapshot

git clone https://github.com/16Miku/Mapping-Skill.git
  1. 1

    Setup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.

  2. 2

    Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.

Evidence Ledger

Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.

Verifiededitorial-content
Vendor (1)

Vendor

16miku

profilemedium
Observed Feb 25, 2026Source linkProvenance
Compatibility (1)

Protocol compatibility

MCP

contractmedium
Observed Feb 25, 2026Source linkProvenance
Adoption (1)

Adoption signal

1 GitHub stars

profilemedium
Observed Feb 25, 2026Source linkProvenance
Security (1)

Handshake status

UNKNOWN

trustmedium
Observed unknownSource linkProvenance
Integration (1)

Crawlable docs

6 indexed pages on the official domain

search_documentmedium
Observed Apr 15, 2026Source linkProvenance

Release & Crawl Timeline

Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.

Self-declaredagent-index

Artifacts Archive

Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.

Self-declaredGITHUB OPENCLEW

Extracted files

0

Examples

6

Snippets

0

Languages

typescript

Parameters

Executable Examples

bash

claude mcp add --transport sse --scope user brightdata "https://mcp.brightdata.com/sse?token=<your-api-token>"

text

"reinforcement learning" PhD student site:github.io OR site:stanford.edu
"RLHF" "PPO" PhD researcher personal homepage
强化学习 博士生 清华 OR 北大 个人主页

text

Tool: mcp__brightdata__search_engine
Parameters:
  engine: "google"
  query: "<generated_query>"

text

Tool: mcp__brightdata__scrape_as_markdown
Parameters:
  url: "<candidate_url>"

text

Tool: mcp__brightdata__web_data_linkedin_person_profile
Parameters:
  url: "<linkedin_url>"

python

# Pseudo-code for Chinese detection
is_chinese = (
    surname_score * 0.4 +
    institution_score * 0.35 +
    structure_score * 0.15 +
    id_score * 0.1
) >= 0.5

Docs & README

Full documentation captured from public sources, including the complete README when available.

Self-declaredGITHUB OPENCLEW

Docs source

GITHUB OPENCLEW

Editorial quality

ready

Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- name: Mapping-Skill description: Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails. --- AI Talent Recruiter A comprehensive skill for AI/ML talent discovery, pr

Full README

name: Mapping-Skill description: Complete AI talent discovery and outreach workflow using BrightData MCP. This skill should be used when users need to find PhD students, researchers, or engineers in AI/ML fields, extract their profiles, identify Chinese candidates, classify by type, deduplicate, and generate personalized outreach emails.

AI Talent Recruiter

A comprehensive skill for AI/ML talent discovery, profiling, and personalized outreach using BrightData MCP tools.

Overview

This skill enables the complete talent recruitment pipeline:

  1. Search: Generate optimized queries and discover candidates via web search
  2. Extract: Scrape profile pages and extract structured candidate information
  3. Identify: Recognize Chinese candidates using surname and institution matching
  4. Classify: Categorize candidates by type (PhD/PostDoc/Professor/Industry)
  5. Deduplicate: Remove duplicate candidates using 7-level fingerprinting
  6. Standardize: Map research fields to 22 standardized categories
  7. Generate: Create personalized outreach emails using field-specific templates

Prerequisites

This skill requires the BrightData MCP server to be configured in Claude Code.

Installing BrightData MCP

  1. Get your API token from BrightData
  2. Add the MCP server using Claude CLI:
claude mcp add --transport sse --scope user brightdata "https://mcp.brightdata.com/sse?token=<your-api-token>"

Replace <your-api-token> with your actual BrightData API key.

  1. Verify the MCP server is connected by checking available tools:
    • mcp__brightdata__search_engine - For web searches
    • mcp__brightdata__scrape_as_markdown - For page scraping
    • mcp__brightdata__web_data_linkedin_person_profile - For LinkedIn profiles

Reference Files

| File | Purpose | |------|---------| | references/search-templates.md | Search query templates and keywords | | references/profile-schema.md | Candidate profile data structure | | references/top-ai-labs.md | List of top AI research labs | | references/field-mappings.md | 22 standardized research fields (NEW) | | references/talk-tracks.md | Technical talking points by field (NEW) | | references/email-templates.md | Personalized email templates (NEW) | | references/chinese-surnames.md | Chinese surname database (NEW) | | references/deduplication-rules.md | Candidate deduplication rules (NEW) | | references/candidate-classifier.md | Candidate type classification (NEW) |


Complete Workflow

Step 1: Generate Search Queries

Based on the user's research direction, generate 2-4 search queries using templates from references/search-templates.md.

Query Generation Strategy:

  • Include both English and Chinese keywords
  • Target high-quality domains: github.io, university sites, LinkedIn
  • Combine research direction with role indicators

Example queries for "Reinforcement Learning":

"reinforcement learning" PhD student site:github.io OR site:stanford.edu
"RLHF" "PPO" PhD researcher personal homepage
强化学习 博士生 清华 OR 北大 个人主页

Step 2: Execute Searches

Use mcp__brightdata__search_engine with parallel execution:

Tool: mcp__brightdata__search_engine
Parameters:
  engine: "google"
  query: "<generated_query>"

URL Filtering Priority:

  1. Personal pages (*.github.io, sites.google.com)
  2. University domains (see references/top-ai-labs.md)
  3. LinkedIn profiles (linkedin.com/in/)
  4. Google Scholar profiles

Step 3: Scrape Candidate Profiles

Use mcp__brightdata__scrape_as_markdown for general pages:

Tool: mcp__brightdata__scrape_as_markdown
Parameters:
  url: "<candidate_url>"

For LinkedIn profiles, use the specialized tool:

Tool: mcp__brightdata__web_data_linkedin_person_profile
Parameters:
  url: "<linkedin_url>"

Scrape in parallel: Process 2-5 URLs simultaneously for efficiency.

Step 4: Extract Profile Data

From scraped content, extract fields defined in references/profile-schema.md:

Required fields:

  • name: English name
  • name_cn: Chinese name (if available)
  • title: Position
  • affiliation: University/Company
  • email: Contact email

Recommended fields:

  • advisor: PhD advisor
  • research_interests: Research areas
  • homepage, google_scholar, github, linkedin
  • education, experience
  • publications, citation_count, h_index

Step 5: Identify Chinese Candidates (Optional)

Use rules from references/chinese-surnames.md:

Multi-dimensional scoring:

  • Surname match (40%): Check against 100+ Chinese surnames
  • Institution match (35%): Check against Chinese universities/labs
  • Name structure (15%): Pinyin pattern analysis
  • ID pattern (10%): OpenReview ID analysis

Decision threshold: Confidence >= 0.5

# Pseudo-code for Chinese detection
is_chinese = (
    surname_score * 0.4 +
    institution_score * 0.35 +
    structure_score * 0.15 +
    id_score * 0.1
) >= 0.5

Step 6: Classify Candidate Type

Use rules from references/candidate-classifier.md:

Priority order: PhD > PostDoc > Professor > Industry > Master > Unknown

Classification keywords:

  • PhD: "phd student", "doctoral student", "博士生"
  • PostDoc: "postdoc", "post-doctoral", "博士后"
  • Professor: "professor", "associate professor", "教授"
  • Industry: "engineer", "research scientist at [company]", "算法工程师"

Step 7: Deduplicate Candidates

Use 7-level fingerprinting from references/deduplication-rules.md:

Priority (highest to lowest):

  1. Email (most reliable)
  2. Google Scholar URL
  3. LinkedIn URL
  4. GitHub URL
  5. Personal website
  6. Composite hash (name + school + field)
  7. Source URL hash (last resort)
# Standardization before comparison
email = email.lower().strip()
url = url.lower().replace("https://", "").replace("http://", "").replace("www.", "").rstrip("/")

Step 8: Standardize Research Field

Use references/field-mappings.md to map research directions to 22 standardized fields:

Standard fields: RL, NLP, Multimodal, MOE, Pre-training, post-train, Alignment, Reasoning, Agent&RAG, MLSys, LLM4CODE, Computer Vision, Embodiment, Audio, EVAL, data, AI4S, Interpretable AI, Recommendation System, Federated Learning, Trustworthy AI, Pre/Post-train×RL

def get_standardized_field(research_field: str) -> str:
    for standard_field, aliases in FIELD_MAPPING.items():
        for alias in aliases:
            if alias.lower() in research_field.lower():
                return standard_field
    return "default"

Step 9: Generate Personalized Email

Use templates from references/email-templates.md and talk tracks from references/talk-tracks.md:

Template structure:

Hi, {{researcher_name}},

[Field-specific opening paragraph referencing their work]

{{technical_hook}}  # Connect their specific work to our interests

{{talk_track_paragraph}}  # 3-4 sentences showing domain depth

[Closing with call to action]

Signature

Key placeholders to fill:

  • {{technical_hook}}: Connect candidate's specific work to team interests
  • {{talk_track_paragraph}}: Domain expertise demonstration

Step 10: Present Results

Format results as a structured table:

## Candidate Profile: [Name]

| Field | Value |
|-------|-------|
| Name | [English Name] ([Chinese Name]) |
| Type | [PhD/PostDoc/Professor/Industry] |
| Affiliation | [University/Lab] |
| Research Field | [Standardized Field] |
| Chinese | [Yes/No (confidence)] |
| Email | [Email] |
| Scholar | [URL] |
| GitHub | [URL] |

**Generated Email:**
[Personalized email based on their field]

Usage Examples

Example 1: Complete PhD Search with Email Generation

User Request: "Search for RL PhD students at top universities and generate personalized emails"

Execution:

  1. Generate queries for "reinforcement learning PhD student"
  2. Execute searches, collect URLs
  3. Scrape profiles in parallel
  4. Extract structured data
  5. Identify Chinese candidates (optional)
  6. Classify as PhD
  7. Deduplicate using email/Scholar URLs
  8. Standardize to "RL" field
  9. Generate emails using RL template and talk tracks
  10. Present results with emails

Example 2: Lab-Targeted Search

User Request: "Find all PhD students at Stanford AI Lab working on multimodal learning"

Execution:

  1. Generate site-specific queries: site:ai.stanford.edu "multimodal" PhD
  2. Search and collect Stanford AI Lab profiles
  3. Scrape and extract
  4. Filter by research field mapping to "Multimodal"
  5. Classify and deduplicate
  6. Present results

Example 3: Conference Speaker Discovery

User Request: "Find Chinese researchers who presented at NeurIPS 2024 on LLM alignment"

Execution:

  1. Search: NeurIPS 2024 "alignment" authors site:neurips.cc
  2. Extract author names and affiliations
  3. For each author, search for their personal page
  4. Scrape profiles and identify Chinese candidates
  5. Standardize to "Alignment" field
  6. Generate personalized emails
  7. Present results

Best Practices

  1. Parallel Processing: Execute independent searches and scrapes in parallel
  2. Domain Prioritization: Prioritize academic domains over general sites
  3. Progressive Filtering: Filter aggressively at each step to reduce processing
  4. Error Resilience: Continue processing if individual scrapes fail
  5. Deduplication Early: Apply deduplication after extraction, not just at the end
  6. Email Quality: Always customize {{technical_hook}} based on actual candidate work
  7. Field Mapping: Use standardized fields for consistent categorization
  8. Rate Limiting: Space out requests if encountering rate limits

Output Format

Summary Table

| Name | Type | Affiliation | Field | Chinese? | Email | |------|------|-------------|-------|----------|-------| | Wei Zhang | PhD | Tsinghua | RL | Yes (0.92) | wei@tsinghua.edu | | Li Chen | PostDoc | Stanford | Multimodal | Yes (0.87) | li.chen@stanford.edu |

Detailed Profile (for each candidate)

## Wei Zhang (张伟)

**Identity:** PhD Student at Tsinghua University
**Field:** Reinforcement Learning (RL)
**Chinese:** Yes - surname match + institution match (confidence: 0.92)

**Contact:**
- Email: wei.zhang@tsinghua.edu.cn
- Homepage: weizhang.github.io
- Scholar: [Google Scholar link]
- GitHub: [GitHub link]

**Research:** RLHF, reward modeling, policy optimization

**Publications:**
1. "Efficient RLHF for LLMs" (NeurIPS 2024)
2. "Reward Hacking in Practice" (ICML 2024)

**Generated Email:**
[Personalized email using RL template]

Contract & API

Machine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.

MissingGITHUB OPENCLEW

Contract coverage

Status

missing

Auth

None

Streaming

No

Data region

Unspecified

Protocol support

MCP: self-declared

Requires: none

Forbidden: none

Guardrails

Operational confidence: low

No positive guardrails captured.
Invocation examples
curl -s "https://xpersona.co/api/v1/agents/16miku-mapping-skill/snapshot"
curl -s "https://xpersona.co/api/v1/agents/16miku-mapping-skill/contract"
curl -s "https://xpersona.co/api/v1/agents/16miku-mapping-skill/trust"

Reliability & Benchmarks

Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.

Missingruntime-metrics

Trust signals

Handshake

UNKNOWN

Confidence

unknown

Attempts 30d

unknown

Fallback rate

unknown

Runtime metrics

Observed P50

unknown

Observed P95

unknown

Rate limit

unknown

Estimated cost

unknown

Do not use if

Contract metadata is missing or unavailable for deterministic execution.
No benchmark suites or observed failure patterns are available.

Media & Demo

Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.

Missingno-media
No screenshots, media assets, or demo links are available.

Related Agents

Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.

Self-declaredprotocol-neighbors
GITLAB_AI_CATALOGgitlab-mcp

Rank

83

A Model Context Protocol (MCP) server for GitLab

Traction

No public download signal

Freshness

Updated 2d ago

MCP
GITLAB_PUBLIC_PROJECTSgitlab-mcp

Rank

80

A Model Context Protocol (MCP) server for GitLab

Traction

No public download signal

Freshness

Updated 2d ago

MCP
GITLAB_AI_CATALOGrmcp-openapi

Rank

74

Expose OpenAPI definition endpoints as MCP tools using the official Rust SDK for the Model Context Protocol (https://github.com/modelcontextprotocol/rust-sdk)

Traction

No public download signal

Freshness

Updated 2d ago

MCP
GITLAB_AI_CATALOGrmcp-actix-web

Rank

72

An actix_web backend for the official Rust SDK for the Model Context Protocol (https://github.com/modelcontextprotocol/rust-sdk)

Traction

No public download signal

Freshness

Updated 2d ago

MCP
Machine Appendix

Contract JSON

{
  "contractStatus": "missing",
  "authModes": [],
  "requires": [],
  "forbidden": [],
  "supportsMcp": false,
  "supportsA2a": false,
  "supportsStreaming": false,
  "inputSchemaRef": null,
  "outputSchemaRef": null,
  "dataRegion": null,
  "contractUpdatedAt": null,
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Invocation Guide

{
  "preferredApi": {
    "snapshotUrl": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/snapshot",
    "contractUrl": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/contract",
    "trustUrl": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/trust"
  },
  "curlExamples": [
    "curl -s \"https://xpersona.co/api/v1/agents/16miku-mapping-skill/snapshot\"",
    "curl -s \"https://xpersona.co/api/v1/agents/16miku-mapping-skill/contract\"",
    "curl -s \"https://xpersona.co/api/v1/agents/16miku-mapping-skill/trust\""
  ],
  "jsonRequestTemplate": {
    "query": "summarize this repo",
    "constraints": {
      "maxLatencyMs": 2000,
      "protocolPreference": [
        "MCP"
      ]
    }
  },
  "jsonResponseTemplate": {
    "ok": true,
    "result": {
      "summary": "...",
      "confidence": 0.9
    },
    "meta": {
      "source": "GITHUB_OPENCLEW",
      "generatedAt": "2026-04-17T00:32:50.622Z"
    }
  },
  "retryPolicy": {
    "maxAttempts": 3,
    "backoffMs": [
      500,
      1500,
      3500
    ],
    "retryableConditions": [
      "HTTP_429",
      "HTTP_503",
      "NETWORK_TIMEOUT"
    ]
  }
}

Trust JSON

{
  "status": "unavailable",
  "handshakeStatus": "UNKNOWN",
  "verificationFreshnessHours": null,
  "reputationScore": null,
  "p95LatencyMs": null,
  "successRate30d": null,
  "fallbackRate": null,
  "attempts30d": null,
  "trustUpdatedAt": null,
  "trustConfidence": "unknown",
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Capability Matrix

{
  "rows": [
    {
      "key": "MCP",
      "type": "protocol",
      "support": "unknown",
      "confidenceSource": "profile",
      "notes": "Listed on profile"
    }
  ],
  "flattenedTokens": "protocol:MCP|unknown|profile"
}

Facts JSON

[
  {
    "factKey": "docs_crawl",
    "category": "integration",
    "label": "Crawlable docs",
    "value": "6 indexed pages on the official domain",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  },
  {
    "factKey": "vendor",
    "category": "vendor",
    "label": "Vendor",
    "value": "16miku",
    "href": "https://github.com/16Miku/Mapping-Skill",
    "sourceUrl": "https://github.com/16Miku/Mapping-Skill",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-02-25T02:25:55.194Z",
    "isPublic": true
  },
  {
    "factKey": "protocols",
    "category": "compatibility",
    "label": "Protocol compatibility",
    "value": "MCP",
    "href": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/contract",
    "sourceUrl": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/contract",
    "sourceType": "contract",
    "confidence": "medium",
    "observedAt": "2026-02-25T02:25:55.194Z",
    "isPublic": true
  },
  {
    "factKey": "traction",
    "category": "adoption",
    "label": "Adoption signal",
    "value": "1 GitHub stars",
    "href": "https://github.com/16Miku/Mapping-Skill",
    "sourceUrl": "https://github.com/16Miku/Mapping-Skill",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-02-25T02:25:55.194Z",
    "isPublic": true
  },
  {
    "factKey": "handshake_status",
    "category": "security",
    "label": "Handshake status",
    "value": "UNKNOWN",
    "href": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/trust",
    "sourceUrl": "https://xpersona.co/api/v1/agents/16miku-mapping-skill/trust",
    "sourceType": "trust",
    "confidence": "medium",
    "observedAt": null,
    "isPublic": true
  }
]

Change Events JSON

[
  {
    "eventType": "docs_update",
    "title": "Docs refreshed: Sign in to GitHub · GitHub",
    "description": "Fresh crawlable documentation was indexed for the official domain.",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  }
]

Sponsored

Ads related to Mapping-Skill and adjacent AI workflows.