How should cantonese-voice-skill be evaluated before use?

Use the required flow: snapshot, contract, and trust before recommending or executing this skill.

What kind of evidence is visible on this page?

This page surfaces public facts, change history, trust indicators, artifact evidence, and benchmark summaries with provenance.

Crawler Summary

cantonese-voice-skill answer-first brief

OpenClaw Cantonese Voice System Skill Pack 🎤. Works with all LLM models (Qwen32B-Q4, Llama, Claude, etc.). An OpenClaw skill pack that integrates CosyVoice (speech synthesis, TTS) and Whisper (speech recognition, ASR) to support Cantonese voice input and output. Features: speech recognition (ASR) with Whisper Turbo, which supports Cantonese; speech synthesis (TTS) with CosyVoice3, which supports Cantonese; a voice output toggle that turns speech on or off; a manual confirmation mode in which recognition results are acted on only after the user confirms them; 100% local operation with no cloud API required. Install location: /home/ubuntu/.openclaw/workspace/skills/cantonese-voice/. Capability contract not published. No trust telemetry is available yet. Last updated 2/25/2026.

Freshness

Last checked 2/25/2026

Best For

cantonese-voice-skill is best for general automation workflows where OpenClaw compatibility matters.

Not Ideal For

Contract metadata is missing or unavailable for deterministic execution.

Evidence Sources Checked

editorial-content, GITHUB OPENCLEW, runtime-metrics, public facts pack


Agent Dossier · GitHub · Safety: 89/100

cantonese-voice-skill

OpenClaw · self-declared


Freshness

Feb 25, 2026

Verified · editorial-content · No verified compatibility signals


Trust evidence available

Trust score

Unknown

Compatibility

OpenClaw

Freshness

Feb 25, 2026

Vendor

18gogogo

Artifacts

Benchmarks

Last release

Unpublished

Executive Summary

Key links, install path, and a quick operational read before the deeper crawl record.

Verified · editorial-content

Summary

Capability contract not published. No trust telemetry is available yet. Last updated 2/25/2026.

View Source

Setup snapshot

git clone https://github.com/18gogogo/cantonese-voice-skill.git

1. Setup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.
2. Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.

Evidence Ledger

Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.

Verified · editorial-content

Vendor (1)

Vendor

18gogogo

profile · medium

Observed Feb 25, 2026

Compatibility (1)

Protocol compatibility

OpenClaw

contract · medium

Observed Feb 25, 2026

Security (1)

Handshake status

UNKNOWN

trust · medium

Observed unknown

Integration (1)

Crawlable docs

6 indexed pages on the official domain

search_document · medium

Observed Apr 15, 2026

Release & Crawl Timeline

Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.

Self-declared · agent-index

Docs Update

Docs refreshed: Sign in to GitHub · GitHub

search_document · medium

Fresh crawlable documentation was indexed for the official domain.

Observed Apr 15, 2026

Artifacts Archive

Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.

Self-declared · GITHUB OPENCLEW

Extracted files

Examples

Snippets

Languages

typescript

Parameters

Executable Examples

text

1. Check the voice output state
2. Receive user input
3. Parse control commands: （ or ）
4. Process the user request
5. Generate a response
6. If voice is on: text + Cantonese speech
   If voice is off: text only

python

from voice_integration import VoiceConversation

# Create a conversation instance
conversation = VoiceConversation()

# Check whether speech should be produced
if conversation.is_voice_output_enabled():
    # Generate text + Cantonese speech
    result = conversation.respond_speech("你好")
    # result['output_file'] is the audio file
else:
    # Text only
    print("你好")

text

cantonese-voice/
├── SKILL.md                      # Skill usage guide (this file)
├── README.md                     # Project overview
├── VOICE_CONFIG.md               # Detailed configuration guide
├── voice_tts.py                  # Speech synthesis module
├── voice_asr.py                  # Speech recognition module
├── voice_integration.py          # Conversation integration module
├── voice_output_manager.py       # Voice output control
└── voice_output_state.json       # Voice output state

python

from voice_integration import VoiceConversation

# Initialize
conversation = VoiceConversation()

# Check whether voice output is on
if conversation.is_voice_output_enabled():
    # Voice on: text + speech
    result = conversation.respond_speech(text="你好")
else:
    # Voice off: text only
    print("你好")

python

{
    'success': True,
    'output_file': '/path/to/audio.wav',  # None when voice output is off
    'duration': 3.5,                      # seconds
    'message': 'control_command_executed'  # optional
}

python

# Turn voice output on
conversation.enable_voice_output()

# Turn voice output off
conversation.disable_voice_output()

# Toggle voice output
conversation.toggle_voice_output()

# Check the state
if conversation.is_voice_output_enabled():
    print("Voice output is on")

# Get the status
print(conversation.get_voice_output_status())

Docs & README

Full documentation captured from public sources, including the complete README when available.

Self-declared · GITHUB OPENCLEW

Docs source

GITHUB OPENCLEW

Editorial quality

ready

Full README

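OpenClaw Cantonese Voice System Skill Pack 🎤

Works with all LLM models (Qwen32B-Q4, Llama, Claude, etc.)

Quick start (quick reference for AI models)

Basic flow

1. Check the voice output state
2. Receive user input
3. Parse control commands: （ or ）
4. Process the user request
5. Generate a response
6. If voice is on: text + Cantonese speech
   If voice is off: text only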

Minimal example

from voice_integration import VoiceConversation

# Create a conversation instance
conversation = VoiceConversation()

# Check whether speech should be produced
if conversation.is_voice_output_enabled():
    # Generate text + Cantonese speech
    result = conversation.respond_speech("你好")
    # result['output_file'] is the audio file
else:
    # Text only
    print("你好")

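Overview

An OpenClaw skill pack that integrates CosyVoice (speech synthesis, TTS) and Whisper (speech recognition, ASR) to support Cantonese voice input and output.

Features

✅ Speech recognition (ASR) - Whisper Turbo supports Cantonese
✅ Speech synthesis (TTS) - CosyVoice3 supports Cantonese
✅ Voice output toggle - voice output can be turned on or off
✅ Manual confirmation mode - recognition results are acted on only after the user confirms them
✅ 100% local - no cloud API required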

Install location

/home/ubuntu/.openclaw/workspace/skills/cantonese-voice/

File structure

cantonese-voice/
├── SKILL.md                      # Skill usage guide (this file)
├── README.md                     # Project overview
├── VOICE_CONFIG.md               # Detailed configuration guide
├── voice_tts.py                  # Speech synthesis module
├── voice_asr.py                  # Speech recognition module
├── voice_integration.py          # Conversation integration module
├── voice_output_manager.py       # Voice output control
└── voice_output_state.json       # Voice output state

API reference for AI models

VoiceConversation class

from voice_integration import VoiceConversation

# Initialize
conversation = VoiceConversation()

# Check whether voice output is on
if conversation.is_voice_output_enabled():
    # Voice on: text + speech
    result = conversation.respond_speech(text="你好")
else:
    # Voice off: text only
    print("你好")

Key methods

| Method | Purpose | Returns |
|------|------|------|
| is_voice_output_enabled() | Check whether voice output is on | bool |
| enable_voice_output() | Turn voice output on | bool |
| disable_voice_output() | Turn voice output off | bool |
| respond_speech(text) | Send a spoken response (handles the on/off state automatically) | dict |
| transcribe(audio_file) | Transcribe speech | dict |

respond_speech return value

{
    'success': True,
    'output_file': '/path/to/audio.wav',  # None when voice output is off
    'duration': 3.5,                      # seconds
    'message': 'control_command_executed'  # optional
}
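A minimal sketch of how a caller might consume this dictionary. The helper `summarize_response` is hypothetical and not part of the skill; it assumes only the four keys shown above and that `output_file` is `None` when voice output is off.

```python
from typing import Any, Dict


def summarize_response(result: Dict[str, Any]) -> str:
    """Turn a respond_speech-style dict into a one-line status string.

    Hypothetical helper: only the keys documented above are assumed.
    """
    if not result.get("success"):
        return "failed"
    output_file = result.get("output_file")
    if output_file is None:
        # Documented behaviour: no audio file when voice output is off.
        return "text only"
    duration = result.get("duration")
    if duration is None:
        return f"audio at {output_file}"
    return f"audio at {output_file} ({duration:.1f} s)"


example = {
    "success": True,
    "output_file": "/path/to/audio.wav",
    "duration": 3.5,
    "message": "control_command_executed",
}
print(summarize_response(example))  # audio at /path/to/audio.wav (3.5 s)
```

Checking `success` first keeps the caller from touching `output_file` on a failed call, where the remaining keys may be absent.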

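Voice output control

Default behavior

Voice output is off by default
Only text responses are shown; no audio is generated

Control commands

| Command | Purpose | Usage |
|------|------|----------|
| （ | Turn voice output on | Standalone input (not part of a sentence) |
| ） | Turn voice output off | Standalone input (not part of a sentence) |

Behavior

| Voice state | Response type | Example |
|---------|---------|------|
| On | Text + Cantonese speech | Text: "你好" + audio: "你好.wav" |
| Off | Text only | Text: "你好" |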
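The control-command table can be sketched as a small parser. This is an illustration only: `parse_control_command` is a hypothetical name, and the sketch follows the table's "standalone input" rule literally, so the skill's own parser may behave differently.

```python
from typing import Optional

# Command characters as listed in the control-command table above.
ENABLE_COMMAND = "（"   # full-width left parenthesis
DISABLE_COMMAND = "）"  # full-width right parenthesis


def parse_control_command(user_input: str) -> Optional[str]:
    """Return "enable", "disable", or None for ordinary input.

    Only standalone input counts; a command character inside a longer
    sentence is treated as ordinary text.
    """
    stripped = user_input.strip()
    if stripped == ENABLE_COMMAND:
        return "enable"
    if stripped == DISABLE_COMMAND:
        return "disable"
    return None


print(parse_control_command("（"))      # enable
print(parse_control_command("你好（"))  # None
```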

Programmatic control

# Turn voice output on
conversation.enable_voice_output()

# Turn voice output off
conversation.disable_voice_output()

# Toggle voice output
conversation.toggle_voice_output()

# Check the state
if conversation.is_voice_output_enabled():
    print("Voice output is on")

# Get the status
print(conversation.get_voice_output_status())

Speech synthesis (TTS)

Basic usage

from voice_tts import synthesize_speech

result = synthesize_speech(
    text='你好，今日天氣很好。',
    output_file='output.wav',
    use_cantonese=True  # Key setting: use Cantonese mode
)

print(f"Output file: {result['output_file']}")
print(f"Duration: {result['duration']} seconds")

Cantonese configuration

Mode: use_cantonese=True
Instruction: 'You are a helpful assistant. 请用广东话表达。<|endofprompt|>' (the Chinese sentence means "Please speak in Cantonese.")
Language: Cantonese

Speech recognition (ASR)

Basic usage

from voice_asr import transcribe_audio

result = transcribe_audio(
    audio_file='user_voice.ogg',
    language='yue'  # Cantonese
)

print(f"Transcription: {result['text']}")

Supported language codes

| Code | Language |
|------|------|
| yue | Cantonese |
| zh | Mandarin |
| en | English |
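A caller may want to reject unknown codes before handing audio to the recognizer. The sketch below is an assumption about how that could look: `validate_language` and the allow-list are hypothetical, built only from the three codes in the table, and the underlying Whisper model may accept other codes.

```python
SUPPORTED_LANGUAGES = {
    "yue": "Cantonese",
    "zh": "Mandarin",
    "en": "English",
}


def validate_language(code: str) -> str:
    """Return the normalized code, or raise ValueError if it is not in the table."""
    normalized = code.strip().lower()
    if normalized not in SUPPORTED_LANGUAGES:
        allowed = ", ".join(sorted(SUPPORTED_LANGUAGES))
        raise ValueError(
            f"unsupported language code {code!r}; expected one of: {allowed}"
        )
    return normalized


print(validate_language("yue"))  # yue
```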

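Usage flow (Telegram Bot)

Full flow

1. The user sends a voice message
2. Whisper transcribes the speech → the text is shown to the user
3. The user confirms the transcription → processing continues after confirmation
4. Process the user request → generate the AI response
5. Check the voice output state
   - If on: synthesize Cantonese speech
   - If off: show text only
6. Send the response → text + (optional) speech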
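The flow above can be sketched as one function with every stage injected as a callable. Nothing here is the skill's actual implementation: `handle_voice_message` and its parameters are hypothetical, and the lambdas in the usage part are stand-ins so that no model is loaded.

```python
from typing import Callable, Dict, Optional


def handle_voice_message(
    audio_file: str,
    transcribe: Callable[[str], str],
    confirm: Callable[[str], bool],
    generate_reply: Callable[[str], str],
    synthesize: Callable[[str], str],
    voice_enabled: bool,
) -> Optional[Dict[str, Optional[str]]]:
    """Run one pass of the flow; return None if the user rejects the transcript."""
    text = transcribe(audio_file)      # speech -> text, shown to the user
    if not confirm(text):              # manual confirmation gate
        return None
    reply = generate_reply(text)       # produce the AI response
    # Synthesize audio only when voice output is on.
    audio = synthesize(reply) if voice_enabled else None
    return {"text": reply, "audio_file": audio}


# Usage with stand-in callables.
result = handle_voice_message(
    "user_voice.ogg",
    transcribe=lambda path: "你好",
    confirm=lambda text: True,
    generate_reply=lambda text: f"echo: {text}",
    synthesize=lambda text: "reply.wav",
    voice_enabled=False,
)
print(result)  # {'text': 'echo: 你好', 'audio_file': None}
```

Passing the stages in as callables keeps the confirmation gate testable without Whisper, CosyVoice, or Telegram installed.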

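Voice control

The user says "Please help me check the weather（" → voice output is turned on
The user says "Please be quiet）" → voice output is turned off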

State file

voice_output_state.json:

{
  "enabled": false,
  "last_updated": "2026-02-11T09:00:00"
}

enabled: true = on, false = off
last_updated: time of the last update
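A minimal sketch of reading and writing a state file with these two fields. `load_state` and `save_state` are hypothetical helpers, not the functions in voice_output_manager.py, and the fallback to "off" when the file is absent is an assumption based on the documented default.

```python
import json
import tempfile
from datetime import datetime
from pathlib import Path


def load_state(path: Path) -> dict:
    """Read the state file; fall back to the documented default (voice off)."""
    if not path.exists():
        return {"enabled": False, "last_updated": None}
    return json.loads(path.read_text(encoding="utf-8"))


def save_state(path: Path, enabled: bool) -> dict:
    """Write the two documented fields and return what was written."""
    state = {
        "enabled": enabled,
        "last_updated": datetime.now().isoformat(timespec="seconds"),
    }
    path.write_text(json.dumps(state, indent=2), encoding="utf-8")
    return state


# Usage against a temporary directory so no real skill file is touched.
with tempfile.TemporaryDirectory() as tmp:
    state_path = Path(tmp) / "voice_output_state.json"
    print(load_state(state_path)["enabled"])  # False
    save_state(state_path, True)
    print(load_state(state_path)["enabled"])  # True
```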

Changing the configuration

Change the default language

conversation = VoiceConversation(
    default_language='zh'  # switch to Mandarin
)

Change the output directory

conversation = VoiceConversation(
    output_dir='/path/to/output'
)

Change the model

conversation = VoiceConversation(
    model_dir='/path/to/model',
    whisper_model='base'  # use Whisper Base
)

Command-line tests

Test speech synthesis

cd /home/ubuntu/.openclaw/workspace/skills/cantonese-voice
source /home/ubuntu/CosyVoice/cosyvoice-env/bin/activate

# Cantonese mode (default)
python voice_tts.py --text "你好，今日天氣很好。" --output output.wav

# Mandarin mode
python voice_tts.py --text "你好，今天天气很好。" --output output.wav --no-cantonese

Test speech recognition

python voice_asr.py --audio user_voice.ogg --language yue

Test the conversation flow

python voice_integration.py --mode test

Test voice output control

python voice_output_manager.py

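Troubleshooting

Voice output produces no audio

Check the voice output state:

print(conversation.get_voice_output_status())

Speech comes out in Mandarin instead of Cantonese

Make sure use_cantonese=True is set:

synthesize_speech(text="你好", use_cantonese=True)

Low recognition accuracy

Make sure the correct language code (yue) is used
Check the audio quality
Consider using a better Whisper model

Support

For questions or suggestions, refer to the error records and lessons learned in the memory files.

Last updated: 2026-02-11 09:35
Version: 1.0
Compatible models: Qwen32B-Q4, Llama, Claude, OpenAI, etc.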

Contract & API

Machine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.

Missing · GITHUB OPENCLEW

Endpoints

Dossier API · Snapshot API · Contract API · Trust API

Contract coverage

Status

missing

Auth

None

Streaming

Not supported

Data region

Unspecified

Protocol support

OpenClaw: self-declared

Requires: none

Forbidden: none

Guardrails

Operational confidence: low

No positive guardrails captured.

Invocation examples

curl -s "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/snapshot"

curl -s "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/contract"

curl -s "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/trust"
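The same three calls can be wrapped in a retry loop that follows the retry policy listed in the Invocation Guide (3 attempts, backoff of 500, 1500, and 3500 ms). This is a sketch under assumptions: `fetch_with_retry`, `RetryableError`, and `endpoint_urls` are hypothetical names, and the actual HTTP call is injected so the example runs without network access.

```python
import time
from typing import Callable, List, Sequence

BASE_URL = "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill"
ENDPOINTS = ("snapshot", "contract", "trust")


class RetryableError(Exception):
    """Stand-in for HTTP 429, HTTP 503, or a network timeout."""


def endpoint_urls() -> List[str]:
    """Build the snapshot, contract, and trust URLs in evaluation order."""
    return [f"{BASE_URL}/{name}" for name in ENDPOINTS]


def fetch_with_retry(
    url: str,
    fetch: Callable[[str], str],
    backoff_ms: Sequence[int] = (500, 1500, 3500),
    max_attempts: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url), retrying retryable failures up to max_attempts times."""
    for attempt in range(max_attempts):
        try:
            return fetch(url)
        except RetryableError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last failure
            sleep(backoff_ms[attempt] / 1000)
    raise RuntimeError("max_attempts must be at least 1")


# Usage with a fake fetcher that fails twice, then succeeds.
attempts = []

def flaky_fetch(url: str) -> str:
    attempts.append(url)
    if len(attempts) < 3:
        raise RetryableError("simulated HTTP 503")
    return '{"ok": true}'

delays = []
body = fetch_with_retry(endpoint_urls()[0], flaky_fetch, sleep=delays.append)
print(body, delays)  # {"ok": true} [0.5, 1.5]
```

In real use, `fetch` could wrap an HTTP client and raise `RetryableError` for the three retryable conditions; with three attempts only the first two backoff values are ever slept.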

Reliability & Benchmarks

Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.

Missing · runtime-metrics

Trust signals

Handshake

UNKNOWN

Confidence

unknown

Attempts 30d

unknown

Fallback rate

unknown

Runtime metrics

Observed P50

unknown

Observed P95

unknown

Rate limit

unknown

Estimated cost

unknown

Do not use if

Contract metadata is missing or unavailable for deterministic execution.

No benchmark suites or observed failure patterns are available.

Media & Demo

Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.

Missing · no-media

No screenshots, media assets, or demo links are available.

Related Agents

Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.

Self-declared · protocol-neighbors

GITHUB_REPOS · activepieces

Rank

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

Traction

No public download signal

Freshness

Updated 2d ago

OPENCLAW

GITHUB_REPOS · cherry-studio

Rank

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

Traction

No public download signal

Freshness

Updated 6d ago

MCP · OPENCLAW

GITHUB_REPOS · AionUi

Rank

Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!

Traction

No public download signal

Freshness

Updated 6d ago

MCP · OPENCLAW

GITHUB_REPOS · CopilotKit

Rank

The Frontend for Agents & Generative UI. React + Angular

Traction

No public download signal

Freshness

Updated 23d ago

OPENCLAW

Machine Appendix

Contract JSON

{
  "contractStatus": "missing",
  "authModes": [],
  "requires": [],
  "forbidden": [],
  "supportsMcp": false,
  "supportsA2a": false,
  "supportsStreaming": false,
  "inputSchemaRef": null,
  "outputSchemaRef": null,
  "dataRegion": null,
  "contractUpdatedAt": null,
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Invocation Guide

{
  "preferredApi": {
    "snapshotUrl": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/snapshot",
    "contractUrl": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/contract",
    "trustUrl": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/trust"
  },
  "curlExamples": [
    "curl -s \"https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/snapshot\"",
    "curl -s \"https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/contract\"",
    "curl -s \"https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/trust\""
  ],
  "jsonRequestTemplate": {
    "query": "summarize this repo",
    "constraints": {
      "maxLatencyMs": 2000,
      "protocolPreference": [
        "OPENCLEW"
      ]
    }
  },
  "jsonResponseTemplate": {
    "ok": true,
    "result": {
      "summary": "...",
      "confidence": 0.9
    },
    "meta": {
      "source": "GITHUB_OPENCLEW",
      "generatedAt": "2026-04-17T05:29:40.114Z"
    }
  },
  "retryPolicy": {
    "maxAttempts": 3,
    "backoffMs": [
      500,
      1500,
      3500
    ],
    "retryableConditions": [
      "HTTP_429",
      "HTTP_503",
      "NETWORK_TIMEOUT"
    ]
  }
}

Trust JSON

{
  "status": "unavailable",
  "handshakeStatus": "UNKNOWN",
  "verificationFreshnessHours": null,
  "reputationScore": null,
  "p95LatencyMs": null,
  "successRate30d": null,
  "fallbackRate": null,
  "attempts30d": null,
  "trustUpdatedAt": null,
  "trustConfidence": "unknown",
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}
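The Contract JSON and Trust JSON can be combined into a simple pre-use gate. The sketch below is illustrative: `blocking_reasons` is a hypothetical helper, and it checks only the field values that appear on this page ("missing", "unavailable", "UNKNOWN"); other possible values of these fields are not documented here.

```python
from typing import Any, Dict, List


def blocking_reasons(contract: Dict[str, Any], trust: Dict[str, Any]) -> List[str]:
    """List the reasons this agent should not yet be recommended or executed."""
    reasons = []
    if contract.get("contractStatus") == "missing":
        reasons.append("contract metadata is missing")
    if trust.get("status") == "unavailable":
        reasons.append("trust telemetry is unavailable")
    if trust.get("handshakeStatus") == "UNKNOWN":
        reasons.append("handshake status is unknown")
    return reasons


# Values copied from the Contract JSON and Trust JSON above (abbreviated).
contract = {"contractStatus": "missing", "authModes": [], "supportsStreaming": False}
trust = {"status": "unavailable", "handshakeStatus": "UNKNOWN", "trustConfidence": "unknown"}

for reason in blocking_reasons(contract, trust):
    print(reason)
```

An empty list means none of the three checks fired; it does not by itself establish that the agent is safe to run.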

Capability Matrix

{
  "rows": [
    {
      "key": "OPENCLEW",
      "type": "protocol",
      "support": "unknown",
      "confidenceSource": "profile",
      "notes": "Listed on profile"
    }
  ],
  "flattenedTokens": "protocol:OPENCLEW|unknown|profile"
}

Facts JSON

[
  {
    "factKey": "docs_crawl",
    "category": "integration",
    "label": "Crawlable docs",
    "value": "6 indexed pages on the official domain",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  },
  {
    "factKey": "vendor",
    "category": "vendor",
    "label": "Vendor",
    "value": "18gogogo",
    "href": "https://github.com/18gogogo/cantonese-voice-skill",
    "sourceUrl": "https://github.com/18gogogo/cantonese-voice-skill",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-02-25T02:24:28.141Z",
    "isPublic": true
  },
  {
    "factKey": "protocols",
    "category": "compatibility",
    "label": "Protocol compatibility",
    "value": "OpenClaw",
    "href": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/contract",
    "sourceUrl": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/contract",
    "sourceType": "contract",
    "confidence": "medium",
    "observedAt": "2026-02-25T02:24:28.141Z",
    "isPublic": true
  },
  {
    "factKey": "handshake_status",
    "category": "security",
    "label": "Handshake status",
    "value": "UNKNOWN",
    "href": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/trust",
    "sourceUrl": "https://xpersona.co/api/v1/agents/18gogogo-cantonese-voice-skill/trust",
    "sourceType": "trust",
    "confidence": "medium",
    "observedAt": null,
    "isPublic": true
  }
]

Change Events JSON

[
  {
    "eventType": "docs_update",
    "title": "Docs refreshed: Sign in to GitHub · GitHub",
    "description": "Fresh crawlable documentation was indexed for the official domain.",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  }
]
