Crawler Summary

audio-to-subtitles answer-first brief

Convert audio files to subtitle files using OpenAI Whisper AI transcription. Generates SRT, FCPXML (for Final Cut Pro), and plain text formats. Use when you need to create subtitles, captions, or transcripts from audio or video files. Capability contract not published. No trust telemetry is available yet. Last updated 4/14/2026.

Freshness

Last checked 4/14/2026

Best For

audio-to-subtitles is best for SRT subtitle workflows where OpenClaw compatibility matters.

Not Ideal For

Contract metadata is missing or unavailable for deterministic execution.

Evidence Sources Checked

editorial-content, GITHUB OPENCLEW, runtime-metrics, public facts pack

Agent Dossier · GitHub · Safety: 94/100

audio-to-subtitles

OpenClaw · self-declared

Public facts

4

Change events

1

Artifacts

0

Freshness

Apr 14, 2026

Verified · editorial-content · No verified compatibility signals

Capability contract not published. No trust telemetry is available yet. Last updated 4/14/2026.

Trust evidence available

Trust score

Unknown

Compatibility

OpenClaw

Freshness

Apr 14, 2026

Vendor

Openai

Artifacts

0

Benchmarks

0

Last release

Unpublished

Executive Summary

Key links, install path, and a quick operational read before the deeper crawl record.

Verified · editorial-content

Summary

Capability contract not published. No trust telemetry is available yet. Last updated 4/14/2026.

Setup snapshot

git clone https://github.com/kantylee/audio-to-subtitles.git
  1. Setup complexity is LOW. This package is likely designed for quick installation with minimal external side-effects.

  2. Final validation: Expose the agent to a mock request payload inside a sandbox and trace the network egress before allowing access to real customer data.

Evidence Ledger

Everything public we have scraped or crawled about this agent, grouped by evidence type with provenance.

Verified · editorial-content

Vendor (1)

Vendor

Openai

profile · medium
Observed Apr 14, 2026

Compatibility (1)

Protocol compatibility

OpenClaw

contract · medium
Observed Apr 14, 2026

Security (1)

Handshake status

UNKNOWN

trust · medium
Observed unknown

Integration (1)

Crawlable docs

6 indexed pages on the official domain

search_document · medium
Observed Apr 15, 2026

Release & Crawl Timeline

Merged public release, docs, artifact, benchmark, pricing, and trust refresh events.

Self-declared · agent-index

Artifacts Archive

Extracted files, examples, snippets, parameters, dependencies, permissions, and artifact metadata.

Self-declared · GITHUB OPENCLEW

Extracted files

0

Examples

6

Snippets

0

Languages

typescript

Parameters

Executable Examples

bash

pip3 install openai-whisper

bash

{baseDir}/scripts/audio-to-subtitles.py audio.mp3

bash

{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -f srt
{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -f fcpxml

bash

{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -o ~/Desktop/my-subtitles.srt

text

audio-to-subtitles.py [-h] [-o OUTPUT] [-f {srt,fcpxml,txt,all}]
                      [-m {tiny,base,small,medium,large}]
                      [-l LANGUAGE] [--fps FPS]
                      input

text

1
00:00:01,000 --> 00:00:04,000
Hello, this is the first subtitle.

2
00:00:05,000 --> 00:00:08,000
This is the second subtitle.

Docs & README

Full documentation captured from public sources, including the complete README when available.

Self-declared · GITHUB OPENCLEW

Docs source

GITHUB OPENCLEW

Editorial quality

ready

Full README

name: audio-to-subtitles description: Convert audio files to subtitle files using OpenAI Whisper AI transcription. Generates SRT, FCPXML (for Final Cut Pro), and plain text formats. Use when you need to create subtitles, captions, or transcripts from audio or video files. homepage: https://github.com/openai/whisper metadata: { "openclaw": { "emoji": "🎬", "requires": { "bins": ["python3"], "python": ["openai-whisper"] }, "install": [ { "id": "pip", "kind": "pip", "package": "openai-whisper", "label": "Install Whisper (pip3 install openai-whisper)", }, ], }, }

Audio to Subtitles Generator

Convert audio files to subtitle files using OpenAI Whisper AI transcription. Supports SRT and FCPXML formats for use in Final Cut Pro and other video editing software.

Features

  • 🤖 AI-powered transcription using OpenAI Whisper
  • 📝 Multiple output formats: SRT, FCPXML, plain text
  • 🎯 Optimized for Final Cut Pro workflow
  • 🌍 Automatic language detection
  • ⚡ Fast processing with GPU acceleration (if available)

Prerequisites

pip3 install openai-whisper

Note: First run will download the AI model (base model ~150MB, large model ~3GB).
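Before the first run it can help to confirm that both declared requirements are present. The following is a minimal sketch, not part of the skill itself: the function name `missing_requirements` is hypothetical, and it assumes the `openai-whisper` package is importable under the module name `whisper`.

```python
import importlib.util
import shutil


def missing_requirements(bins=("python3",), modules=("whisper",)):
    """Return human-readable names for every requirement that was not found."""
    missing = []
    for name in bins:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            missing.append(f"binary: {name}")
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            missing.append(f"python module: {name}")
    return missing


if __name__ == "__main__":
    problems = missing_requirements()
    print("All requirements found." if not problems else "\n".join(problems))
```

An empty list means both the `python3` binary and the `whisper` module were found.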

Quick Start

Generate all formats

{baseDir}/scripts/audio-to-subtitles.py audio.mp3

This creates:

  • audio.srt - Standard subtitle format
  • audio.fcpxml - Final Cut Pro import format
  • audio.txt - Plain text transcript
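The naming pattern above (same base name, new extension) can be sketched as follows. This is an illustration only: `default_outputs` is a hypothetical helper, and it assumes the outputs land next to the input file, which the README does not state explicitly.

```python
from pathlib import Path

# Formats produced when -f all is used, per the list above.
FORMATS = ("srt", "fcpxml", "txt")


def default_outputs(input_path, fmt="all"):
    """Return the output paths implied by the input name and chosen format."""
    source = Path(input_path)
    wanted = FORMATS if fmt == "all" else (fmt,)
    # with_suffix swaps only the final extension: audio.mp3 -> audio.srt
    return [source.with_suffix(f".{ext}") for ext in wanted]


if __name__ == "__main__":
    for path in default_outputs("audio.mp3"):
        print(path)
```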

Specific format only

{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -f srt
{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -f fcpxml

Custom output location

{baseDir}/scripts/audio-to-subtitles.py audio.mp3 -o ~/Desktop/my-subtitles.srt

Usage

audio-to-subtitles.py [-h] [-o OUTPUT] [-f {srt,fcpxml,txt,all}]
                      [-m {tiny,base,small,medium,large}]
                      [-l LANGUAGE] [--fps FPS]
                      input

Options

| Option | Description | Default |
|--------|-------------|---------|
| input | Audio file path | (required) |
| -o, --output | Output file path | Auto-generated |
| -f, --format | Output format (srt/fcpxml/txt/all) | all |
| -m, --model | Whisper model size | base |
| -l, --language | Language code (zh/en/ja/ko) | Auto-detect |
| --fps | Frame rate for FCPXML | 30 |
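The usage line and option table map naturally onto an `argparse` definition. The sketch below is a reconstruction from the documented interface, not the script's actual source; in particular, treating `--fps` as a float and using `None` for auto-detected language are assumptions.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented command-line interface."""
    parser = argparse.ArgumentParser(prog="audio-to-subtitles.py")
    parser.add_argument("input", help="Audio file path")
    parser.add_argument("-o", "--output", help="Output file path (auto-generated if omitted)")
    parser.add_argument("-f", "--format", choices=["srt", "fcpxml", "txt", "all"],
                        default="all", help="Output format")
    parser.add_argument("-m", "--model", choices=["tiny", "base", "small", "medium", "large"],
                        default="base", help="Whisper model size")
    # None stands in for "auto-detect" here (an assumption).
    parser.add_argument("-l", "--language", default=None, help="Language code, e.g. zh or en")
    # The value type of --fps is not documented; float is assumed.
    parser.add_argument("--fps", type=float, default=30, help="Frame rate for FCPXML")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["interview.m4a", "-m", "large", "-l", "zh"])
    print(args)
```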

Model Sizes

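| Model | Speed | Accuracy | Parameters | Best For |
|-------|-------|----------|------------|----------|
| tiny | Fastest | Basic | 39 M | Quick tests |
| base | Fast | Good | 74 M | General use ⭐ |
| small | Medium | Better | 244 M | Better accuracy |
| medium | Slow | Great | 769 M | Professional |
| large | Slowest | Best | 1550 M | Maximum quality |

These figures are parameter counts as listed in the Whisper README, not download sizes. The downloaded model files are larger (see the note under Prerequisites).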

Supported Audio Formats

  • MP3, WAV, M4A, FLAC, OGG, OPUS
  • AAC, WMA
  • Video files (MP4, MOV) - extracts audio automatically

Output Formats

SRT (SubRip Subtitle)

Standard subtitle format supported by:

  • Final Cut Pro (File → Import → Captions)
  • Adobe Premiere Pro
  • DaVinci Resolve
  • YouTube, Vimeo
  • VLC, IINA, and most players

Format:

1
00:00:01,000 --> 00:00:04,000
Hello, this is the first subtitle.

2
00:00:05,000 --> 00:00:08,000
This is the second subtitle.
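The SRT layout above (index, time range with a comma before the milliseconds, text, blank line) can be produced with a few lines of code. This is a minimal sketch rather than the script's own implementation: the helper names are hypothetical, and the input is assumed to be a list of dicts with `start`, `end`, and `text` keys, which is the shape of the `segments` list in the result returned by Whisper's `transcribe`.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render segments as numbered SRT blocks separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 1.0, "end": 4.0, "text": " Hello, this is the first subtitle."},
        {"start": 5.0, "end": 8.0, "text": " This is the second subtitle."},
    ]
    print(segments_to_srt(demo))
```

Working in whole milliseconds before splitting into fields avoids rounding artifacts such as a millisecond field of 1000.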

FCPXML (Final Cut Pro XML)

Native Final Cut Pro format that imports as titles.

Usage in Final Cut Pro:

  1. File → Import → XML...
  2. Select the .fcpxml file
  3. Subtitles appear as titles in your timeline
  4. Customize font, size, position as needed
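Final Cut Pro timelines are frame-based, which is presumably why the script accepts `--fps`. FCPXML writes time values as rational numbers of seconds with an `s` suffix, so a subtitle boundary is typically snapped to the nearest frame first. The sketch below shows one way to do that conversion; `seconds_to_fcpxml_time` is a hypothetical helper, it is not taken from the script, and it deliberately handles only whole-number frame rates (fractional rates such as 29.97 need a different time base).

```python
def seconds_to_fcpxml_time(seconds, fps=30):
    """Snap a time to the nearest frame and express it as 'frames/fps' seconds."""
    if fps <= 0 or int(fps) != fps:
        raise ValueError("this sketch only handles positive whole-number frame rates")
    frames = round(seconds * fps)
    return f"{frames}/{int(fps)}s"


if __name__ == "__main__":
    # A subtitle that starts at 1.0 s and ends at 4.0 s on a 30 fps timeline.
    print(seconds_to_fcpxml_time(1.0), seconds_to_fcpxml_time(4.0))
```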

TXT (Plain Text)

Simple transcript without timestamps.

Examples

Basic transcription

{baseDir}/scripts/audio-to-subtitles.py recording.mp3

Chinese audio with large model

{baseDir}/scripts/audio-to-subtitles.py interview.m4a -m large -l zh

YouTube video to subtitles

# First extract audio
video-audio-extractor "https://youtube.com/watch?v=..." -o video.mp3

# Then generate subtitles
audio-to-subtitles.py video.mp3

Bilibili video workflow

# Extract audio from Bilibili
yt-dlp "https://bilibili.com/video/..." --extract-audio --audio-format mp3

# Generate subtitles
audio-to-subtitles.py *.mp3 -f fcpxml
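The usage line lists a single positional `input`, so the `*.mp3` glob above is only safe when it matches exactly one file. When a download produces several files, one option is to call the script once per file. The following is a sketch under assumptions: `audio-to-subtitles.py` is taken to be on `PATH` as in the examples above, and `build_command` and `convert_all` are hypothetical helper names.

```python
import subprocess
from pathlib import Path

SCRIPT = "audio-to-subtitles.py"  # assumed to be on PATH, as in the examples above


def build_command(audio_path, fmt="fcpxml"):
    """Build the argument list for one invocation of the script."""
    return [SCRIPT, str(audio_path), "-f", fmt]


def convert_all(directory=".", pattern="*.mp3", fmt="fcpxml", run=subprocess.run):
    """Run the script once per matching file and collect the exit codes."""
    results = []
    for audio in sorted(Path(directory).glob(pattern)):
        # check=False so one failing file does not stop the rest of the batch.
        completed = run(build_command(audio, fmt), check=False)
        results.append((audio, completed.returncode))
    return results


if __name__ == "__main__":
    for audio, code in convert_all():
        print(f"{audio}: exit code {code}")
```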

Language Codes

Common language codes:

  • zh - Chinese (中文)
  • en - English
  • ja - Japanese
  • ko - Korean
  • es - Spanish
  • fr - French
  • de - German

Omit the -l option for automatic detection.

Performance Tips

  1. Use smaller models for quick drafts: -m tiny or -m base
  2. Use larger models for final output: -m medium or -m large
  3. Specify language for better accuracy: -l zh
  4. GPU acceleration: Whisper automatically uses GPU if available

Troubleshooting

Slow transcription

  • Use smaller model: -m base instead of -m large
  • Check if GPU is being used (should show in console)

Poor accuracy

  • Try larger model: -m medium or -m large
  • Specify language explicitly: -l zh
  • Check audio quality (noisy audio reduces accuracy)

Out of memory

  • Use smaller model: -m tiny or -m base
  • Process shorter audio segments
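If a long recording is split into shorter segments, each segment's subtitles start again at zero and have to be shifted back to their position in the full recording. The sketch below covers only that bookkeeping; cutting the audio itself needs an external tool and is not shown. The helper names and the 10-minute default are illustrative assumptions, and segments are assumed to be dicts with `start` and `end` in seconds.

```python
def chunk_bounds(total_seconds, chunk_seconds=600.0):
    """Split [0, total_seconds) into consecutive (start, end) windows."""
    if total_seconds < 0 or chunk_seconds <= 0:
        raise ValueError("total_seconds must be >= 0 and chunk_seconds > 0")
    bounds = []
    start = 0.0
    while start < total_seconds:
        end = min(start + chunk_seconds, total_seconds)
        bounds.append((start, end))
        start = end
    return bounds


def shift_segments(segments, offset):
    """Return copies of the segments with start/end moved by offset seconds."""
    return [
        {**seg, "start": seg["start"] + offset, "end": seg["end"] + offset}
        for seg in segments
    ]


if __name__ == "__main__":
    # A 25-minute recording processed in 10-minute pieces.
    for start, end in chunk_bounds(1500, 600):
        print(start, end)
```

Cutting at fixed times can split a sentence in two, so in practice the boundaries are better placed in pauses.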

Missing dependencies

pip3 install --upgrade openai-whisper

Workflow Example

Complete workflow from video to Final Cut Pro subtitles:

# Step 1: Extract audio from video
video-audio-extractor video.mp4 -o audio.mp3

# Step 2: Generate subtitles
audio-to-subtitles.py audio.mp3 -f fcpxml

# Step 3: Import to Final Cut Pro
# File → Import → XML → select audio.fcpxml

Contract & API

Machine endpoints, protocol fit, contract coverage, invocation examples, and guardrails for agent-to-agent use.

Missing · GITHUB OPENCLEW

Contract coverage

Status

missing

Auth

None

Streaming

No

Data region

Unspecified

Protocol support

OpenClaw: self-declared

Requires: none

Forbidden: none

Guardrails

Operational confidence: low

No positive guardrails captured.

Invocation examples

curl -s "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/snapshot"
curl -s "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/contract"
curl -s "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/trust"

Reliability & Benchmarks

Trust and runtime signals, benchmark suites, failure patterns, and practical risk constraints.

Missing · runtime-metrics

Trust signals

Handshake

UNKNOWN

Confidence

unknown

Attempts 30d

unknown

Fallback rate

unknown

Runtime metrics

Observed P50

unknown

Observed P95

unknown

Rate limit

unknown

Estimated cost

unknown

Do not use if

Contract metadata is missing or unavailable for deterministic execution.
No benchmark suites or observed failure patterns are available.

Media & Demo

Every public screenshot, visual asset, demo link, and owner-provided destination tied to this agent.

Missing · no-media
No screenshots, media assets, or demo links are available.

Related Agents

Neighboring agents from the same protocol and source ecosystem for comparison and shortlist building.

Self-declared · protocol-neighbors

GITHUB_REPOS · activepieces

Rank

70

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

Traction

No public download signal

Freshness

Updated 2d ago

OPENCLAW

GITHUB_REPOS · cherry-studio

Rank

70

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

Traction

No public download signal

Freshness

Updated 5d ago

MCP · OPENCLAW

GITHUB_REPOS · AionUi

Rank

70

Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you like it!

Traction

No public download signal

Freshness

Updated 6d ago

MCP · OPENCLAW

GITHUB_REPOS · CopilotKit

Rank

70

The Frontend for Agents & Generative UI. React + Angular

Traction

No public download signal

Freshness

Updated 23d ago

OPENCLAW

Machine Appendix

Contract JSON

{
  "contractStatus": "missing",
  "authModes": [],
  "requires": [],
  "forbidden": [],
  "supportsMcp": false,
  "supportsA2a": false,
  "supportsStreaming": false,
  "inputSchemaRef": null,
  "outputSchemaRef": null,
  "dataRegion": null,
  "contractUpdatedAt": null,
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Invocation Guide

{
  "preferredApi": {
    "snapshotUrl": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/snapshot",
    "contractUrl": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/contract",
    "trustUrl": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/trust"
  },
  "curlExamples": [
    "curl -s \"https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/snapshot\"",
    "curl -s \"https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/contract\"",
    "curl -s \"https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/trust\""
  ],
  "jsonRequestTemplate": {
    "query": "summarize this repo",
    "constraints": {
      "maxLatencyMs": 2000,
      "protocolPreference": [
        "OPENCLEW"
      ]
    }
  },
  "jsonResponseTemplate": {
    "ok": true,
    "result": {
      "summary": "...",
      "confidence": 0.9
    },
    "meta": {
      "source": "GITHUB_OPENCLEW",
      "generatedAt": "2026-04-16T23:45:56.971Z"
    }
  },
  "retryPolicy": {
    "maxAttempts": 3,
    "backoffMs": [
      500,
      1500,
      3500
    ],
    "retryableConditions": [
      "HTTP_429",
      "HTTP_503",
      "NETWORK_TIMEOUT"
    ]
  }
}
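The `retryPolicy` above can be read as: try up to three times, retry only on HTTP 429, HTTP 503, or a network timeout, and wait according to `backoffMs` between attempts. A minimal sketch of that reading follows. It is not an official client: `fetch_with_retry` is a hypothetical name, the 10-second timeout is an assumption, every `URLError` is treated as a network timeout for simplicity, and the third backoff value goes unused because no fourth attempt follows.

```python
import time
import urllib.error
import urllib.request

MAX_ATTEMPTS = 3
BACKOFF_MS = [500, 1500, 3500]
RETRYABLE_STATUS = {429, 503}


def fetch_with_retry(url, opener=urllib.request.urlopen, sleep=time.sleep):
    """GET a URL, retrying on the conditions named in the retry policy."""
    last_error = None
    for attempt in range(MAX_ATTEMPTS):
        try:
            with opener(url, timeout=10) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # HTTPError subclasses URLError, so it must be handled first.
            if err.code not in RETRYABLE_STATUS:
                raise
            last_error = err
        except (urllib.error.URLError, TimeoutError) as err:
            # Assumption: any connection-level failure counts as NETWORK_TIMEOUT.
            last_error = err
        if attempt < MAX_ATTEMPTS - 1:
            sleep(BACKOFF_MS[attempt] / 1000)
    raise last_error


if __name__ == "__main__":
    url = "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/snapshot"
    print(fetch_with_retry(url)[:200])
```

The `opener` and `sleep` parameters exist so the retry logic can be exercised without real network calls or delays.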

Trust JSON

{
  "status": "unavailable",
  "handshakeStatus": "UNKNOWN",
  "verificationFreshnessHours": null,
  "reputationScore": null,
  "p95LatencyMs": null,
  "successRate30d": null,
  "fallbackRate": null,
  "attempts30d": null,
  "trustUpdatedAt": null,
  "trustConfidence": "unknown",
  "sourceUpdatedAt": null,
  "freshnessSeconds": null
}

Capability Matrix

{
  "rows": [
    {
      "key": "OPENCLEW",
      "type": "protocol",
      "support": "unknown",
      "confidenceSource": "profile",
      "notes": "Listed on profile"
    },
    {
      "key": "srt",
      "type": "capability",
      "support": "supported",
      "confidenceSource": "profile",
      "notes": "Declared in agent profile metadata"
    }
  ],
  "flattenedTokens": "protocol:OPENCLEW|unknown|profile capability:srt|supported|profile"
}

Facts JSON

[
  {
    "factKey": "docs_crawl",
    "category": "integration",
    "label": "Crawlable docs",
    "value": "6 indexed pages on the official domain",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  },
  {
    "factKey": "vendor",
    "category": "vendor",
    "label": "Vendor",
    "value": "Openai",
    "href": "https://github.com/openai/whisper",
    "sourceUrl": "https://github.com/openai/whisper",
    "sourceType": "profile",
    "confidence": "medium",
    "observedAt": "2026-04-14T22:24:53.046Z",
    "isPublic": true
  },
  {
    "factKey": "protocols",
    "category": "compatibility",
    "label": "Protocol compatibility",
    "value": "OpenClaw",
    "href": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/contract",
    "sourceUrl": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/contract",
    "sourceType": "contract",
    "confidence": "medium",
    "observedAt": "2026-04-14T22:24:53.046Z",
    "isPublic": true
  },
  {
    "factKey": "handshake_status",
    "category": "security",
    "label": "Handshake status",
    "value": "UNKNOWN",
    "href": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/trust",
    "sourceUrl": "https://xpersona.co/api/v1/agents/kantylee-audio-to-subtitles/trust",
    "sourceType": "trust",
    "confidence": "medium",
    "observedAt": null,
    "isPublic": true
  }
]

Change Events JSON

[
  {
    "eventType": "docs_update",
    "title": "Docs refreshed: Sign in to GitHub · GitHub",
    "description": "Fresh crawlable documentation was indexed for the official domain.",
    "href": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceUrl": "https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fopenclaw%2Fskills%2Ftree%2Fmain%2Fskills%2Fasleep123%2Fcaldav-calendar",
    "sourceType": "search_document",
    "confidence": "medium",
    "observedAt": "2026-04-15T05:03:46.393Z",
    "isPublic": true
  }
]
