← 返回
未分类 中文

Bilibili Audio Transcribe

Download audio from Bilibili or b23.tv links and transcribe it into txt, srt, and segment JSON with yt-dlp, ffmpeg, and faster-whisper. Use when a user asks...
从Bilibili或b23.tv链接下载音频并使用yt-dlp、ffmpeg和faster-whisper转写为txt、srt和分段JSON,适用于用户请求时。
yizh4ng yizh4ng 来源
未分类 clawhub v0.1.0 1 版本 100000 Key: 无需
★ 2
Stars
📥 425
下载
💾 0
安装
1
版本
#latest

概述

Bilibili Audio Transcribe

Convert a Bilibili link into local transcript files. Prefer this skill for Bilibili and b23.tv URLs only; do not use it for YouTube or generic web pages.

Quick start

  1. Ensure ffmpeg and ffprobe are on PATH.
  2. If Python dependencies are missing, run scripts/bootstrap_env.sh or install yt-dlp and faster-whisper manually.
  3. Run scripts/transcribe_bilibili.py with the target URL.
  4. Return the generated transcript artifacts or summarize them if the user asked for analysis rather than raw text.

Example:

python scripts/transcribe_bilibili.py "https://b23.tv/SSx810h" \
  --out-dir ./downloads/bilibili-audio \
  --model-size base \
  --beam-size 3

Workflow

1. Validate the request

Accept only bilibili.com or b23.tv URLs. If the URL points elsewhere, stop and say this skill is the wrong tool.

2. Prefer the bundled script

Use scripts/transcribe_bilibili.py instead of rewriting yt-dlp / whisper glue code in the session. The script:

  • downloads the best available audio
  • probes audio duration with ffprobe
  • transcribes with faster-whisper
  • writes .txt, .srt, and .segments.json
  • prints coarse ETA / progress during ASR

3. Use sensible defaults

Default to:

  • --model-size base
  • --beam-size 3
  • --language zh
  • VAD enabled

These defaults are optimized for Chinese Bilibili speech. If the video is clearly non-Chinese or mixed-language, rerun with --language auto or a specific language code.

4. Keep outputs predictable

By default the script writes into downloads/bilibili-audio/ relative to the current working directory. Keep all three artifacts unless the user explicitly asks for fewer outputs:

  • transcript text: .txt
  • subtitles: .srt
  • segment metadata: .segments.json

5. Handle failures directly

If a run fails:

  • missing ffmpeg / ffprobe → install system dependency first
  • missing Python packages → run scripts/bootstrap_env.sh
  • extractor or redirect issues → retry with the resolved canonical Bilibili URL
  • poor transcript quality → rerun with --language auto, a larger model, or --no-vad when speech is clipped

Read references/troubleshooting.md when dependency setup or extraction fails.

Resources

scripts/

  • transcribe_bilibili.py — main downloader + ASR pipeline
  • bootstrap_env.sh — create a virtualenv and install Python dependencies
  • requirements.txt — Python package list for the bootstrap script

references/

  • troubleshooting.md — dependency and extraction failure playbook

版本历史

共 1 个版本

  • v0.1.0 当前
    2026-05-03 08:52 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 431 📥 117,546
design-media

Video Frames

steipete
使用 ffmpeg 从视频中提取帧或短片。
★ 136 📥 53,092
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 223 📥 48,393