← 返回
内容创作 中文

YouTube Video Transcript

Fetch, summarize, and save YouTube transcripts with timestamp navigation, chapter detection, and searchable content.
{"translation": "获取、摘要并保存 YouTube 字幕,支持时间戳导航、章节检测和内容搜索。"}
ivangdavila
内容创作 clawhub v1.0.0 1 版本 99781.2 Key: 无需
★ 1
Stars
📥 892
下载
💾 22
安装
1
版本
#latest

概述

Most YouTube transcript tools either require paid APIs, use suspicious proxies, or just dump raw text without structure. This skill extracts transcripts locally using yt-dlp, preserves timestamps for navigation, detects chapters automatically, and exports to any format you need.

When to Use

User shares a YouTube link and wants to read instead of watch. User asks what someone says about a topic at a specific moment. User needs to extract quotes with timestamps for research or content creation. User wants to summarize a video or search within its content.

How It Works

         ┌──────────────────────────────────────────────┐
         │           YOUTUBE TRANSCRIPT FLOW            │
         └──────────────────────────────────────────────┘
                              │
         ┌────────────────────┼────────────────────┐
         ▼                    ▼                    ▼
    ┌─────────┐         ┌──────────┐         ┌─────────┐
    │  VIDEO  │         │ METADATA │         │SUBTITLES│
    │   URL   │         │  FETCH   │         │  CHECK  │
    └────┬────┘         └────┬─────┘         └────┬────┘
         │                   │                    │
         │  youtube.com/     │  Title, duration,  │  Manual first,
         │  watch?v=...      │  chapters, lang    │  auto fallback
         │                   │                    │
         └───────────────────┴────────────────────┘
                              │
                              ▼
                    ┌─────────────────┐
                    │ EXTRACT + CLEAN │
                    │ VTT → Markdown  │
                    │ with timestamps │
                    └────────┬────────┘
                              │
              ┌───────────────┼───────────────┐
              ▼               ▼               ▼
        ┌──────────┐   ┌───────────┐   ┌──────────┐
        │ CHAPTERS │   │  SEARCH   │   │  EXPORT  │
        │ detected │   │ by topic  │   │ MD/SRT/  │
        │ or smart │   │ timestamp │   │ TXT/JSON │
        └──────────┘   └───────────┘   └──────────┘

The Extraction Process

1. 📋 Get Metadata First

Always fetch video info before extracting subtitles:

yt-dlp -j "VIDEO_URL"

This gives you title, duration, official chapters, and available languages. Use it to confirm the right video and check what subtitles exist.

2. 📝 Prefer Manual Subtitles

Manual (uploaded) subtitles are higher quality than auto-generated:

# Try manual first
yt-dlp --write-sub --sub-lang en --skip-download "VIDEO_URL"

# Fall back to auto-generated if manual unavailable
yt-dlp --write-auto-sub --sub-lang en --skip-download "VIDEO_URL"

Auto-generated transcripts often have errors, missing punctuation, and wrong word boundaries. Manual subtitles are human-verified.

3. 🕐 Preserve Timestamps Always

Every segment must include timestamps. Format: [HH:MM:SS] or [MM:SS] for videos under 1 hour.

Why this matters: Users need to jump to specific moments. "Take me to where they discuss pricing" requires knowing the timestamp.

Output format:

[00:00] Welcome to this video about machine learning
[00:15] Today we'll cover three main topics
[00:30] First, let's talk about neural networks

Chapter Detection

From Video Markers

Many videos have chapter markers embedded. Extract from metadata:

yt-dlp -j "VIDEO_URL" | jq '.chapters'

Smart Detection (No Markers)

When video lacks chapters, detect natural breaks from transcript:

  • Topic changes (semantic shift in content)
  • Speaker changes (different voice patterns)
  • Explicit transitions ("Now let's talk about...", "Moving on...")
  • Long pauses between segments

Search Within Transcripts

When user asks "where do they talk about X":

  1. Search transcript for keywords and semantic matches
  2. Return segments with timestamps
  3. Include surrounding context (10-15 seconds before/after)

Response format:

Found 3 mentions of "machine learning":

[05:23] "...this is where machine learning really shines..."
Context: Discussing data processing approaches

[12:45] "...traditional methods vs machine learning..."
Context: Comparison section

Generate clickable links: https://youtube.com/watch?v=VIDEO_ID&t=323

Architecture

Memory lives in ~/youtube-video-transcript/. See memory-template.md for structure.

~/youtube-video-transcript/
├── memory.md          # Preferences + recent videos
├── videos/            # Cached transcripts (with consent)
│   └── {video_id}.md  # Individual video data
└── exports/           # Exported files

Quick Reference

TopicFile
-------------
Setup processsetup.md
Memory templatememory-template.md
Advanced patternspatterns.md

Core Rules

1. Metadata Before Extraction

Always run yt-dlp -j URL first. This confirms the video, shows available languages, and reveals official chapters. Never extract blind.

2. Manual Over Auto

Subtitle TypeQualityWhen to Use
-------------------------------------
ManualHighAlways try first
Auto-generatedMediumFallback only

Check with yt-dlp --list-subs URL for unfamiliar channels.

3. Timestamps Are Sacred

Never strip timestamps during any operation. They enable navigation, citation, and deep linking into the video.

4. Cache With Consent

User ResponseAction
-----------------------
"Yes, save it"Cache to ~/youtube-video-transcript/videos/
"No thanks"Don't cache, show once
Not asked yetAsk after first extraction

Always tell user where files are saved and offer to show or delete them.

5. Handle Multiple Languages

If user doesn't specify:

  1. Check available languages
  2. Prefer manual over auto
  3. Default to English
  4. Report which language was used
yt-dlp --list-subs "VIDEO_URL"

6. Quote Extraction Includes Context

When extracting quotes for research:

  • 10-15 seconds before/after for context
  • Exact timestamp for the quote start
  • Speaker identification if multiple speakers

7. Transparency on Quality

Subtitle TypeTell User
--------------------------
Manual"Using official subtitles"
Auto-generated"Using auto-generated (may have errors)"
None available"No subtitles found for this video"

Export Formats

FormatUse CaseCommand
---------------------------
MarkdownReading, notesDefault
SRTVideo editors--sub-format srt
Plain textSearch, grepStrip timestamps
JSONProgrammatic--write-info-json

Common Traps

TrapConsequencePrevention
-------------------------------
Not checking subtitles firstWasted time on unavailable videoAlways --list-subs first
Ignoring auto-generated qualityGarbage text with errorsPrefer manual, warn about auto
Losing timestampsCan't navigate videoNever strip in any operation
Extracting without metadataMissing title, chaptersAlways fetch -j first
Caching without consentPrivacy violationAsk before saving

Quick Commands

User SaysAction
-------------------
"Transcribe this video"Extract + display
"What do they say about X?"Search + timestamps
"Save this transcript"Cache with confirmation
"Export as SRT"Convert format
"Show saved videos"List ~/youtube-video-transcript/videos/
"Delete video X"Remove from cache

Security & Privacy

Data that stays local (with your consent):

  • Transcripts cached in ~/youtube-video-transcript/ (only if you agree)
  • Preferences stored locally (only after confirmation)
  • No external API calls beyond YouTube's public subtitle endpoints

Transparency guarantees:

  • Always asks before saving transcripts locally
  • Tells you where files are saved
  • Offers to show or delete saved data anytime

This skill does NOT:

  • Use proxy services or third-party APIs
  • Send your queries to external services
  • Store credentials or authentication
  • Save anything without your explicit consent

Related Skills

Install with clawhub install if user confirms:

  • summarizer — create summaries from any content
  • video-captions — generate and edit video subtitles
  • ffmpeg — advanced video and audio processing

Feedback

  • If useful: clawhub star youtube-video-transcript
  • Stay updated: clawhub sync

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 20:24 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,458
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,611
productivity

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 438 📥 147,453