Use a transcript-first workflow.
scripts/fetch_youtube.py --out to collect metadata and subtitles. If behind a proxy, add --proxy .
summary.json and the generated transcript file first.yt-dlp + a speech-to-text path (for example Whisper) before promising a result.For normal requests, return:
The fetch script writes an output directory containing:
summary.json — chosen subtitle file, title, uploader, duration, and extraction statustranscript.txt — plain text transcript when subtitles are availableyt-dlp (VTT/SRT)Read summary.json first to decide what to do next.
yt-dlp for metadata + subtitle extractiondeno as JS runtime (required by yt-dlp 2026+)ffmpeg for media conversion / optional frame extraction (optional)Basic extraction:
python3 scripts/fetch_youtube.py "<youtube-url>" --out /tmp/youtube-digest
With proxy:
python3 scripts/fetch_youtube.py "<youtube-url>" --proxy http://your-proxy:port --out /tmp/youtube-digest
Prefer specific subtitle languages:
python3 scripts/fetch_youtube.py "<youtube-url>" --langs zh.*,en.* --out /tmp/youtube-digest
yt-dlp is missing, stop and install it instead of improvising.references/install-and-deploy.md for deployment instructions.references/usage-patterns.md for output templates for summaries, translations, or Q&A.共 1 个版本