Fetch new episodes from the Hacker News Morning Brief podcast RSS feed, transcribe with Whisper, and archive as searchable markdown.
pip install openai-whisper)Run the main script to fetch and transcribe all new episodes:
bash scripts/fetch_and_transcribe.sh --archive ~/hn-podcast-archive
First run processes all episodes. Subsequent runs only process new ones (tracked via state.json).
| Flag | Default | Description |
|---|---|---|
| ------ | --------- | ------------- |
--feed URL | HN Morning Brief RSS | Podcast RSS feed URL |
--archive DIR | ./hn-podcast-archive | Archive root directory |
--model MODEL | turbo | Whisper model (tiny/base/small/medium/large/turbo) |
--limit N | 0 (all) | Max new episodes to process per run |
Point at any podcast RSS feed:
bash scripts/fetch_and_transcribe.sh --feed "https://example.com/podcast/feed.xml" --archive ./my-podcast-archive
Set up an OpenClaw cron job for daily checks:
See references/archive-layout.md for directory layout and state.json schema.
entrieswhisper to produce .txt transcripttranscript.md with title + date header~/.cache/whisper after first download--model tiny for speed, --model large for best accuracy共 1 个版本