← 返回
内容创作 Key 中文

Youtube Podcast Generator

Extracts the original text of a Youtube video and converts it into a multi-voice AI podcast using Gemini for script generation, OpenAI for TTS, and a local N...
从YouTube视频中提取原始文字,使用Gemini生成脚本、OpenAI进行语音合成,转换为多声部AI播客。
kaudata
内容创作 clawhub v3.0.0 2 版本 99844.5 Key: 需要
★ 1
Stars
📥 622
下载
💾 18
安装
2
版本
#language#latest#notebooklm-style#podcast#youtube

概述

YouTube Podcaster

This skill enables the automated conversion of YouTube videos into multi-host AI podcasts. It manages transcription, script generation via Gemini, and audio synthesis via OpenAI locally.

Security Setup

For maximum security, the backend server binds strictly to 127.0.0.1. It is not accessible from your local network or the internet.

  1. Install Dependencies: You must run the install command once before the first use. Say:

Run the npm install command for the youtube-podcaster skill.

  1. Credentials: Place your Gemini API Key and OpenAI API Key in the .env file within the skill folder (skills/youtube-podcaster/.env) using the variable names GEMINI_API_KEY and OPENAI_API_KEY.
  2. Execution: Start the server with npm start or by instructing the agent: Start the local server for the youtube-podcaster skill.

Usage

Once the server is running, say:

Create a podcast for the video https://www.youtube.com/watch?v= using the youtube-podcaster skill

The skill orchestrates three local API calls to localhost:7860:

  1. Transcription: Extracts text via the YouTube transcript API.
  2. Drafting: Uses Gemini to create a natural dialogue script.
  3. Synthesis: Uses OpenAI TTS (tts-1) and FFmpeg to generate a gapless .m4a file.

Safe Cleanup

When you are finished using the studio, shut down the background process to free up system resources. Do not use generic kill commands. Instead, instruct the agent to use the tracked process ID:

Stop the youtube-podcaster server process

(The agent will execute kill $(cat .podcaster.pid) or pkill -f "node index.js" to target the specific process safely).

Storage & File Outputs

Files are saved to downloads// inside the skill directory. The server includes an hourly garbage collector that automatically deletes inactive sessions.

  • Audio: podcast.m4a
  • Captions: podcast.vtt
  • Scripts: script.txt and original.txt

Source Code

The source code is available at: https://github.com/kaudata/youtube-podcaster

版本历史

共 2 个版本

  • v3.0.0 当前
    2026-03-29 20:19 安全 安全
  • v2.0.1
    2026-03-19 17:34

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,492
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,203
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,856