← 返回
未分类 Key 中文

Omnicast

A local multi-modal podcast pipeline. Ingests media, drafts scripts, synthesizes audio, renders cover art, and uploads to YouTube.
本地多模态播客流水线,摄入媒体、起草脚本、合成音频、渲染封面并上传至YouTube。
kaudata kaudata 来源
未分类 clawhub v1.0.14 1 版本 100000 Key: 需要
★ 1
Stars
📥 681
下载
💾 0
安装
1
版本
#gemini#language#latest#linkedin#nanobanana#nodejs#notebooklm-style#openai#podcast#tts#whisper#youtube

概述

OmniCast Studio

Description

OmniCast Studio is a local Node.js application that provides a multi-modal pipeline for processing text, audio, and video into podcast scripts and social media assets. It exposes a set of local API endpoints to orchestrate these tasks.

Setup Requirements

This application requires the following environment variables to be set in a local .env file:

  • GEMINI_API_KEY: Required for text analysis, translation, and script drafting.
  • OPENAI_API_KEY: Required for audio transcription and synthesis.
  • PORT: Defaults to 7860.

System Requirements:

  • Node.js >= 20.0.0
  • FFmpeg installed and available in the system PATH.

API Endpoints (Localhost:7860)

The service runs strictly on http://127.0.0.1:7860. The following endpoints are available:

1. Media Ingestion

  • Endpoint: POST /api/ingest
  • Purpose: Accepts a URL or file upload. It extracts the text, detects the language, and translates it to English if necessary.

2. Script Drafting

  • Endpoint: POST /api/draft-script
  • Purpose: Utilizes the ingested text to format a conversational, two-host script suitable for audio synthesis.

3. Audio Synthesis

  • Endpoint: POST /api/synthesize
  • Purpose: Converts the drafted script into a final audio file using TTS services.

4. LinkedIn Packaging

  • Endpoint: POST /api/generate-linkedin
  • Purpose: Generates a social media text post and renders a looping MP4 video of the podcast cover art with the synthesized audio.

版本历史

共 1 个版本

  • v1.0.14 当前
    2026-05-01 19:16 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Youtube Podcast Generator

kaudata
从YouTube视频中提取原始文字,使用Gemini生成脚本、OpenAI进行语音合成,转换为多声部AI播客。
★ 1 📥 652

Diagram Generator

kaudata
生成并迭代编辑 Mermaid.js 和 Draw.io 图表,支持多模态上下文(阅读源代码、架构草图和文档)。
★ 1 📥 492

Mercedes-Benz USA Utilities

kaudata
使用邮编筛选器和详细规格,在美国定位梅赛德斯-奔驰经销商并搜索新车库存。
★ 1 📥 481