
Video Insight

Cross-platform video transcript extraction and optional AI summarization for YouTube and Bilibili. GPU auto-detect. Transcript-first with opt-in LLM summary.
Author: huuuwnnn-droid
Source: clawhub · Category: Uncategorized · Version: v1.0.0 (latest, 1 version) · API key: not required
Stars: 0 · Downloads: 305 · Installs: 0

Overview

video-insight

Description

Extract transcripts, metadata, and optional keyframes from YouTube and Bilibili videos. Outputs structured JSON to stdout. By default, no LLM summarization is performed — the agent receives the full transcript and does its own summarization with full context window.

Supports: macOS, Linux, WSL, Windows VM.

Usage

# Single video — transcript only (default, recommended)
video-insight --url "https://www.youtube.com/watch?v=VIDEO_ID"

# Bilibili video
video-insight --url "https://www.bilibili.com/video/BV1xxxxx"

# With LLM summary (opt-in)
video-insight --url "https://..." --summarize

# Channel scan (recent videos)
video-insight --channel "UC_x5XG1OV2P6uZZ5FSM9Ttw" --hours 24

# Quiet mode (no stderr progress)
video-insight --url "https://..." --quiet

# Force refresh (ignore cache)
video-insight --url "https://..." --no-cache

# Extract keyframes too
video-insight --url "https://..." --frames
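When the CLI is driven from a script rather than a shell, the invocations above could be wrapped roughly as follows. This is a minimal sketch, not part of the skill: `build_command` and `run_video_insight` are hypothetical helper names, and it assumes `video-insight` is on PATH and prints exactly one JSON document to stdout. Only flags listed above are used.

```python
import json
import subprocess


def build_command(url, summarize=False, frames=False, quiet=True, no_cache=False):
    """Assemble the argv list for a single-video run (hypothetical helper)."""
    cmd = ["video-insight", "--url", url]
    if summarize:
        cmd.append("--summarize")  # opt-in LLM summary
    if frames:
        cmd.append("--frames")     # also extract keyframes
    if quiet:
        cmd.append("--quiet")      # suppress stderr progress
    if no_cache:
        cmd.append("--no-cache")   # force a re-fetch
    return cmd


def run_video_insight(url, **options):
    """Run the CLI and return the parsed JSON document from stdout."""
    # Assumption: the binary is on PATH and stdout carries one JSON document.
    # json.loads raises if stdout is empty or is not valid JSON.
    completed = subprocess.run(
        build_command(url, **options),
        capture_output=True,
        text=True,
        check=False,
    )
    return json.loads(completed.stdout)
```

Passing the arguments as a list avoids shell quoting problems with URLs that contain `&` or `?`.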

Triggers

Summarize video, extract transcript, YouTube summary, Bilibili transcript, video transcript, 视频摘要, 视频总结, B站视频, YouTube视频

Output Schema

{
  "ok": true,
  "data": {
    "video_id": "dQw4w9WgXcQ",
    "platform": "youtube",
    "title": "Video Title",
    "channel": "Channel Name",
    "duration_seconds": 212,
    "transcript": "Full transcript text without truncation...",
    "transcript_with_timestamps": "[0.0-3.2] First segment\n[3.2-6.5] Second...",
    "frames": [{"file": "/tmp/.../frame_001.jpg", "time_sec": 30}],
    "cached": false
  },
  "error": null
}
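A consumer of this schema would typically check `ok` before touching `data`. The sketch below illustrates that, plus one way to split `transcript_with_timestamps` into segments. The function names are hypothetical, and the `[start-end] text` line pattern is inferred from the single example above, so lines that do not match it are skipped rather than treated as errors.

```python
import re

# Assumed line shape, inferred from the schema example: "[0.0-3.2] First segment"
SEGMENT_RE = re.compile(r"^\[(\d+(?:\.\d+)?)-(\d+(?:\.\d+)?)\]\s*(.*)$")


def unwrap(result):
    """Return the `data` object, or raise if the envelope reports failure."""
    if not result.get("ok"):
        raise RuntimeError(f"video-insight failed: {result.get('error')}")
    return result["data"]


def parse_segments(timestamped):
    """Split `transcript_with_timestamps` into (start, end, text) tuples."""
    segments = []
    for line in timestamped.splitlines():
        match = SEGMENT_RE.match(line.strip())
        if match:  # skip lines that do not fit the assumed pattern
            start, end, text = match.groups()
            segments.append((float(start), float(end), text))
    return segments
```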

Cache

Transcripts are permanently cached at ~/.cache/video-insight/{platform}_{video_id}.json. The .json format stores metadata + transcript together for richer cache hits (title, channel, duration, timestamps). Use --no-cache to force re-fetch.
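To check for a cache hit without invoking the CLI, the path pattern above can be reproduced directly. This is a sketch under stated assumptions: `cache_path` and `load_cached` are hypothetical helpers, the `cache_dir` override exists only for this example, and the internal layout of the cached JSON is not documented here, so the file is returned as parsed without interpretation.

```python
import json
from pathlib import Path


def cache_path(platform, video_id, cache_dir=None):
    """Build ~/.cache/video-insight/{platform}_{video_id}.json."""
    base = Path(cache_dir) if cache_dir else Path.home() / ".cache" / "video-insight"
    return base / f"{platform}_{video_id}.json"


def load_cached(platform, video_id, cache_dir=None):
    """Return the cached JSON document, or None if no cache file exists."""
    path = cache_path(platform, video_id, cache_dir)
    if not path.is_file():
        return None
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)
```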

Agent Integration Guide

  1. Default workflow: Call video-insight --url <URL> and receive JSON with the full transcript. Summarize it in your own LLM context; you have 128K+ tokens, so the script has no need to truncate.
  2. Keyframes: Add the --frames flag. It is only needed when the user explicitly asks for a visual or image review.
  3. Long videos (2h+): The transcript may be very large. Use map-reduce or chunked summarization.
  4. Bilibili: These videos require ffmpeg and faster-whisper (installed via setup.sh). YouTube videos typically have captions and are much faster.
  5. Cron/headless: Use --summarize --quiet for automated pipelines.
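The chunked, map-reduce approach suggested for long videos could look roughly like this. It is an illustrative sketch rather than the skill's own logic: `chunk_transcript` and `map_reduce_summary` are hypothetical names, the character budget and overlap are arbitrary defaults, and `summarize` stands for whatever callable the agent uses to summarize a piece of text.

```python
def chunk_transcript(text, max_chars=12000, overlap=200):
    """Split text into overlapping chunks of at most max_chars characters."""
    if max_chars <= overlap:
        raise ValueError("max_chars must be larger than overlap")
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        if end < len(text):
            # Prefer to cut at the last space in the window; fall back to a
            # hard cut (e.g. for text without spaces, such as Chinese).
            cut = text.rfind(" ", start + overlap + 1, end)
            if cut != -1:
                end = cut
        chunks.append(text[start:end])
        if end == len(text):
            break
        start = end - overlap  # repeat a little context in the next chunk
    return chunks


def map_reduce_summary(text, summarize, **chunk_options):
    """Summarize each chunk (map), then summarize the joined partials (reduce)."""
    chunks = chunk_transcript(text, **chunk_options)
    if len(chunks) <= 1:
        return summarize(text)
    partials = [summarize(chunk) for chunk in chunks]
    return summarize("\n\n".join(partials))
```

The overlap keeps a sentence that straddles a boundary visible in both neighboring chunks, at the cost of summarizing a small amount of text twice.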

Setup

cd ~/.openclaw/skills/video-insight && bash setup.sh

Dependencies

Required: yt-dlp, youtube-transcript-api, innertube, ffmpeg (system)

Optional: faster-whisper (for Bilibili/captionless videos), requests (for --summarize)
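A quick preflight check for these dependencies might be sketched as below. `missing_dependencies` is a hypothetical helper, and the import names in the usage comment (`yt_dlp`, `youtube_transcript_api`, `innertube`, `faster_whisper`) are assumptions about how the listed packages are imported, not something this listing states.

```python
import importlib.util
import shutil


def missing_dependencies(modules, executables):
    """Return the names of Python modules and system executables not found."""
    missing = [name for name in modules if importlib.util.find_spec(name) is None]
    missing += [name for name in executables if shutil.which(name) is None]
    return missing


# Example call (import names are assumed, see the note above):
# missing_dependencies(
#     ["yt_dlp", "youtube_transcript_api", "innertube", "faster_whisper", "requests"],
#     ["ffmpeg"],
# )
```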

Version History

1 version in total

  • v1.0.0 (current)
    2026-05-07 23:31 · Security scans: safe, safe

Security Checks

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
