Extract the speech from local video files and transcribe it into text.
Use this skill when the user mentions any of the following keywords:
Windows:
# Using winget
winget install ffmpeg
# Or using Chocolatey
choco install ffmpeg
macOS:
brew install ffmpeg
Linux:
sudo apt install ffmpeg # Ubuntu/Debian
sudo yum install ffmpeg # CentOS
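Whichever package manager is used, it can help to confirm that `ffmpeg` is actually reachable on `PATH` before running the script. The following is a minimal sketch using only the Python standard library; the helper name `ffmpeg_available` is hypothetical and is not part of `scripts/video2text.py`.

```python
import shutil


def ffmpeg_available(binary: str = "ffmpeg") -> bool:
    """Return True if the named executable can be found on PATH."""
    # shutil.which returns the full path to the executable, or None if absent.
    return shutil.which(binary) is not None
```

Calling `ffmpeg_available()` returns `False` when the installation step above has not been completed or the shell has not picked up the updated `PATH`.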
Configure the following in environment variables or a .env file:
# Alibaba Cloud DashScope API key (required)
DASHSCOPE_API_KEY=your-dashscope-api-key
# Alibaba Cloud OSS configuration (required)
ALIYUN_ACCESS_KEY_ID=your-access-key-id
ALIYUN_ACCESS_KEY_SECRET=your-access-key-secret
ALIYUN_OSS_BUCKET=your-bucket-name
ALIYUN_OSS_ENDPOINT=oss-cn-beijing.aliyuncs.com
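A quick pre-flight check can catch a missing setting before any upload or transcription is attempted. The sketch below assumes all five variables listed above are required (the source marks both groups as required) and only inspects the process environment; it does not read a .env file itself. The names `REQUIRED_VARS` and `missing_vars` are hypothetical.

```python
import os

# Variable names taken from the configuration above.
REQUIRED_VARS = [
    "DASHSCOPE_API_KEY",
    "ALIYUN_ACCESS_KEY_ID",
    "ALIYUN_ACCESS_KEY_SECRET",
    "ALIYUN_OSS_BUCKET",
    "ALIYUN_OSS_ENDPOINT",
]


def missing_vars(env=None):
    """Return the required variable names that are unset or empty."""
    # Default to the real process environment; a dict can be passed for testing.
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

An empty list from `missing_vars()` means every setting is present; otherwise the returned names show which entries still need a value.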
python3 scripts/video2text.py '{"video": "/path/to/video.mp4"}'
python3 scripts/video2text.py '{"videos": ["/path/video1.mp4", "/path/video2.mp4"]}'
python3 scripts/video2text.py '{"video": "/path/to/video.mp4", "output_dir": "/path/to/output"}'
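When the script is called from another program, building the JSON argument with `json.dumps` avoids hand-written quoting mistakes. This is a minimal sketch of the three invocation shapes shown above; `build_command` is a hypothetical helper, and combining `output_dir` with the `videos` list is an assumption, since the source only shows it alongside a single `video`.

```python
import json
import sys


def build_command(videos, output_dir=None, script="scripts/video2text.py"):
    """Build the argv list for one invocation of the script."""
    videos = list(videos)
    if not videos:
        raise ValueError("at least one video path is required")
    # One path uses the "video" key, several paths use the "videos" key.
    if len(videos) == 1:
        payload = {"video": videos[0]}
    else:
        payload = {"videos": videos}
    # Assumption: output_dir is accepted in both the single and batch forms.
    if output_dir is not None:
        payload["output_dir"] = output_dir
    # ensure_ascii=False keeps non-ASCII file paths readable in the argument.
    return [sys.executable, script, json.dumps(payload, ensure_ascii=False)]
```

The returned list can be passed directly to `subprocess.run`, which sidesteps shell quoting entirely because no shell is involved.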
The output is JSONL (one JSON object per line), so it can be read as a stream:
{"type": "video_result", "index": 0, "video_path": "/path/to/video.mp4", "status": "completed", "text": "Video transcript content...", "duration": 65000}
{"type": "summary", "status": "completed", "total": 3, "completed": 2, "failed": 1}
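Because each line is a complete JSON object, a consumer can process results as they arrive. The sketch below separates per-video records from the final summary based on the `type` field seen in the sample output; the function names `parse_jsonl` and `collect` are hypothetical, and any record types beyond `video_result` and `summary` are simply ignored.

```python
import json


def parse_jsonl(lines):
    """Yield one dict per non-empty line of JSONL text."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def collect(lines):
    """Split parsed records into per-video results and the summary record."""
    results, summary = [], None
    for record in parse_jsonl(lines):
        if record.get("type") == "video_result":
            results.append(record)
        elif record.get("type") == "summary":
            summary = record
    return results, summary
```

`lines` can be any iterable of strings, for example the `stdout` of a `subprocess.Popen` object opened in text mode, so records are handled one at a time rather than after the whole run finishes.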