← 返回
内容创作 Key 中文

dy-video-to-text

Extract speech-to-text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use when user shares a Douyin link, asks to...
提取抖音视频的语音转文字,获取无水印下载链接并下载视频。用户分享抖音链接并请求下载时使用。
hljwwyy123
内容创作 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 1
Stars
📥 622
下载
💾 85
安装
1
版本
#latest

概述

Douyin Video Processor

Process Douyin (TikTok China) share links: extract video speech to text, get watermark-free download URLs, and download videos locally.

First-Time Setup

Before using any script, install the Python dependencies:

bash scripts/install_deps.sh

This installs requests and dashscope. Only needed once.

Environment Variable

The text extraction feature requires DASHSCOPE_API_KEY from Alibaba Cloud Bailian.

The user must set this before you can call douyin_extract_text.py. If it is not set, tell the user:

> To use text extraction, you need an Alibaba Cloud Bailian API key.

> 1. Go to https://help.aliyun.com/zh/model-studio/get-api-key

> 2. Create an API key (starts with sk-)

> 3. Set it: /secrets set DASHSCOPE_API_KEY sk-your-key

The other two scripts (parse and download) do not need any API key.

Available Scripts

1. Parse Video Info (no API key needed)

Extract video metadata (ID, title, watermark-free URL) from a Douyin share link.

python3 scripts/douyin_parse.py "SHARE_LINK"

Input: A Douyin share link or text containing one (e.g. https://v.douyin.com/xxxxx/ or a full share text like "7.29 复制打开抖音... https://v.douyin.com/xxxxx/")

Output: JSON to stdout:

{
  "status": "success",
  "video_id": "7345678901234567890",
  "title": "Video title here",
  "download_url": "https://..."
}

When to use: User wants to see video info, or you need the download URL without downloading the file.


2. Download Video (no API key needed)

Download a watermark-free video file to local disk.

python3 scripts/douyin_download.py "SHARE_LINK" [output_directory]

Input:

  • Arg 1: Douyin share link or text containing one
  • Arg 2 (optional): Output directory, defaults to current directory

Output: JSON to stdout:

{
  "status": "success",
  "video_id": "7345678901234567890",
  "title": "Video title here",
  "file_path": "/absolute/path/to/video.mp4",
  "size_bytes": 12345678
}

When to use: User asks to download a Douyin video, save a video, or get the actual video file.


3. Extract Text from Video (requires DASHSCOPE_API_KEY)

Parse a Douyin share link, then transcribe the video speech to text using Alibaba Cloud ASR.

DASHSCOPE_API_KEY="$DASHSCOPE_API_KEY" python3 scripts/douyin_extract_text.py "SHARE_LINK" [model]

Input:

  • Arg 1: Douyin share link or text containing one
  • Arg 2 (optional): ASR model name, defaults to paraformer-v2

Output: JSON to stdout:

{
  "status": "success",
  "video_id": "7345678901234567890",
  "title": "Video title here",
  "text": "The full transcribed text content from the video..."
}

When to use: User wants to know what's said in a Douyin video, asks to transcribe, extract text, get subtitles, or summarize video content.

Important: Always pass DASHSCOPE_API_KEY as an environment variable in the command. If the key is not set, the script will return an error with setup instructions.

Error Handling

All scripts return JSON even on failure:

{
  "status": "error",
  "error": "Human-readable error message"
}

Common errors and what to tell the user:

Error message containsUser-facing guidance
------
No valid share linkThe input doesn't contain a valid Douyin URL. Ask the user to paste the full share link.
Failed to parse video infoThe video may be deleted, private, or region-locked. Ask the user to verify the link opens in a browser.
DASHSCOPE_API_KEY.*not setThe API key is missing. Guide the user through setup (see Environment Variable section above).
Transcription failedASR API error — the key may be invalid or quota exhausted. Ask the user to check their Alibaba Cloud console.
dashscope package not installedRun bash scripts/install_deps.sh to install dependencies.

Batch Processing

When the user provides multiple Douyin links, process them sequentially. For each link:

  1. Call the appropriate script
  2. Parse the JSON output
  3. Collect results
  4. Present a summary table to the user

For batch text extraction, first parse all links with douyin_parse.py to validate them, then extract text only from the valid ones with douyin_extract_text.py. This avoids wasting API calls on broken links.

Cost Awareness

ScriptAPI CostSpeed
-------------------------
douyin_parse.pyFree~1-2s
douyin_download.pyFreedepends on video size
douyin_extract_text.pyAlibaba Cloud ASR (very low cost)~10-30s

Always prefer douyin_parse.py first when you just need to verify a link or get the download URL.

External Endpoints

EndpointPurposeData Sent
------------------------------
https://v.douyin.com/*Resolve Douyin share link redirectsShare URL
https://www.iesdouyin.com/share/video/*Fetch video page to extract metadataVideo ID
Douyin CDNDownload video file (douyin_download.py only)None (GET request)
https://dashscope.aliyuncs.com/api/*Alibaba Cloud ASR (douyin_extract_text.py only)Video URL + API key

Security & Privacy

  • DASHSCOPE_API_KEY is only sent to Alibaba Cloud's official API endpoint (dashscope.aliyuncs.com)
  • No data is collected, stored, or transmitted by this skill beyond the API calls listed above
  • Downloaded video files are saved only where the user specifies
  • No persistent background processes

Trust Statement

By using this skill, Douyin share links are sent to Douyin/ByteDance servers for URL resolution, and video URLs may be sent to Alibaba Cloud (Aliyun) for speech-to-text transcription. Only install this skill if you trust these services.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 18:17 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,096
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 200,040
content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,524