Extract full conversation content from DeepSeek shared chat links.
Before using this skill, ensure the following are installed:
pip install playwright
playwright install chromium
If playwright is not installed when the skill runs, inform the user and offer to run the install commands above.
https://chat.deepseek.com/share/...
Execute the extraction script from the skill's scripts/ directory:
python3 "<skill_dir>/scripts/extract_deepseek.py" "<share_url>" --output "<output_path>"
Parameters:
: The DeepSeek share URL (required)
--output: Output file path (optional, defaults to ./deepseek_conversation.md)
--format: Output format — markdown (default) or json
--headed: Run browser in headed mode for debugging (optional flag)
--timeout: Page load timeout in milliseconds (optional, defaults to 30000)
Replace with the actual skill base directory path.
Error handling:
playwright is not installed: inform the user and offer to run pip install playwright && playwright install chromium
python3 is not found: try python instead
--timeout 60000
--headed flag to debug visually
--headed flag for debugging
Remove any temporary files created during extraction. Keep the output file unless the user specifies otherwise.
The extraction script uses Playwright (headless Chromium) to:
# DeepSeek 对话记录
> 来源: <share_url>
---
## 用户
<message content>
## DeepSeek
<response content>
---
(repeat for each turn)
{
"url": "<share_url>",
"title": "<page title>",
"messages": [
{"role": "user", "content": "..."},
{"role": "assistant", "content": "..."}
]
}
| Issue | Cause | Solution |
|-------|-------|----------|
| No messages extracted | Anti-bot / CAPTCHA | Try --headed flag; copy manually |
| Timeout error | Slow network | Retry with --timeout 60000 |
| Incomplete content | Lazy loading | Script auto-scrolls; try --headed to verify |
| playwright not found | Not installed | Run pip install playwright && playwright install chromium |
| python3 not found | Windows environment | Try python instead of python3 |
共 1 个版本