← 返回
未分类 中文

Deepseek Extract

Extract full conversation content from DeepSeek shared chat links. Use when: user provides a DeepSeek share URL (chat.deepseek.com/share/...), wants to extra...
Extract full conversation content from DeepSeek shared chat links. Use when: user provides a DeepSeek share URL (chat.deepseek.com/share/...), wants to extra...
zz0116
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 376
下载
💾 0
安装
1
版本
#latest

概述

DeepSeek Extract

Extract full conversation content from DeepSeek shared chat links.

Prerequisites

Before using this skill, ensure the following are installed:

pip install playwright
playwright install chromium

If playwright is not installed when the skill runs, inform the user and offer to run the install commands above.

Workflow

Step 1: Validate the URL

  1. Check if the provided URL matches https://chat.deepseek.com/share/...
  2. If the URL does not match this pattern, inform the user:
    • "This skill only supports DeepSeek share links (chat.deepseek.com/share/...)."
    • If the URL is from another AI platform (ChatGPT, Claude, etc.), suggest the appropriate skill or manual copy.
  3. If valid, proceed to Step 2.

Step 2: Run the extraction script

Execute the extraction script from the skill's scripts/ directory:

python3 "<skill_dir>/scripts/extract_deepseek.py" "<share_url>" --output "<output_path>"

Parameters:

  • : The DeepSeek share URL (required)
  • --output: Output file path (optional, defaults to ./deepseek_conversation.md)
  • --format: Output format — markdown (default) or json
  • --headed: Run browser in headed mode for debugging (optional flag)
  • --timeout: Page load timeout in milliseconds (optional, defaults to 30000)

Replace with the actual skill base directory path.

Error handling:

  • If playwright is not installed: inform the user and offer to run pip install playwright && playwright install chromium
  • If python3 is not found: try python instead
  • If the script exits with a timeout error: retry with --timeout 60000
  • If the script exits with an empty extraction: try with --headed flag to debug visually

Step 3: Verify and deliver

  1. Read the output file to verify content was extracted successfully.
  2. If the output is empty or contains fewer than 2 messages:
    • Try again with --headed flag for debugging
    • The DeepSeek page may have anti-bot protection — inform the user
    • Suggest: "DeepSeek may be blocking automated access. You can try manually copying the conversation."
  3. If content was extracted successfully, present it to the user.
  4. If the user wants a different format (Word, PDF, etc.), use other skills (docx, pdf) to convert.

Step 4: Cleanup

Remove any temporary files created during extraction. Keep the output file unless the user specifies otherwise.

How It Works

The extraction script uses Playwright (headless Chromium) to:

  1. Navigate to the DeepSeek share URL
  2. Wait for the SPA to fully render (JavaScript execution)
  3. Try multiple CSS selector strategies to find message elements
  4. Classify messages as user or assistant based on class names and data attributes
  5. If primary selectors fail, use aggressive text-parsing fallback
  6. Output the result as Markdown or JSON

Output Format

Markdown (default)

# DeepSeek 对话记录

> 来源: <share_url>

---

## 用户
<message content>

## DeepSeek
<response content>

---
(repeat for each turn)

JSON

{
  "url": "<share_url>",
  "title": "<page title>",
  "messages": [
    {"role": "user", "content": "..."},
    {"role": "assistant", "content": "..."}
  ]
}

Troubleshooting

| Issue | Cause | Solution |

|-------|-------|----------|

| No messages extracted | Anti-bot / CAPTCHA | Try --headed flag; copy manually |

| Timeout error | Slow network | Retry with --timeout 60000 |

| Incomplete content | Lazy loading | Script auto-scrolls; try --headed to verify |

| playwright not found | Not installed | Run pip install playwright && playwright install chromium |

| python3 not found | Windows environment | Try python instead of python3 |

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 11:30 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误和纠正,以实现持续改进。使用时机:(1)命令或操作意外失败;(2)用户纠正……
★ 4,056 📥 796,258
security-compliance

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,211 📥 266,194
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,350 📥 317,725