
Memect Ppx

Parse PDFs and images into Markdown/JSON using the memect-ppx (`ppx`) CLI. Use when the user asks to OCR scanned PDFs or screenshots, extract tables from PDFs, and similar tasks.
Author: lihanghang
Uncategorized · clawhub · v0.2.6 · 2 versions · 100000 · Key: not required
Stars: 0 · Downloads: 479 · Installs: 0
Tags: latest, markdown, ocr, parser, pdf, table

Overview

memect-ppx

Use the local ppx CLI to parse PDFs and images into structured Markdown and JSON.

Runtime Requirements

  • Use Python >= 3.12.
  • Prefer installing PPX into a virtual environment instead of the system Python.
  • If ppx is missing, read references/troubleshooting.md and create a virtual environment before installing dependencies.
  • Keep this skill's frontmatter version synchronized from the repository pyproject.toml with scripts/sync_version.py.
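
The requirements above can be verified before any parse is attempted. The following is a minimal sketch, assuming the CLI is exposed on PATH under the name ppx. The function names are hypothetical, and this is not the content of the skill's own scripts/check_ppx_env.sh.

```python
import shutil
import sys

MIN_PYTHON = (3, 12)  # minimum version stated in the requirements above


def python_ok(version_info=None):
    """Return True when the interpreter meets the Python >= 3.12 requirement."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= MIN_PYTHON


def in_virtualenv():
    """Return True when running inside a venv (prefix differs from base prefix)."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def ppx_available(executable="ppx"):
    """Return True when the executable is found on PATH (assumed name: ppx)."""
    return shutil.which(executable) is not None


def runtime_report():
    """Collect the three checks into one dictionary for reporting."""
    return {
        "python_ok": python_ok(),
        "in_virtualenv": in_virtualenv(),
        "ppx_available": ppx_available(),
    }
```

If `ppx_available` is False, the next step per the requirements is to consult references/troubleshooting.md and set up a virtual environment rather than installing into the system Python.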

Workflow

  1. Confirm the runtime uses Python >= 3.12.
  2. Check the runtime with scripts/check_ppx_env.sh.
  3. If ppx is missing, create or use a virtual environment and install PPX there.
  4. Choose parsing options:
    • Use --ocr auto by default.
    • Use --ocr yes for scanned PDFs or screenshots.
    • Use --ocr no for native PDFs when OCR causes noise.
    • Use --table auto by default.
    • Use --table llm only when the user needs highest table accuracy and an LLM backend is configured.
  5. Run ppx parse on the input file with -o set to the output directory, for example ppx parse report.pdf -o output/.
  6. Inspect the output folder and report the main artifacts:
    • doc.md
    • doc.json
    • pages/
    • images/ when figures are extracted
  7. If parsing fails, summarize the failing step and load the relevant note from references/.
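
Steps 4 and 5 of the workflow can be sketched as a small helper that turns the chosen options into an argument list. This is an illustrative sketch only: `build_parse_command` is a hypothetical name, it accepts only the option values named in the workflow (the real CLI may support more, see references/cli-options.md), and it always passes --ocr and --table explicitly so that it makes no assumption about the CLI's defaults.

```python
OCR_MODES = {"auto", "yes", "no"}  # values named in the workflow above
TABLE_MODES = {"auto", "llm"}      # values named in the workflow above


def build_parse_command(input_path, output_dir, ocr="auto", table="auto",
                        backend=None, pages=None):
    """Build the argument list for a ppx parse call without running it."""
    if ocr not in OCR_MODES:
        raise ValueError(f"unsupported OCR mode in this sketch: {ocr!r}")
    if table not in TABLE_MODES:
        raise ValueError(f"unsupported table mode in this sketch: {table!r}")

    cmd = ["ppx", "parse", str(input_path), "--ocr", ocr, "--table", table]
    if backend:
        cmd += ["--backend", backend]  # e.g. "deepseek", as in Common Commands
    if pages:
        cmd += ["--pages", pages]      # e.g. "1-5,10", as in Common Commands
    cmd += ["-o", str(output_dir)]
    return cmd
```

The returned list can be handed to `subprocess.run(cmd, check=True)`, which avoids shell quoting issues for values such as the page range.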

Common Commands

ppx parse report.pdf -o output/
ppx parse scan.pdf --ocr yes -o output/
ppx parse figure.png -o output/
ppx parse report.pdf --pages "1-5,10" -o output/
ppx parse report.pdf --table llm --backend deepseek -o output/

Output Contract

  • Prefer returning the absolute output directory.
  • Mention whether the result came from doc.md, doc.json, or page-level files.
  • Call out OCR mode, table mode, and backend when they materially affect accuracy.
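
The contract above can be sketched as a helper that resolves the output directory to an absolute path and records which of the artifacts listed in the workflow are present. The function name `summarize_output` and the shape of the returned dictionary are assumptions made for illustration.

```python
from pathlib import Path

# Artifact names taken from the workflow's list of main outputs.
EXPECTED_ARTIFACTS = ("doc.md", "doc.json", "pages", "images")


def summarize_output(output_dir, ocr="auto", table="auto", backend=None):
    """Describe a finished parse: absolute path, artifacts found, modes used."""
    root = Path(output_dir).resolve()
    found = [name for name in EXPECTED_ARTIFACTS if (root / name).exists()]
    return {
        "output_dir": str(root),  # absolute, as the contract prefers
        "artifacts": found,       # images/ appears only when figures were extracted
        "ocr": ocr,
        "table": table,
        "backend": backend,
    }
```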

References

  • Read references/cli-options.md when choosing parse flags.
  • Read references/backend-config.md when using DeepSeek, Paddle, or GLM backends.
  • Read references/troubleshooting.md when PPX is missing, Python is too old, or runtime dependencies fail.
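
The mapping from situation to reference note can be sketched as a lookup table. The file paths come from the list above; the topic keys ("flags", "backend", "environment") and the function name are hypothetical labels chosen for this example.

```python
# Paths are from the References list; the keys are illustrative labels.
REFERENCE_NOTES = {
    "flags": "references/cli-options.md",           # choosing parse flags
    "backend": "references/backend-config.md",      # DeepSeek, Paddle, or GLM backends
    "environment": "references/troubleshooting.md", # missing PPX, old Python, dependency failures
}


def reference_for(topic):
    """Return the reference note to read for a given topic label."""
    try:
        return REFERENCE_NOTES[topic]
    except KeyError:
        raise ValueError(f"no reference note mapped for topic: {topic!r}") from None
```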

Version History

2 versions in total

  • v0.2.6 (current)
    2026-05-21 13:19 · Safe · Safe
  • v0.2.3
    2026-05-03 10:33 · Safe · Safe

Security Scan

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
