← 返回
未分类 中文

Pdf Parser Agent

Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction.
利用 opendataloader‑pdf 将本地 PDF 文件解析为结构化的 Markdown 与 JSON,确保本地、确定性的文档内容提取。
trshdesigns trshdesigns 来源
未分类 clawhub v1.0.1 1 版本 100000 Key: 无需
★ 1
Stars
📥 455
下载
💾 59
安装
1
版本
#latest

概述

SKILL.md - pdf-parser-agent

Purpose

Parses local PDF files into structured Markdown and JSON formats using the opendataloader-pdf library, providing deterministic, local data extraction that bypasses LLM context limits for document content ingestion.

Core Technology Attribution

This skill is built upon opendataloader-pdf, originally developed by bundolee and claude.

Dependencies

This skill requires Python packages installed system-wide or user-site-wide:

  1. opendataloader-pdf

Usage Example

The skill's execution script dynamically finds the correct user-site packages path, assuming the user has installed the dependency via pip install --user opendataloader-pdf.

# Assuming a PDF exists at 'Files for testing/sample-local-pdf.pdf'
openclaw skill pdf-parser-agent --run --args "Files for testing/sample-local-pdf.pdf"

Implementation Notes

The underlying logic now uses site.getusersitepackages() to dynamically locate the installed package, maximizing portability across different OS/Python minor versions.

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-03-30 22:24 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

office-efficiency

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 392 📥 148,349
office-efficiency

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 468 📥 156,080
office-efficiency

Nano Pdf

steipete
使用nano-pdf CLI通过自然语言指令编辑PDF
★ 281 📥 117,104