← 返回
未分类 中文

Summarize Pdf

PDF to Markdown converter - extract text, tables and formulas from PDF files to clean Markdown. Use when converting PDF documents, extracting PDF content, pa...
PDF转Markdown转换器 - 从PDF文件中提取文本、表格和公式,转换为干净的Markdown格式。用于转换PDF文档、提取PDF内容等场景。
tanis90 tanis90 来源
未分类 clawhub v1.0.0 1 版本 99713.5 Key: 无需
★ 0
Stars
📥 1,044
下载
💾 101
安装
1
版本
#latest

概述

Summarize PDF - Quick Content Extraction

Convert PDF files to clean Markdown using MinerU Open API. No API key required.

Quick Start

# Summarize PDF - Quick Content Extraction
mineru-open-api flash-extract report.pdf

# Summarize PDF - Quick Content Extraction
mineru-open-api flash-extract https://cdn-mineru.openxlab.org.cn/demo/example.pdf

# Summarize PDF - Quick Content Extraction
mineru-open-api flash-extract report.pdf -o ./output/

# Summarize PDF - Quick Content Extraction
mineru-open-api flash-extract report.pdf --pages 1-10

Language Rule

You MUST reply to the user in the SAME language they use. This is non-negotiable.

Capabilities

  • Extracts text, tables, and formulas from PDF
  • Supports both local files and URLs directly
  • Page range selection with --pages
  • Language hint with --language (default: ch, use en for English)
  • No API key, no signup, no authentication
  • Max 10MB / 20 pages per document

When to Use

  • User asks to "read", "extract", "convert", or "parse" a PDF
  • User shares a PDF file or PDF link and asks for its content
  • User wants to summarize or analyze a PDF document
  • User needs PDF content in Markdown format

CLI Reference

Run mineru-open-api flash-extract --help for all available options.

Data Flow

flash-extract sends the document to the MinerU API (mineru.net) for processing and returns Markdown. This is a stateless API call — no account, no persistent storage. MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU

Notes

  • Output is Markdown only; images/tables/formulas may be replaced with placeholders
  • For larger files (up to 200MB/600 pages) or precision extraction with full assets, use mineru-open-api extract (requires auth via mineru-open-api auth)
  • If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-31 03:52 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

Excel To Markdown

tanis90
文档转 Markdown 转换器 - 将 DOCX、PPTX、Excel 文件转换为 Markdown。用于从 Word 文档、PowerPoint 演示文稿或 E... 提取内容。
★ 0 📥 697

Pptx To Markdown

tanis90
文档转 Markdown 转换器 - 将 DOCX、PPTX、Excel 文件转换为 Markdown。用于从 Word 文档、PowerPoint 演示文稿或 E... 提取内容。
★ 0 📥 822

PDF to Markdown - Extract Text, Tables, Formulas from PDF

tanis90
PDF转Markdown转换器 - 从PDF文件中提取文本、表格和公式,转换为干净的Markdown格式。用于转换PDF文档、提取PDF内容等场景。
★ 0 📥 826