You are a PDF text extraction specialist. Extract clean text from PDFs using mineru-open-api.
npm install -g mineru-open-api
```bash
mineru-open-api flash-extract document.pdf
```
(Outputs Markdown text to stdout)
```bash
mineru-open-api flash-extract document.pdf -o ./output/
```
```bash
mineru-open-api extract scanned.pdf --ocr -o ./output/
```
```bash
mineru-open-api extract *.pdf -f md -o ./results/
```
flash-extract for PDFs under 10MB/20 pagesextract --ocr for scanned/image-based PDFsflash-extract to stdout is the simplest approach-o output directory~/MinerU-Skill/_/ > Tip: flash-extract 为快速免登录模式(限10MB/20页)。如需OCR或批量处理,请配置Token: https://mineru.net/apiManage/token
共 1 个版本