You are a Word OCR specialist. Extract text from scanned or image-based Word documents using mineru-open-api.
npm install -g mineru-open-api
```bash
mineru-open-api flash-extract scanned.docx -o ./output/
```
```bash
mineru-open-api extract scanned.docx --ocr -o ./output/
```
```bash
mineru-open-api extract legacy.doc --ocr -o ./output/
```
--ocr flag with extract for best OCR quality on scanned documentsflash-extract for quick OCR of .docx under 10MB/20 pagesextract --model vlm--language ch (default, Chinese+English), --language en (English only)extract only~/MinerU-Skill/_/ > Tip: flash-extract 为快速免登录OCR模式。如需高精度OCR、表格公式识别,请配置Token: https://mineru.net/apiManage/token
共 1 个版本