← 返回
内容创作 Key 中文

WiseDiag MedOCR

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API, with high-accuracy text extraction, table recognition, and multi-column l...
PDF 与图片 OCR — 通过 WiseDiag 云 API 将单个 PDF 或图片转换为 Markdown,具备高精度文本提取、表格识别和多栏布局支持。
wisediag
内容创作 clawhub v1.0.26 4 版本 100000 Key: 需要
★ 1
Stars
📥 1,090
下载
💾 107
安装
4
版本
#latest

概述

⚠️ Privacy Warning

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your file to WiseDiag's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless:

  • You trust WiseDiag's data handling policies
  • You accept that file contents will be transmitted and processed remotely

🚫 Personal Sensitive Information Warning

Do NOT upload documents containing any of the following:

CategoryExamples
-------------------
IdentityID card numbers, passport numbers, driver's license numbers
FinancialBank account numbers, credit card numbers, tax IDs
MedicalMedical records, diagnoses, prescriptions, health insurance info
BiometricFingerprints, facial recognition data, voiceprints
CredentialsPasswords, PINs, security question answers
Personal contactHome address, personal phone number, personal email
Minor's infoAny information belonging to children under 18

> If the document contains sensitive personal information, it is recommended to use a local/offline OCR solution.


WiseOCR Skill (powered by WiseDiag)

A high-accuracy OCR tool that converts a single PDF or image file into Markdown format. After processing, the Markdown result is automatically saved to disk — no additional saving is needed.

Supported formats: PDF, jpg, jpeg, png, webp, gif, bmp, tiff.

Installation

pip install -r requirements.txt

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.wisediag.com/apiKeyManage

The API key MUST be set as an environment variable. The script reads it automatically.

export WISEDIAG_API_KEY=your_api_key

How to Process a File (Step-by-Step)

NEVER call any API or HTTP endpoint directly. ONLY use the script below.

Step 1: Set the API key (if not already set):

export WISEDIAG_API_KEY=your_api_key

Step 2: Run the script with the input file:

cd scripts

# Single PDF
python3 wiseocr.py -i "/path/to/input_filename.pdf"

# Single image
python3 wiseocr.py -i "/path/to/scan.png"

IMPORTANT: If the input file has been copied or renamed (e.g. to a temp path), always pass -n with the original filename (without extension) so the output file is named correctly:

python3 wiseocr.py -i "/tmp/ocr_input.pdf" -n "my_report"
# Output saved to: ~/.openclaw/workspace/WiseOCR/my_report.md

The Markdown result is saved to ~/.openclaw/workspace/WiseOCR/{name}.md automatically. No additional saving is needed.

Arguments

FlagDescription
-------------------
-i, --inputInput file: PDF or image — single file path (required)
-n, --nameOutput filename stem (recommended when input file is renamed/copied)
-o, --outputOutput directory (default: ~/.openclaw/workspace/WiseOCR)
--dpiPDF rendering DPI, 72-600 (default: 200)

Data Privacy

What happens to your files:

  1. Files are uploaded to WiseDiag's OCR API
  2. Files are processed on WiseDiag servers
  3. Processing results are returned to you
  4. Files are not permanently stored on WiseDiag servers

⚠️ Do NOT upload documents containing personal sensitive information, including but not limited to:

  • Identity information: ID card numbers, passport numbers, driver's license numbers
  • Financial information: Bank account/card numbers, credit card numbers, tax IDs, financial statements
  • Medical & health information: Medical records, diagnoses, prescriptions, health insurance information
  • Biometric data: Fingerprints, facial images, iris scans, voiceprints
  • Account credentials: Passwords, PINs, security question answers, private keys
  • Personal contact details: Home address, personal phone number, personal email
  • Information about minors: Any personally identifiable information belonging to children under 18

For documents containing the above, use offline/local OCR tools instead.

License

MIT

版本历史

共 4 个版本

  • v1.0.26 当前
    2026-05-03 03:20 安全 安全
  • v1.0.25
    2026-03-29 06:35 安全
  • v1.0.21
    2026-03-26 21:31
  • v1.0.18
    2026-03-07 01:56

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,942
content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,084
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,215