← 返回
未分类 Key 中文

Image OCR Parse

Extract text from images via the PDFAPIHub cloud OCR API. Images are uploaded to pdfapihub.com for Tesseract OCR processing. Supports preprocessing (grayscal...
使用 PDFAPIHub 云 OCR API 将图像文字提取并上传至 pdfapihub.com 进行 Tesseract OCR 处理,支持灰度等预处理。
rishabhdugar rishabhdugar 来源
未分类 clawhub v1.0.2 1 版本 100000 Key: 需要
★ 1
Stars
📥 452
下载
💾 1
安装
1
版本
#latest

概述

Image OCR Parse

What It Does

Extracts text from images via the PDFAPIHub hosted OCR API. Your image is uploaded to PDFAPIHub servers where Tesseract OCR processes it and returns the extracted text. Supports optional image preprocessing to improve OCR quality on low-resolution or noisy inputs.

When to Use

  • Extract text from photos of receipts, signs, or documents
  • OCR business cards, ID cards, or labels
  • Process low-quality images with preprocessing

Required Inputs

Provide one of:

  • image_url — URL to an image
  • base64_image — base64-encoded image
  • Multipart upload with file field

Authentication

This skill calls the PDFAPIHub hosted API at https://pdfapihub.com/api. Your image is uploaded to PDFAPIHub servers for OCR processing.

Send your API key in the CLIENT-API-KEY header.

Get your free API key at https://pdfapihub.com. Full API documentation is available at https://pdfapihub.com/docs.

Privacy note: Images you process are uploaded to PDFAPIHub's cloud service and the extracted text is returned. Do not send sensitive images unless you trust the service. Files are auto-deleted after 30 days.

Use Cases

  • Receipt Scanning — Extract text from receipt photos for expense tracking
  • Business Card Reader — OCR business card images to extract name, phone, email
  • License Plate Recognition — Extract plate numbers from photos (with char_whitelist)
  • Meter Reading — Extract digits from utility meter photos for automated logging
  • Whiteboard Capture — OCR whiteboard or handwritten note photos into text
  • Product Label Scanning — Extract ingredient lists or nutrition info from product label photos

Image Preprocessing Options

ParamDefaultDescription
-----------------------------
grayscalefalseConvert to grayscale
sharpenfalseApply sharpening
threshold0Binarization threshold (1–255)
resize0Scale factor (max 4x)

Example Usage

curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
  -H "CLIENT-API-KEY: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "image_url": "https://pdfapihub.com/sample-invoicepage.png",
    "lang": "eng",
    "grayscale": true,
    "sharpen": true,
    "detail": "words"
  }'

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-05-07 08:05 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

PDF Parse

rishabhdugar
将 PDF 解析为结构化 JSON:包含文本、带有边界框的布局感知块、表格和图片元数据。
★ 0 📥 481

Generate Image

rishabhdugar
使用无头 Chromium 将 HTML 内容或公开 URL 生成PNG 图片,支持自定义尺寸、Retina 高清、全页面截图以及 Cookie 配置。
★ 0 📥 528

Generate PDF from HTML

rishabhdugar
根据HTML内容或公开URL生成PDF文档,支持自定义页面尺寸、字体、页边距、视口尺寸、动态参数替换等功能。
★ 0 📥 728