jpocr

Japanese OCR via NDLOCR-Lite (National Diet Library). Trigger on 'OCR this image', '日文OCR', 'recognize Japanese text', or any request to extract text from Ja...

realwaynesun

Content creation · clawhub · v1.0.0 · 1 version · 99777 · Key: not required

★ 0 stars · 📥 1,342 downloads · 💾 1 install · Version: #latest

Overview

jpocr — Japanese OCR Skill

Local Japanese OCR powered by NDLOCR-Lite from Japan's National Diet Library.

Runs on CPU (Apple Silicon / x86), no GPU or API key required.

Capabilities

Target	Quality
--------	---------
Printed Japanese (活字)	Excellent
Vertical text (縦書き)	Excellent
English text	Good
Handwritten Japanese (手書き)	Experimental

How to call

Run scripts/ocr-cli.sh from the skill root directory:

<SKILL_ROOT>/scripts/ocr-cli.sh <image_path>              # → plain text to stdout
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --json        # → JSON with bounding boxes
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --viz         # → also saves visualization
<SKILL_ROOT>/scripts/ocr-cli.sh <dir_path>                 # → batch all images in dir
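
If the skill is driven from another program rather than typed by hand, the invocations above can be wrapped in a small helper. The following is a minimal Python sketch, not part of the skill itself: the function names `build_ocr_command` and `run_ocr` are hypothetical, and it assumes that `--json` and `--viz` may be combined, which the usage lines suggest but do not state outright.

```python
import subprocess
from pathlib import Path


def build_ocr_command(skill_root, target, json_output=False, viz=False):
    """Build the argv list for scripts/ocr-cli.sh using only the documented flags."""
    cmd = [str(Path(skill_root) / "scripts" / "ocr-cli.sh"), str(target)]
    if json_output:
        cmd.append("--json")
    if viz:
        # Assumption: --viz can be added alongside --json.
        cmd.append("--viz")
    return cmd


def run_ocr(skill_root, target, json_output=False, viz=False):
    """Run the CLI and return its stdout; raises CalledProcessError on a non-zero exit."""
    result = subprocess.run(
        build_ocr_command(skill_root, target, json_output, viz),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list (rather than a single shell string) avoids quoting problems with image paths that contain spaces or Japanese characters.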

Output formats

text (default): one line per detected text region.

json:

{
  "contents": [[
    {
      "boundingBox": [[x1,y1],[x1,y2],[x2,y1],[x2,y2]],
      "text": "recognized text",
      "confidence": 0.95,
      "isVertical": "true"
    }
  ]],
  "imginfo": { "img_width": 1920, "img_height": 1080 }
}
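
As a rough illustration of consuming this structure, the sketch below flattens the nested `contents` list and converts the four corner points into a rectangle. The helper names and the sample values are made up for the example; the field names follow the JSON shown above, including the fact that `isVertical` appears as the string "true" rather than a JSON boolean.

```python
import json


def extract_lines(ocr_json, min_confidence=0.0):
    """Flatten "contents" into (text, confidence, is_vertical) tuples."""
    lines = []
    for group in ocr_json.get("contents", []):
        for region in group:
            confidence = float(region.get("confidence", 0.0))
            if confidence < min_confidence:
                continue
            # isVertical is a string in the sample output, so compare text.
            is_vertical = str(region.get("isVertical", "")).lower() == "true"
            lines.append((region.get("text", ""), confidence, is_vertical))
    return lines


def bbox_to_rect(bounding_box):
    """Convert the four corner points into (x_min, y_min, x_max, y_max)."""
    xs = [point[0] for point in bounding_box]
    ys = [point[1] for point in bounding_box]
    return min(xs), min(ys), max(xs), max(ys)


# Hypothetical sample data in the documented shape.
sample = json.loads("""
{
  "contents": [[
    {"boundingBox": [[10, 20], [10, 220], [40, 20], [40, 220]],
     "text": "吾輩は猫である", "confidence": 0.95, "isVertical": "true"},
    {"boundingBox": [[60, 20], [60, 120], [90, 20], [90, 120]],
     "text": "???", "confidence": 0.41, "isVertical": "true"}
  ]],
  "imginfo": {"img_width": 1920, "img_height": 1080}
}
""")

confident = extract_lines(sample, min_confidence=0.9)
first_rect = bbox_to_rect(sample["contents"][0][0]["boundingBox"])
```

Taking the minimum and maximum of the corner coordinates keeps `bbox_to_rect` independent of the order in which the corners are listed.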

viz: saves a bounding-box overlay image (viz_ filename prefix) to the output directory.

Performance

~2-3 seconds per image on Apple Silicon (CPU)
Formats: JPG, PNG, TIFF, JP2, BMP
Charset: ~7000 characters (JIS kanji + kana + ASCII + Greek)
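
Before a batch run it can help to check which files the skill is likely to accept and roughly how long the run will take. The sketch below is only an estimate under stated assumptions: the extension list is a guess at how the formats above map to file suffixes, the 2 to 3 second range is the Apple Silicon figure quoted above, and both helper names are hypothetical.

```python
from pathlib import Path

# Assumed suffixes for the listed formats (JPG, PNG, TIFF, JP2, BMP).
SUPPORTED_SUFFIXES = {".jpg", ".jpeg", ".png", ".tif", ".tiff", ".jp2", ".bmp"}


def is_supported_image(path):
    """Return True if the file suffix matches one of the assumed suffixes."""
    return Path(path).suffix.lower() in SUPPORTED_SUFFIXES


def estimate_batch_seconds(n_images, seconds_per_image=(2.0, 3.0)):
    """Return a (low, high) estimate of total run time in seconds."""
    low, high = seconds_per_image
    return n_images * low, n_images * high
```

For example, `estimate_batch_seconds(100)` gives a range of 200 to 300 seconds; other CPUs may differ from the Apple Silicon figure.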

Tech stack

Layout detection: DEIMv2 (ONNX)
Text recognition: PARSeq cascade (30/50/100 char models, ONNX)
Reading order: xy-cut algorithm
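
To give an idea of what an xy-cut reading order does, here is a minimal generic sketch of recursive XY-cut over axis-aligned boxes. It is not NDLOCR-Lite's implementation: it assumes boxes are `(x_min, y_min, x_max, y_max)` tuples and a horizontal, left-to-right layout, whereas vertical Japanese text is normally read in columns from right to left.

```python
def _split_by_gaps(boxes, lo, hi):
    """Group boxes into runs separated by empty gaps along one axis.

    lo and hi are the tuple indices of the interval start and end on that axis.
    """
    ordered = sorted(boxes, key=lambda b: b[lo])
    groups = [[ordered[0]]]
    reach = ordered[0][hi]  # furthest extent covered by the current group
    for box in ordered[1:]:
        if box[lo] > reach:
            groups.append([box])  # projection gap found: start a new group
        else:
            groups[-1].append(box)
        reach = max(reach, box[hi])
    return groups


def xy_cut(boxes):
    """Return boxes in reading order using recursive XY-cut."""
    if len(boxes) <= 1:
        return list(boxes)
    # Try cutting along y (rows, top to bottom) first, then along x (columns).
    for lo, hi in ((1, 3), (0, 2)):
        groups = _split_by_gaps(boxes, lo, hi)
        if len(groups) > 1:
            return [box for group in groups for box in xy_cut(group)]
    # No clean cut on either axis: fall back to top-to-bottom, left-to-right.
    return sorted(boxes, key=lambda b: (b[1], b[0]))


# Hypothetical page: a full-width header above two staggered columns.
header = (0, 0, 100, 10)
left1, left2 = (0, 20, 45, 30), (0, 35, 45, 45)
right1, right2 = (55, 25, 100, 38), (55, 42, 100, 55)

order = xy_cut([right2, left1, header, right1, left2])
```

The header is separated first by a horizontal cut, the remaining block is then cut vertically into two columns, and each column is cut into its lines.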

Version history

1 version in total

v1.0.0 (current)

2026-03-30 02:23 Safe / Safe

Security checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
