← 返回
内容创作

jpocr

Japanese OCR via NDLOCR-Lite (National Diet Library). Trigger on 'OCR this image', '日文OCR', 'recognize Japanese text', or any request to extract text from Ja...
Japanese OCR via NDLOCR-Lite (National Diet Library). Trigger on 'OCR this image', '日文OCR', 'recognize Japanese text', or any request to extract text from Ja...
realwaynesun
内容创作 clawhub v1.0.0 1 版本 99777 Key: 无需
★ 0
Stars
📥 1,342
下载
💾 1
安装
1
版本
#latest

概述

jpocr — Japanese OCR Skill

Local Japanese OCR powered by NDLOCR-Lite from Japan's National Diet Library.

Runs on CPU (Apple Silicon / x86), no GPU or API key required.

Capabilities

TargetQuality
-----------------
Printed Japanese (活字)Excellent
Vertical text (縦書き)Excellent
English textGood
Handwritten Japanese (手書き)Experimental

How to call

Run scripts/ocr-cli.sh from the skill root directory:

<SKILL_ROOT>/scripts/ocr-cli.sh <image_path>              # → plain text to stdout
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --json        # → JSON with bounding boxes
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --viz         # → also saves visualization
<SKILL_ROOT>/scripts/ocr-cli.sh <dir_path>                 # → batch all images in dir

Output formats

text (default): one line per detected text region.

json:

{
  "contents": [[
    {
      "boundingBox": [[x1,y1],[x1,y2],[x2,y1],[x2,y2]],
      "text": "recognized text",
      "confidence": 0.95,
      "isVertical": "true"
    }
  ]],
  "imginfo": { "img_width": 1920, "img_height": 1080 }
}

viz: saves viz_ bounding-box overlay image to the output directory.

Performance

  • ~2-3 seconds per image on Apple Silicon (CPU)
  • Formats: JPG, PNG, TIFF, JP2, BMP
  • Charset: ~7000 characters (JIS kanji + kana + ASCII + Greek)

Tech stack

  • Layout detection: DEIMv2 (ONNX)
  • Text recognition: PARSeq cascade (30/50/100 char models, ONNX)
  • Reading order: xy-cut algorithm

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 02:23 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 859 📥 199,546
content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,041
content-creation

Baidu Wenku AIPPT

ide-rea
使用百度文库 AI 智能生成 PPT,自动根据内容选择模板。
★ 66 📥 46,160