Local Japanese OCR powered by NDLOCR-Lite from Japan's National Diet Library.
Runs on CPU (Apple Silicon / x86), no GPU or API key required.
| Target | Quality |
|---|---|
| -------- | --------- |
| Printed Japanese (活字) | Excellent |
| Vertical text (縦書き) | Excellent |
| English text | Good |
| Handwritten Japanese (手書き) | Experimental |
Run scripts/ocr-cli.sh from the skill root directory:
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> # → plain text to stdout
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --json # → JSON with bounding boxes
<SKILL_ROOT>/scripts/ocr-cli.sh <image_path> --viz # → also saves visualization
<SKILL_ROOT>/scripts/ocr-cli.sh <dir_path> # → batch all images in dir
text (default): one line per detected text region.
json:
{
"contents": [[
{
"boundingBox": [[x1,y1],[x1,y2],[x2,y1],[x2,y2]],
"text": "recognized text",
"confidence": 0.95,
"isVertical": "true"
}
]],
"imginfo": { "img_width": 1920, "img_height": 1080 }
}
viz: saves viz_ bounding-box overlay image to the output directory.
共 1 个版本