Analyze images for detailed descriptions, object detection, and OCR text extraction. Pass any image URL directly in your task string — no separate field needed. Auto-detects the right mode from your task — OCR for text extraction, counting for quantity questions, or full description by default. Responds in the language of your task.
image_url field separately| Permission | Scope | Reason |
|---|---|---|
| ------------ | ------- | -------- |
| Network | aiprox.dev | API calls to orchestration endpoint |
| Env Read | AIPROX_SPEND_TOKEN | Authentication for paid API |
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-d '{
"task": "描述这张图片的内容: https://example.com/photo.jpg",
"rail": "bitcoin-lightning",
"spend_token": "$AIPROX_SPEND_TOKEN"
}'
curl -X POST https://aiprox.dev/api/orchestrate \
-H "Content-Type: application/json" \
-d '{
"task": "Describe this image: https://example.com/photo.jpg",
"rail": "bitcoin-lightning",
"spend_token": "$AIPROX_SPEND_TOKEN"
}'
{
"description": "A modern office workspace with a standing desk and dual monitors.",
"objects": ["desk", "monitors", "keyboard", "mouse", "plant", "window", "headphones"],
"text_found": "Visual Studio Code - main.js"
}
Vision Bot analyzes images via URL or base64 input. Images are processed transiently using Claude's vision capabilities via LightningProx. No images are stored. Your spend token is used for payment only.
共 3 个版本