← 返回
开发者工具 Key 中文

Doubleword API

Create, submit, monitor, and retrieve asynchronous batch AI inference jobs via the Doubleword API using JSONL files for large or cost-sensitive workloads.
通过 Doubleword API,使用 JSONL 文件创建、提交、监控和获取异步批量 AI 推理任务,适用于大规模或成本敏感的工作负载。
pjb157 pjb157 来源
开发者工具 clawhub v1.0.0 1 版本 99904.3 Key: 需要
★ 2
Stars
📥 2,047
下载
💾 61
安装
1
版本
#latest

概述

Doubleword Batch Inference

Process multiple AI inference requests asynchronously using the Doubleword batch API.

When to Use Batches

Batches are ideal for:

  • Multiple independent requests that can run simultaneously
  • Workloads that don't require immediate responses
  • Large volumes that would exceed rate limits if sent individually
  • Cost-sensitive workloads (24h window offers better pricing)

Quick Start

Basic workflow for any batch job:

  1. Create JSONL file with requests (one JSON object per line)
  2. Upload file to get file ID
  3. Create batch using file ID
  4. Poll status until complete
  5. Download results from output_file_id

Workflow

Step 1: Create Batch Request File

Create a .jsonl file where each line contains a single request:

{"custom_id": "req-1", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "anthropic/claude-3-5-sonnet", "messages": [{"role": "user", "content": "What is 2+2?"}]}}
{"custom_id": "req-2", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "anthropic/claude-3-5-sonnet", "messages": [{"role": "user", "content": "What is the capital of France?"}]}}

Required fields per line:

  • custom_id: Unique identifier (max 64 chars) - use descriptive IDs like "user-123-question-5" for easier result mapping
  • method: Always "POST"
  • url: Always "/v1/chat/completions"
  • body: Standard API request with model and messages

Optional body parameters:

  • temperature: 0-2 (default: 1.0)
  • max_tokens: Maximum response tokens
  • top_p: Nucleus sampling parameter
  • stop: Stop sequences

File limits:

  • Max size: 200MB
  • Format: JSONL only (JSON Lines - newline-delimited JSON)
  • Split large batches into multiple files if needed

Helper script:

Use scripts/create_batch_file.py to generate JSONL files programmatically:

python scripts/create_batch_file.py output.jsonl

Modify the script's requests list to generate your specific batch requests.

Step 2: Upload File

Upload the JSONL file:

curl https://api.doubleword.ai/v1/files \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY" \
  -F purpose="batch" \
  -F file="@batch_requests.jsonl"

Response contains id field - save this file ID for next step.

Step 3: Create Batch

Create the batch job using the file ID:

curl https://api.doubleword.ai/v1/batches \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "input_file_id": "file-abc123",
    "endpoint": "/v1/chat/completions",
    "completion_window": "24h"
  }'

Parameters:

  • input_file_id: File ID from upload step
  • endpoint: Always "/v1/chat/completions"
  • completion_window: Choose "24h" (better pricing) or "1h" (50% premium, faster results)

Response contains batch id - save this for status polling.

Step 4: Poll Status

Check batch progress:

curl https://api.doubleword.ai/v1/batches/batch-xyz789 \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY"

Status progression:

  1. validating - Checking input file format
  2. in_progress - Processing requests
  3. completed - All requests finished

Other statuses:

  • failed - Batch failed (check error_file_id)
  • expired - Batch timed out
  • cancelling/cancelled - Batch cancelled

Response includes:

  • output_file_id - Download results here
  • error_file_id - Failed requests (if any)
  • request_counts - Total/completed/failed counts

Polling frequency: Check every 30-60 seconds during processing.

Early access: Results available via output_file_id before batch fully completes - check X-Incomplete header.

Step 5: Download Results

Download completed results:

curl https://api.doubleword.ai/v1/files/file-output123/content \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY" \
  > results.jsonl

Response headers:

  • X-Incomplete: true - Batch still processing, more results coming
  • X-Last-Line: 45 - Resume point for partial downloads

Output format (each line):

{
  "id": "batch-req-abc",
  "custom_id": "request-1",
  "response": {
    "status_code": 200,
    "body": {
      "id": "chatcmpl-xyz",
      "choices": [{
        "message": {
          "role": "assistant",
          "content": "The answer is 4."
        }
      }]
    }
  }
}

Download errors (if any):

curl https://api.doubleword.ai/v1/files/file-error123/content \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY" \
  > errors.jsonl

Error format (each line):

{
  "id": "batch-req-def",
  "custom_id": "request-2",
  "error": {
    "code": "invalid_request",
    "message": "Missing required parameter"
  }
}

Additional Operations

List All Batches

curl https://api.doubleword.ai/v1/batches?limit=10 \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY"

Cancel Batch

curl https://api.doubleword.ai/v1/batches/batch-xyz789/cancel \
  -X POST \
  -H "Authorization: Bearer $DOUBLEWORD_API_KEY"

Notes:

  • Unprocessed requests are cancelled
  • Already-processed results remain downloadable
  • Cannot cancel completed batches

Common Patterns

Processing Results

Parse JSONL output line-by-line:

import json

with open('results.jsonl') as f:
    for line in f:
        result = json.loads(line)
        custom_id = result['custom_id']
        content = result['response']['body']['choices'][0]['message']['content']
        print(f"{custom_id}: {content}")

Handling Partial Results

Check for incomplete batches and resume:

import requests

response = requests.get(
    'https://api.doubleword.ai/v1/files/file-output123/content',
    headers={'Authorization': f'Bearer {api_key}'}
)

if response.headers.get('X-Incomplete') == 'true':
    last_line = int(response.headers.get('X-Last-Line', 0))
    print(f"Batch incomplete. Processed {last_line} requests so far.")
    # Continue polling and download again later

Retry Failed Requests

Extract failed requests from error file and resubmit:

import json

failed_ids = []
with open('errors.jsonl') as f:
    for line in f:
        error = json.loads(line)
        failed_ids.append(error['custom_id'])

print(f"Failed requests: {failed_ids}")
# Create new batch with only failed requests

Best Practices

  1. Descriptive custom_ids: Include context in IDs for easier result mapping
    • Good: "user-123-question-5"
    • Bad: "1", "req1"
  1. Validate JSONL locally: Ensure each line is valid JSON before upload
  1. Split large files: Keep under 200MB limit
  1. Choose appropriate window: Use 24h for cost savings, 1h only when time-sensitive
  1. Handle errors gracefully: Always check error_file_id and retry failed requests
  1. Monitor request_counts: Track progress via completed/total ratio
  1. Save file IDs: Store batch_id, input_file_id, output_file_id for later retrieval

Reference Documentation

For complete API details including authentication, rate limits, and advanced parameters, see:

  • API Reference: references/api_reference.md - Full endpoint documentation and schemas

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-28 16:18 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,233 📥 268,709
dev-programming

Doubleword API

pjb157
通过支持 OpenAI 兼容端点、工具调用和结构化输出的 Doubleword API,提交并管理异步批量 AI 推理任务。
★ 2 📥 2,062
ai-agent

self-improving agent

pskoett
捕获经验教训、错误及修正内容,以实现持续改进。适用于以下场景:(1)命令或操作意外失败;(2)用户纠正Claude(如“不,那不对……”“实际上……”);(3)用户请求的功能不存在;(4)外部API或工具出现故障;(5)Claude发现自身
★ 4,090 📥 818,604