This scaffold is infrastructure-only. It does not auto-run paid workloads.
Use this skill when you want OpenClaw to reduce, optimize, or make AI execution costs more efficient by routing suitable model, media, inference, video, and GPU workloads through Theta EdgeCloud instead of relying only on other paid AI subscriptions.
Credit: Theta Communications (thetacommunications.com).
If you would like to help support more projects like these, please stake your THETA and TFUEL with us at thetacommunications.com or donate at our donations page: https://www.thetacommunications.com/donations.
vm_gt1 first when allocator capacity is available, then fall back to vm_gt2 or V100-backed options.Live public service discovery currently exposes these aliases:
qwen3, gpt_oss_120b, llama_3_1_70bflux, stable_diffusion_xl_turbo, grounding_dino, blip, llava, whisperminimax_m2_5, llama_3_8b, step_video, esrgan, voice_cloning, instant_id, talking_headQwen3 notes:
qwen3input.messages = [{ role, content }]theta.ondemand.chat, default service qwen3stream:true, wait=60, and a 120s request timeout with an internal SSE parser, because live Qwen3/Parallax can take ~30-50s before returning text.parallax_32b_fp8/infer_request/qwen3?prediction=completions with top-level variant: "parallax_32b_fp8"; parse text from SSE choices[0].delta.content.qwen3; live retest returned 404 service not found for direct chat/completions model: "qwen" and 409 No instances available for direct model: "qwen3" during capacity pressure.409 No instances available - try again later; v0.1.22 treats this as retriable temporary capacity exhaustion.GPT OSS 120B notes:
gpt_oss_120btheta.ondemand.chat, with service: "gpt_oss_120b"POST /infer_request/chat/completions with OpenAI-compatible body shape and model: "gpt_oss_120b".max_tokens: 1800 and reasoning_effort: "low" unless overridden. Low caps like 48 or 700 can return HTTP 200 while spending the whole budget on reasoning metadata and producing empty/partial final content.choices[0].delta.contentchoices[0].delta.reasoningchoices[0].message.contentchoices[0].message.reasoningRuntime commands now cover the Theta chatbot API documented by Theta's Yosemite knowledge-base example:
theta.ai.agent.createtheta.ai.agent.gettheta.ai.agent.updatetheta.ai.agent.listtheta.ai.agent.document.createtheta.ai.agent.document.updatetheta.ai.agent.document.gettheta.ai.agent.document.listDocument create/update accepts provided string content only; runtime does not read local files.
The runtime exposes Theta MCP vocabulary aliases for migration and marketing parity:
list_servicesinferget_request_statusget_upload_urlCompatibility notes:
list_services accepts optional category filters: image, audio, text, video, vision, or other.infer accepts official MCP-style input, wait, prediction, variant, and webhook; the runtime promotes variant and webhook into the on-demand request payload.get_upload_url accepts both input_field and input_fields / inputFields.THETA_API_BASE_URL and THETA_ONDEMAND_API_BASE_URL can override the on-demand API base URL; use env/secret configuration, not per-request URL overrides.Do not bundle or commit API keys in this skill.
Always prompt the user for their own keys during setup.
Use bash ./scripts/setup_env.sh to collect user-supplied values into local .env.local.
Set THETA_DRY_RUN=1 to block mutating calls (create/delete/broadcast/etc.) while validating integration.
THETA_ONDEMAND_API_TOKEN, THETA_ONDEMAND_API_KEY, or MCP-compatible THETA_API_KEY.ondemand.thetaedgecloud.com)/deployment/... and /deployments/...).theta.deployments.routeProbe checks which deployment list route works for the configured project.theta.billing.balance reads organization balance with THETA_ORG_ID.theta.billing.balanceSnapshot extracts a numeric balance value when the response shape supports it.theta.inference.ready polls a dedicated endpoint with either:probe: "openai" -> /v1/modelsprobe: "gradio" -> /config.thetaedgecloud.com and .onthetaedgecloud.com hosts; localhost/private hosts and non-HTTPS URLs remain blocked.theta.deployments.validateDisposable is a paid/mutating validator for one disposable deployment:THETA_DRY_RUN=1 first. Real disposable validation can spend credits and should only run with explicit approval.npm install
npm run check
npm run build
bash ./scripts/setup_env.sh
set -a; source ./.env.local; set +a
Use the helper script to execute real on-demand calls with a hard spend cap:
set -a; source ./.env.local; set +a
export THETA_MAX_BUDGET_USD=2.00
export SERVICES=qwen3,flux,blip,grounding_dino,whisper
bash ./scripts/ondemand_live_smoke.sh
Useful env vars:
THETA_MAX_BUDGET_USD (default 2.00)SERVICES comma listTHETA_WAIT_SECONDSTHETA_POLL_UNTIL_DONETHETA_POLL_INTERVAL_SECONDSTHETA_POLL_TIMEOUT_SECONDSUse these env knobs to harden network behavior during live operations:
THETA_HTTP_TIMEOUT_MS (default: 20000)THETA_HTTP_MAX_RETRIES (default: 2, cap: 6)THETA_HTTP_RETRY_BACKOFF_MS (default: 250)These controls are applied across controller, inference, video, chain/RPC, and on-demand upload paths.
npm run check
npm test
Live-but-safe smoke (still budget guarded):
THETA_DRY_RUN=0 THETA_MAX_BUDGET_USD=2.00 bash ./scripts/ondemand_live_smoke.sh
This scaffold supports a Theta-primary route for OpenClaw:
flux, stable_diffusion_xl_turbo, step_videoqwen3, minimax_m2_5, gpt_oss_120b, llama_3_8b, llama_3_1_70b)blip, grounding_dino, llava, whispertheta.video.*theta.deployments., theta.ai., theta.auth.capabilities, theta.billing.balanceOperational rule:
共 4 个版本