← 返回
AI智能 Key

Train Robotic AI Models using Qualia

Fine-tune robot foundation models (VLA, vision-language-action) on cloud GPUs: pi0, pi0.5 (π0.5), GR00T N1.5, ACT, SmolVLA, SARM reward models. Robotics and...
Fine-tune robot foundation models (VLA, vision-language-action) on cloud GPUs: pi0, pi0.5 (π0.5), GR00T N1.5, ACT, SmolVLA, SARM reward models. Robotics and...
fabbe1999 fabbe1999 来源
AI智能 clawhub v2.1.0 3 版本 99875.5 Key: 需要
★ 2
Stars
📥 762
下载
💾 25
安装
3
版本
#latest

概述

Qualia

Fine-tune Vision-Language-Action (VLA) models for robotics on cloud GPUs.

Setup

  1. Sign up at app.qualiastudios.dev
  2. Create an API key (Settings → API Keys)
  3. Set the env var:

```bash

export QUALIA_API_KEY="your-api-key"

```

Verify your install

First thing after setup, run the self-test:

python3 {baseDir}/scripts/qualia.py doctor

It checks the API key, auth/connectivity (/v1/credits), and the models endpoint. Exit 0 means everything works. Add --json for machine-readable results.

Machine-readable output

Every command accepts a global --json flag. In JSON mode, stdout carries exactly one JSON object or array and nothing else. Errors are emitted on stdout as {"error": {"code": , "message": "...", "details": ...}} with the matching exit code.

python3 {baseDir}/scripts/qualia.py --json credits
# {"balance": 90784}

Prefer --json when driving the CLI programmatically; parse stdout, branch on exit code.

Exit codes

CodeMeaning
---------------
0Success
1Generic/unknown failure
2Usage error (bad arguments, unknown command)
3Auth error (HTTP 401/403 or missing QUALIA_API_KEY)
4Insufficient credits (HTTP 402)
5Validation error (bad camera mapping, hyperparams, or dataset; HTTP 400/422)
6Not found (HTTP 404)
7Connection/network error

When Someone Asks to Train a Model

They probably won't give you everything upfront. Here's what you need and how to get it:

  1. Dataset: ask for their HuggingFace dataset ID (e.g. your-org/your-dataset)
  2. Model type: if they don't specify, run models and help them choose:
    • Quick prototyping → suggest ACT (fast, no base model needed)
    • Production quality → suggest π0.5 or π0
    • Humanoid robots → suggest GR00T N1.5
    • Resource-conscious → suggest SmolVLA
  3. Training duration: if unspecified, suggest 2 to 4 hours for a first run
  4. Camera mapping: run dataset-keys on their dataset, then models to see required slots, and map them automatically. Confirm with the user before launching.

If the user already has a project, use it. Otherwise create one.

When Things Go Wrong

SymptomLikely causeFix
---------------------------
Job stuck at credit_validationInsufficient creditsRun credits, tell user to top up
Fails at dataset_preprocessingBad camera mapping or invalid datasetRe-check dataset-keys output, verify mapping
Fails at instance_bootingGPU capacity issueTry a different instance type or region
Job failed with no clear errorCheck phase eventsRun status and read the event messages

Always run status and share the full phase history with the user when debugging.

Quick Start

# See what models are available (always check, new ones are added regularly)
python3 {baseDir}/scripts/qualia.py models

# Check GPU options and pricing
python3 {baseDir}/scripts/qualia.py instances

# Check your credit balance
python3 {baseDir}/scripts/qualia.py credits

Train a Model

# 1. Discover image keys in your dataset
python3 {baseDir}/scripts/qualia.py dataset-keys your-org/your-dataset

# 2. Create a project
python3 {baseDir}/scripts/qualia.py project-create "My Robot"

# 3. Launch training
python3 {baseDir}/scripts/qualia.py finetune <project_id> <vla_type> your-org/your-dataset 4 \
  '{"cam_1": "observation.images.top"}' \
  --model <base_model_id> \
  --name "My run"

# 4. Monitor
python3 {baseDir}/scripts/qualia.py status <job_id>

Notes:

  • Run models first to see which VLA types require --model and which don't
  • Camera mappings map model slots (from models) to dataset image keys (from dataset-keys)
  • Smart camera mapping: The API returns generic slot names (cam_1, cam_2, cam_3) but the underlying models have a specific input order. Map semantically using these known orders:
  • π0.5 / π0: cam_1 = base/overview camera, cam_2 = left wrist/arm, cam_3 = right wrist/arm
  • GR00T N1.5: cam_1 = base/overview camera, cam_2 = left wrist/arm, cam_3 = right wrist/arm
  • ACT / SmolVLA: cam_1 = primary camera, cam_2/cam_3 = secondary views
  • Fuzzy-match dataset keys to these roles: context_camera or base_0cam_1; left_wristleft_armcam_2; right_wristright_armcam_3
  • Omit --model for types that don't support custom models
  • Estimate cost before launching: run instances to get credits/hr, multiply by hours. Tell the user the estimated cost before confirming.
  • Dataset IDs on HuggingFace are case-sensitive, double-check the exact ID

Manage Jobs & Projects

python3 {baseDir}/scripts/qualia.py projects                     # List projects and jobs
python3 {baseDir}/scripts/qualia.py status <job_id>              # Job status and phase history
python3 {baseDir}/scripts/qualia.py cancel <job_id>              # Cancel a running job
python3 {baseDir}/scripts/qualia.py project-delete <project_id>  # Delete a project

Custom Hyperparameters

# Get defaults
python3 {baseDir}/scripts/qualia.py hyperparams <vla_type> [model_id]

# Validate overrides
python3 {baseDir}/scripts/qualia.py hyperparams-validate <vla_type> '{"learning_rate": 1e-4}'

# Use in training
python3 {baseDir}/scripts/qualia.py finetune ... --hyper-spec '{"learning_rate": 1e-4, "num_epochs": 50}'

Finetune Flags

FlagDescription
-------------------
--model Base model ID (required for some VLA types)
--name Job display name
--instance GPU instance type
--region Cloud region
--batch-size Batch size (1-512, default 32)
--hyper-spec ''Custom hyperparameters
--rabc Enable RA-BC with SARM reward model (HF path)
--rabc-image-key Image key for reward annotations
--rabc-head-mode RA-BC head mode (e.g. sparse)

RA-BC (Reward-Aware Behavior Cloning)

Use a trained SARM reward model to weight training samples. Supported on smolvla, pi0, pi05.

python3 {baseDir}/scripts/qualia.py finetune \
  <project_id> pi0 your-org/your-dataset 4 \
  '{"cam_1": "observation.images.top"}' \
  --model lerobot/pi0 \
  --rabc your-org/sarm-reward-model \
  --rabc-image-key observation.images.top \
  --rabc-head-mode sparse

Job Phases

queuing → credit_validation → instance_booting → instance_activation → instance_setup → dataset_preprocessing → training_running → model_uploading → completed

Terminal: completed, failed, cancelled

Live Docs

For the latest models, endpoints, and capabilities, always check the live documentation:

Links

  • Platform: https://app.qualiastudios.dev
  • Docs: https://docs.qualiastudios.dev
  • API: https://api.qualiastudios.dev (auth via X-API-Key header)

版本历史

共 3 个版本

  • v2.1.0 当前
    2026-06-11 17:17
  • v2.0.0
    2026-05-03 03:48 安全 安全
  • v1.0.0
    2026-03-30 04:59 安全 安全

安全检测

腾讯云安全 (Keen)

队列中

腾讯云安全 (Sanbu)

队列中

🔗 相关推荐

ai-intelligence

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 418 📥 115,264
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,360 📥 318,618
ai-intelligence

ontology

oswalpalash
类型化知识图谱,用于结构化智能体记忆与可组合技能。支持创建/查询实体(人员、项目、任务、事件、文档)及关联...
★ 713 📥 243,958