← 返回
AI智能

Smart Model Switching

Auto-route tasks to the cheapest Claude model that works correctly. Three-tier progression: Haiku → Sonnet → Opus. Classify before responding. HAIKU (default): factual Q&A, greetings, reminders, status checks, lookups, simple file ops, heartbeats, casual chat, 1-2 sentence tasks. ESCALATE TO SONNET: code >10 lines, analysis, comparisons, planning, reports, multi-step reasoning, tables, long writing >3 paragraphs, summarization, research synthesis, most user conversations. ESCALATE TO OPUS: archi
自动将任务路由至能胜任的最低成本Claude模型,采用三级递进:Haiku → Sonnet → Opus。响应前先分类。HAIKU(默认):事实问答、问候、提醒、状态查询、简单文件操作、心跳、闲聊、1-2句任务。升配至SONNET:>10行代码、分析对比、规划报告、多步推理、表格、>3段长文、摘要、研究综合及多数对话。升配至OPUS:架构决策、复杂调试、多文件重构、战略规划、深度研判、深度研究及关键生产决策。规则:若需人类专注思考超30秒,或Sonnet处理吃力,则升至Opus。低成本起步按需升级,可节省50-90% API成本。
millibus
AI智能 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 24
Stars
📥 9,142
下载
💾 2,458
安装
1
版本
#latest

概述

Smart Model Switching

Three-tier Claude routing: Haiku → Sonnet → Opus

Start with the cheapest model. Escalate only when needed. Save 50-90% on API costs.

The Golden Rule

> If a human would need more than 30 seconds of focused thinking, escalate from Haiku to Sonnet.

> If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Opus.

Cost Reality

ModelInputOutputRelative Cost
-------------------------------------
Haiku\$0.25/M\$1.25/M1x (baseline)
Sonnet\$3.00/M\$15.00/M12x
Opus\$15.00/M\$75.00/M60x

Bottom line: Wrong model selection wastes money OR time. Haiku for simple, Sonnet for standard, Opus for complex.


💚 HAIKU — Default for Simple Tasks

Stay on Haiku for:

  • Factual Q&A — "what is X", "who is Y", "when did Z"
  • Quick lookups — definitions, unit conversions, short translations
  • Status checks — calendar, file reads, session monitoring
  • Heartbeats — periodic checks, HEARTBEAT_OK responses
  • Memory & reminders — "remember this", "remind me to..."
  • Casual conversation — greetings, small talk, acknowledgments
  • Simple file ops — read, list, basic writes
  • One-liner tasks — anything answerable in 1-2 sentences

NEVER do these on Haiku

  • ❌ Write code longer than 10 lines
  • ❌ Create comparison tables
  • ❌ Write more than 3 paragraphs
  • ❌ Do multi-step analysis
  • ❌ Write reports or proposals

💛 SONNET — Standard Work (The Workhorse)

Escalate to Sonnet for:

Code & Technical

  • Code generation — write functions, build features, scripts
  • Code review — PR reviews, quality checks
  • Debugging — standard bug investigation
  • Documentation — README, comments, user guides

Analysis & Planning

  • Analysis & evaluation — compare options, assess trade-offs
  • Planning — project plans, roadmaps, task breakdowns
  • Research synthesis — combining multiple sources
  • Multi-step reasoning — "first... then... finally"

Writing & Content

  • Long-form writing — reports, proposals, articles (>3 paragraphs)
  • Creative writing — blog posts, descriptions, copy
  • Summarization — long documents, transcripts
  • Structured output — tables, outlines, formatted docs

❤️ OPUS — Complex Reasoning Only

Escalate to Opus for:

Architecture & Design

  • System architecture decisions
  • Major codebase refactoring
  • Design pattern selection with tradeoffs
  • Database schema design

Deep Analysis

  • Complex debugging (multi-file, race conditions)
  • Security reviews
  • Performance optimization strategy
  • Root cause analysis of subtle bugs

Strategic & Creative

  • Strategic planning — business decisions, roadmaps
  • Nuanced judgment — ethics, ambiguity, competing values
  • Deep research — comprehensive multi-source analysis

🔄 Implementation

For Subagents

\\\`javascript

// Routine monitoring

sessions_spawn(task="Check backup status", model="haiku")

// Standard code work

sessions_spawn(task="Build the REST API endpoint", model="sonnet")

// Architecture decisions

sessions_spawn(task="Design the database schema for multi-tenancy", model="opus")

\\\`

For Cron Jobs

\\\`json

{

"payload": {

"kind": "agentTurn",

"model": "haiku"

}

}

\\\`

Always use Haiku for cron unless the task genuinely needs reasoning.


📊 Quick Decision Tree

\\\`

Is it a greeting, lookup, status check, or 1-2 sentence answer?

YES → HAIKU

NO ↓

Is it code, analysis, planning, writing, or multi-step?

YES → SONNET

NO ↓

Is it architecture, deep reasoning, or critical decision?

YES → OPUS

NO → Default to SONNET, escalate if struggling

\\\`


📋 Quick Reference Card

\\\`

┌─────────────────────────────────────────────────────────────┐

│ SMART MODEL SWITCHING │

│ Haiku → Sonnet → Opus │

├─────────────────────────────────────────────────────────────┤

│ 💚 HAIKU (cheapest) │

│ • Greetings, status checks, quick lookups │

│ • Factual Q&A, definitions, reminders │

│ • Simple file ops, 1-2 sentence answers │

├─────────────────────────────────────────────────────────────┤

│ 💛 SONNET (standard) │

│ • Code > 10 lines, debugging │

│ • Analysis, comparisons, planning │

│ • Reports, proposals, long writing │

├─────────────────────────────────────────────────────────────┤

│ ❤️ OPUS (complex) │

│ • Architecture decisions │

│ • Complex debugging, multi-file refactoring │

│ • Strategic planning, deep research │

├─────────────────────────────────────────────────────────────┤

│ 💡 RULE: If a human needs > 30 sec thinking → escalate │

│ 💰 COST: Haiku 1x → Sonnet 12x → Opus 60x │

└─────────────────────────────────────────────────────────────┘

\\\`


Built for Claude-only setups with Haiku, Sonnet, and Opus.

Inspired by save-money skill, extended with three-tier progression.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-28 00:07 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Proactive Agent

halthelobster
将AI智能体从任务执行者升级为主动预判需求、持续优化的智能伙伴。集成WAL协议、工作缓冲区、自主定时任务及实战验证模式。Hal Stack核心组件 🦞
★ 833 📥 212,782
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,349 📥 317,700
ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误和纠正,以实现持续改进。使用时机:(1)命令或操作意外失败;(2)用户纠正……
★ 4,055 📥 795,992