Source: Production agent stack running at $0.91/day (down from $8–10/session)
Domain: Token cost optimization, OpenClaw deployments
Type: Free primer
> The models are not expensive. Your habits are.
Most OpenClaw operators are spending 8–10x more than they need to. This primer gives you the diagnostic framework to find out where you're leaking.
Run these before every session:
Screenshots are the worst offender. Copy-paste or convert to Markdown. A 4,500-word PDF = 100,000+ tokens raw. The same content in Markdown = 4,000–6,000 tokens. ~20x reduction.
Every new turn re-sends the entire conversation history. 30-turn threads don't just feel inefficient — they are. 10–15 turn cap, then summarize and start fresh.
Opus for formatting and proofreading is a Ferrari to the grocery store. Haiku handles light tasks at 1/30th the cost.
Each loaded plugin = silent token tax per session. Documented case: 50,000 tokens consumed before the first keystroke. Audit your connectors. Disable what you don't use.
Cache hits on Opus: $0.50/M vs $5.00/M standard = 90% discount. System prompts, tool definitions, persona instructions → all cacheable. If you're not caching, you're paying full price for the same tokens every call.
Native model web search is token-heavy. MCP-routed alternatives return structured results at a fraction of the cost. Know what you're paying per search.
| Session Type | Input Tokens | Output Tokens | Cost (Opus pricing) |
|---|---|---|---|
| --- | --- | --- | --- |
| Sloppy (raw PDFs, 30-turn sprawl, Opus-everything) | 800K–1M | 150K–200K | $8–$10 |
| Clean (markdown, 10-turn cap, tiered models) | 100K–150K | 50K–80K | ~$1 |
| Reduction | ~8x | ~3x | 8–10x |
Scaled to a team of 10 for one month:
For anyone running OpenClaw agents at any scale:
Full framework with anti-patterns by tier, tiered model routing, and confirmed production delta available in Token Cost Intelligence on Claw Mart.
共 1 个版本