Find token waste, repeated calls, and risky routing changes before they become real cost.
Finds agents that may be using too many tokens for the work they do.
Spots retries, loops, repeated tool calls, or duplicate scheduled jobs.
Flags model or routing changes that may increase token use.
Avoids judging from one noisy run or mixed workloads.
Tells you whether to watch, investigate one path, keep the status quo, or consider a rollback.
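The caution about single noisy runs and mixed workloads can be illustrated with a small sketch. This is not the skill's actual logic. The record fields (`workload`, `tokens`), the minimum of 5 runs, and the 20% threshold are all assumptions chosen for illustration.

```python
from collections import defaultdict
from statistics import median

MIN_RUNS = 5             # assumed minimum sample size per workload
CHANGE_THRESHOLD = 0.20  # assumed relative increase worth flagging


def group_by_workload(runs):
    """Group token counts by workload so different workloads are never pooled."""
    groups = defaultdict(list)
    for run in runs:
        groups[run["workload"]].append(run["tokens"])
    return groups


def compare_token_use(before, after):
    """Compare median tokens per workload, refusing to judge small samples."""
    old, new = group_by_workload(before), group_by_workload(after)
    verdicts = {}
    for workload in sorted(set(old) & set(new)):
        if len(old[workload]) < MIN_RUNS or len(new[workload]) < MIN_RUNS:
            verdicts[workload] = "under-sampled"
            continue
        base = median(old[workload])
        if base <= 0:
            verdicts[workload] = "no baseline"
            continue
        change = (median(new[workload]) - base) / base
        verdicts[workload] = (
            "increase" if change > CHANGE_THRESHOLD else "no clear increase"
        )
    return verdicts
```

Using the median rather than the mean keeps one outlier run from dominating the comparison, and grouping by workload avoids comparing unlike tasks.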
openclaw skills install agent-cost-eval-kit
openclaw skills install agent-cost-eval-kit --global
openclaw skills install agent-cost-eval-kit --global --force
eval cost change for [My_Agent]
eval token change for [My_Agent]
check token waste for [My_Agent]
No meaningful token waste is visible from the available evidence.
There may be waste, but evidence is not strong enough to act.
One specific agent, job, route, or task path looks suspicious.
The current setup looks better than the previous one. Do not roll back or make more changes yet.
A recent change may have increased token use or reduced reliability. Review before rollback.
The workflow is too risky, mixed, or under-sampled to make a safe judgment.
Every result starts with a status label, then gives the priority item, reason, evidence, risk level, and next action.
Status: Investigate One Path
Fix First:
daily_empty_signal_reminder
Reason:
Recent runs repeatedly returned NO_REPLY or produced no useful downstream action while still starting a full agent loop.
Evidence:
- Recent scheduled runs repeatedly returned NO_REPLY
- The job still consumed tokens on each run
- The upstream signal source appears empty or inactive
Risk:
Medium
Next Action:
Do read-only verification first. Confirm whether the reminder still has a valid upstream signal source before pausing, reducing frequency, or adding pre-agent filtering.
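If you want to track results over time, the labelled layout shown above is simple enough to parse. The following is a minimal sketch that assumes every result uses exactly the six labels from the example (`Status`, `Fix First`, `Reason`, `Evidence`, `Risk`, `Next Action`). The `EvalResult` class and `parse_result` function are hypothetical names, not part of the skill.

```python
from dataclasses import dataclass, field

HEADERS = ("Status", "Fix First", "Reason", "Evidence", "Risk", "Next Action")


@dataclass
class EvalResult:
    status: str = ""
    fix_first: str = ""
    reason: str = ""
    evidence: list = field(default_factory=list)
    risk: str = ""
    next_action: str = ""


def parse_result(text):
    """Split a result into its labelled sections (assumed layout)."""
    sections = {header: [] for header in HEADERS}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        head, sep, rest = line.partition(":")
        if sep and head in HEADERS:
            # A label line; any text after the colon belongs to that section.
            current = head
            line = rest.strip()
            if not line:
                continue
        if current is not None:
            sections[current].append(line)

    # Evidence items are written as "- item"; drop the bullet marker.
    evidence = [
        item[2:] if item.startswith("- ") else item
        for item in sections["Evidence"]
    ]
    return EvalResult(
        status=" ".join(sections["Status"]),
        fix_first=" ".join(sections["Fix First"]),
        reason=" ".join(sections["Reason"]),
        evidence=evidence,
        risk=" ".join(sections["Risk"]),
        next_action=" ".join(sections["Next Action"]),
    )
```

A body line that happens to start with one of the labels followed by a colon would be treated as a new section, which is acceptable for a sketch but worth knowing.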
A short output is not waste by itself. Waste means the task consumed tokens without producing useful downstream value.
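The definition above can be made concrete with a short sketch. It deliberately ignores reply length and looks only at whether tokens were spent without any downstream effect. The run fields (`job`, `tokens`, `downstream_actions`) and the threshold of three repeats are assumptions for illustration, not the skill's real data model.

```python
from collections import Counter


def consumed_without_value(run):
    """True when a run spent tokens but triggered no downstream action."""
    return run["tokens"] > 0 and not run["downstream_actions"]


def repeated_offenders(runs, min_hits=3):
    """Jobs that consumed tokens without value at least `min_hits` times."""
    hits = Counter(run["job"] for run in runs if consumed_without_value(run))
    return [job for job, count in hits.items() if count >= min_hits]
```

Requiring several hits before naming a job keeps a single empty run from being reported as waste.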
It will not:
This skill is read-only. It may recommend an action, but it will not ask you to immediately disable, edit, delete, switch, or change production configs.
Redact sensitive data before pasting logs.
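One way to do this is a small pre-processing pass over the log text. The sketch below is a starting point only: the three patterns (bearer tokens, `key=value` style secrets, and email addresses) are assumptions about what your logs may contain and will not catch every kind of sensitive data, so review the output before pasting.

```python
import re

# Each entry is (pattern, replacement). The patterns are illustrative, not exhaustive.
PATTERNS = [
    # "Bearer <token>" in authorization headers
    (re.compile(r"(?i)\b(bearer)\s+[A-Za-z0-9._~+/=-]+"), r"\1 [REDACTED]"),
    # api_key=..., token: ..., secret=..., password=...
    (
        re.compile(r"(?i)\b(api[_-]?key|token|secret|password)(\s*[=:]\s*)\S+"),
        r"\1\2[REDACTED]",
    ),
    # email addresses
    (re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"), "[REDACTED_EMAIL]"),
]


def redact(text):
    """Apply every pattern in order and return the redacted text."""
    for pattern, replacement in PATTERNS:
        text = pattern.sub(replacement, text)
    return text
```

Ordinary usage fields such as `tokens: 1800` are left intact, because the secret pattern only matches the exact keywords followed by `=` or `:`.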
2 versions in total