← 返回
AI智能 中文

agent-architecture-evaluator

Use when evaluating, testing, and optimizing an agent architecture or multi-agent system. Best for reviewing planning, routing, memory, tool use, reliability...
用于评估、测试和优化智能体架构或多智能体系统。最适用于审查规划、路由、记忆、工具使用、可靠性……
ada01325150-alt
AI智能 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 558
下载
💾 10
安装
1
版本
#latest

概述

Agent Architecture Evaluator

Version: 1.0.0

Overview

This skill reviews the architecture of an agent system, not just its prompts or its attached skills.

Use it for architectures involving components such as:

  • planner / executor splits
  • routers and specialists
  • tool-use layers
  • memory systems
  • human approval gates
  • multi-agent coordination

Use this skill when

  • A user wants to assess an existing agent architecture.
  • Reliability, latency, cost, or coordination problems appear to be architectural.
  • A team needs a structured architecture review and optimization roadmap.
  • You need system-level test scenarios rather than single-skill evals.

Do not use this skill when

  • The problem is one isolated skill.
  • The task is to create a new skill from scratch.
  • The main need is portfolio review across many related skills.

Use agent-test-measure-refine or agent-skill-portfolio-evaluator in those cases.

Output contract

Always produce these named outputs:

  • architecture_inventory
  • failure_mode_map
  • architecture_test_plan
  • optimization_roadmap
  • measurement_plan
  • architecture_recommendation

Review dimensions

Evaluate at least these dimensions:

  1. component clarity
  2. routing correctness
  3. memory usefulness
  4. coordination reliability
  5. cost and latency efficiency
  6. observability and debuggability

Quick start

  1. Map the current architecture.
  2. Identify critical paths and failure-prone handoffs.
  3. Define architecture-level test scenarios.
  4. Identify bottlenecks in routing, memory, tools, or coordination.
  5. Recommend the smallest structural changes with the highest leverage.

Workflow

1. Build the architecture inventory

Capture:

  • components
  • responsibilities
  • inputs and outputs
  • state or memory boundaries
  • human approval points
  • observability signals

2. Map failure modes

Look for:

  • planner produces unusable tasks
  • router sends work to the wrong specialist
  • memory pollutes current decisions
  • tool calls are slow, redundant, or poorly validated
  • multi-agent handoffs lose context
  • approval gates appear too late

3. Design system tests

Cover:

  • happy path
  • degraded upstream input
  • partial component failure
  • tool unavailability
  • stale or noisy memory
  • high-latency coordination
  • rollback or recovery behavior

See references/architecture-review-framework-v1.0.0.md.

4. Prioritize architectural changes

Prefer:

  • clarifying responsibilities before adding components
  • removing weak indirection
  • tightening interface contracts
  • adding observability before adding complexity
  • isolating state when cross-contamination is likely

5. Define measurement

Recommend concrete metrics where available:

  • task success rate
  • retry rate
  • fallback rate
  • cost per successful task
  • latency by stage
  • human intervention rate

Anti-patterns

  • adding new components to hide unclear ownership
  • keeping weak memory because it sounds sophisticated
  • optimizing one stage without measuring system impact
  • blaming prompts for structural routing failures

Resources

  • references/architecture-review-framework-v1.0.0.md for system review steps.
  • references/optimization-patterns-v1.0.0.md for architecture optimization guidance.
  • assets/architecture-review-template.md for the final report structure.
  • assets/example-architecture-review.md for a realistic filled review.
  • assets/architecture-input-example.json for structured input.
  • scripts/render_architecture_review.py to normalize a structured architecture review into Markdown.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 03:22 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,353 📥 317,862
ai-intelligence

Proactive Agent

halthelobster
将AI智能体从任务执行者升级为主动预判需求、持续优化的智能伙伴。集成WAL协议、工作缓冲区、自主定时任务及实战验证模式。Hal Stack核心组件 🦞
★ 834 📥 212,904
ai-intelligence

ontology

oswalpalash
类型化知识图谱,用于结构化智能体记忆与可组合技能。支持创建/查询实体(人员、项目、任务、事件、文档)及关联...
★ 710 📥 243,623