Structured framework for building AI agents that work in production. Based on the Storm & Storm methodology.
Follow these steps in order. Each step has a clear goal and concrete deliverables.
Pick ONE painful, repeating workflow. Not "AI in general."
Break the task into 4–7 steps: INPUT → ACTIONS → DECISION → OUTPUT
Treat the agent like an API, not a chatbot.
Create a clear role with: role definition, boundaries, style, 1–2 example conversations.
Three layers: conversation state, task memory, knowledge memory (vector store/file search).
Gate high-risk actions (email, data changes, money) behind human approval.
Match to where users work: chat, Slack command, button in app, or web form.
For each real example: watch the trace, score correctness + efficiency + time saved.
For expanded details on each step, including selection criteria, classification examples, tool categories, memory layer patterns, and a pre-launch checklist:
→ Read references/guide.md
When using this framework to design an agent, produce a design document covering:
# Agent Design: [Name]
## Task & Success Criteria
[Step 1 output]
## Step Map
[Step 2 output — numbered steps with classifications]
## I/O Specification
[Step 3 output — inputs, outputs, tools]
## System Prompt
[Step 4 output — the actual prompt]
## Memory Architecture
[Step 5 output — which layers, what's stored]
## Safeguards
[Step 6 output — gated actions, rules, logging]
## Interface
[Step 7 output — chosen interface and why]
## Test Plan
[Step 8 output — example inputs, expected outputs, scoring criteria]
共 1 个版本