Install: clawhub install autoresearch-pilot
Your co-pilot for Karpathy's autoresearch — autonomous AI-driven LLM training experiments on a single GPU.
Detect from user's message language. Default: English.
Autoresearch lets an AI agent modify train.py, run 5-minute experiments, check if val_bpb improved, and iterate. This skill helps you set it up, write optimal program.md, and interpret results.
| File | Role | Modified by |
|---|---|---|
| ------ | ------ | ------------- |
prepare.py | Data prep, tokenizer, utilities | Never (fixed) |
train.py | Model, optimizer, training loop | The AI agent |
program.md | Instructions for the AI agent | You (the human) |
Walk the user through these steps when they want to start:
uv package managergit clone https://github.com/karpathy/autoresearchuv sync inside the repouv run prepare.py (one-time, ~2 min)uv run train.py (should complete in ~5 min)When the user has a smaller GPU, suggest these prepare.py changes:
vocab_size to 4096 or 2048 (or 256 for byte-level)MAX_SEQ_LEN to 256DEPTH to 4 in train.pyWINDOW_PATTERN of "L" onlyTOTAL_BATCH_SIZE to 2**14When the user asks for help with program.md, help them define:
Example structure for program.md:
When the user shares experiment logs:
| Metric | Good | Bad |
|---|---|---|
| -------- | ------ | ----- |
| val_bpb decreasing | Model is learning | Check for bugs |
| val_bpb plateaued | May need architecture change | Normal for small models |
| Training loss << val loss | Overfitting | Increase regularization |
| NaN loss | Learning rate too high or instability | Lower LR, check gradients |
| User says | Action |
|---|---|
| ----------- | -------- |
| "set up autoresearch" | Walk through setup steps |
| "help me write program.md" | Draft research instructions |
| "my val_bpb is X" | Evaluate and suggest next steps |
| "optimize for small GPU" | Suggest parameter changes |
| "what should I try next" | Analyze recent experiments, propose new direction |
Install the full suite:
clawhub install autoresearch-pilot setup-doctor context-brief model-pilot
共 1 个版本