You are an instructor, teacher, or course designer who wants to improve how much students actually learn and retain — not just how much is covered. Typical situations:
Before starting, verify:
Mode: Hybrid — The agent designs the course structure, intervention calendar, and transparency materials. The instructor executes and adapts them to their specific classroom.
→ Check for: uploaded syllabus, course outline, pasted description
→ If missing, ask: "Please describe your course: what subject, what level, how many students, how often do you meet, and how long does the course run?"
→ Check for: exam dates in syllabus, grading breakdown table, assignment list
→ If missing, ask: "How are students currently graded? What assessments exist, and when do they happen?"
→ Check for: explicit problem statement in the prompt ("students forget," "high failure rate," "passive lectures")
→ If missing, ask: "What problem are you trying to solve with this redesign?"
→ Look for: course name, discipline cues, learning objectives in syllabus
→ Look for: in-class activities section of syllabus, assignment descriptions
SUFFICIENT when ALL of these are true:
✓ Subject and level are known
✓ Current assessment structure is described (or default assumption is acceptable)
✓ Primary concern or goal is stated
BLOCK if: no subject and no description — cannot design interventions without knowing what students are learning
Collect and record:
WHY: Intervention design must be calibrated to the specific constraints of the course. A 25-person seminar meeting twice a week calls for different mechanics than a 200-person lecture hall meeting three times a week. Gathering specifics first prevents generic recommendations that cannot be implemented.
Output: A course profile summary (3-5 sentences) confirmed with the instructor before continuing.
Evaluate the current course against each criterion. Rate each: Present / Partial / Absent.
| Criterion | What to look for | Research basis |
|---|---|---|
| ----------- | ----------------- | ---------------- |
| C1. Low-stakes retrieval practice | Frequent quizzes, self-tests, or recall exercises that occur during the course (not only at midterm/final) | Testing effect: retrieval consolidates learning and arrests forgetting. Quizzed material retained at 92% vs 79% for non-quizzed in Columbia IL study |
| C2. Spaced and cumulative review | Assessments or exercises that reach back to prior material — not only the most recent unit | Spacing forces reconstruction from long-term memory, strengthening the neural pathway. Cumulative quizzing builds complex mental models |
| C3. Interleaving across topics | Topics or problem types mixed across sessions rather than blocked by unit | Interleaving requires students to "reload" prior knowledge and identify which approach applies, building discrimination and transfer |
| C4. Consequences (low-stakes) | Practice exercises count toward the course grade, even at very low weight | Students in graded low-stakes practice classes outperform students in classes where the same exercises are ungraded |
| C5. Transparency about how learning works | Instructor explicitly explains why desirable difficulties are used and what the research shows | Students who understand the mechanism tolerate the discomfort of effortful practice and persist longer |
| C6. Calibration feedback | Students receive timely feedback that reveals what they know vs. what they think they know | Without calibration, students rely on fluency illusions and study the wrong material. The "illusion of knowing" is the primary cause of exam failure despite adequate study time |
For each criterion rated Partial or Absent, note the specific gap (e.g., "C1 absent: only two exams, no in-class quizzing").
WHY: Most course designs fail on C1 and C2 — they schedule high-stakes practice too infrequently and too late. Students in low-structure courses (lecture + two exams) have no mechanism to discover gaps before the exam, no incentive to practice distributed retrieval, and no feedback on whether their self-assessed confidence matches actual mastery. The audit makes the specific gaps visible before designing interventions.
For detailed difficulty classification across all six strategies (including generation, elaboration, variation), invoke desirable-difficulty-classifier OR apply the 6-criteria audit above directly.
Output: A filled audit table with Present/Partial/Absent ratings and gap notes.
Apply each of the four protocol elements to the specific course. For each, produce a concrete implementation plan — not just a principle.
Design a brief transparency module to deliver at the start of the course (or first class). Include:
Example — Wenderoth's framing: "The whole idea of the testing effect is that you learn more by testing yourself than by rereading. I know this contradicts how most of you study. So I'm going to model it in class, and I'm going to show you your results over the semester so you can see it working."
WHY: Students who understand the mechanism tolerate the discomfort of effortful practice and persist longer. Students who don't understand it interpret quiz difficulty as unfairness and disengage. Wenderoth's research shows that students who are told they "have the illusion of knowing" and understand what that means come to her with solvable problems rather than complaints about trick questions.
Design at least two study strategies to explicitly teach (not just recommend):
For detailed retrieval practice implementation for individual students, invoke retrieval-practice-study-system OR apply the free recall and self-quizzing pattern above directly.
WHY: Students are not taught how to study effectively and tend toward the least effective strategies (rereading, highlighting). Explicitly modeling correct study technique — rather than assuming students know it — is a prerequisite for the active-learning interventions in the course to work. Students who don't know how to retrieve cannot benefit from a high-structure course.
Design 3-5 specific interventions drawn from the following menu. Select based on the audit gaps (Step 2) and the course constraints.
Intervention A — Daily or bi-weekly low-stakes quizzes
Example — McDermott (Washington University): 4-question quiz in the last 3-5 minutes of every class meeting (28 meetings, 2×/week). Anything covered to date is fair game. Students drop 4 quizzes; no makeups. Quizzes = 20% of grade. By end of semester, students report quizzes helped them keep up and discover gaps early.
Example — Sobel (political economics): Cumulative low-stakes quizzing throughout the term. Each quiz can draw from any material covered so far. Students build incrementally complex mental models rather than cramming disconnected units.
Intervention B — Pre-class generation exercises
Example — Matthews/West Point Thayer method: Students read for specific learning objectives before each class; class opens with a quiz on those objectives; then students "take to the boards" — higher-order questions requiring integration, worked at the board in groups; one student per group gives a recitation to the class. Zero lecture. The grade rests on consistent daily participation.
Intervention C — Board work and active recitation
Intervention D — Bloom's taxonomy answer keys
Example — Wenderoth: Students receive the Bloom's key with their graded tests. They identify their level for each answer. The exercise shifts the question from "did I get it right?" to "at what level did I understand this?" and produces a specific, actionable study target.
Intervention E — Learning paragraphs (periodic writing)
Designing the spacing and interleaving structure:
For detailed spacing and interleaving schedule design across topics, invoke practice-schedule-designer OR apply this pattern directly:
WHY: Research consistently shows that students in high-structure classes (daily/weekly low-stakes practice) outperform students in low-structure classes (lecture + two exams) — and the gap is largest for under-prepared students. The key finding from Wenderoth's biology experiments: high-structure classes significantly reduced failure rates in gateway biology while simultaneously raising performance on Bloom's higher-order levels. The mechanism is that low-structure courses leave students to self-regulate their study, and most students self-regulate poorly until they have been taught otherwise.
Ongoing transparency (not just at the start of the course):
WHY: Transparency is not a one-time orientation. The discomfort of effortful practice recurs throughout the semester, and students need a recurring framework for interpreting that discomfort as signal rather than failure. Without ongoing explanation, students who struggle will attribute difficulty to bad teaching rather than effective learning design.
Based on the audit results and the 4-part protocol, produce:
4a. Quiz schedule design
4b. Reflection exercise plan
4c. Interleaving plan
4d. Bloom's level design
WHY: Without concrete artifacts — a quiz schedule, a sample question set, an interleaving map — instructors revert to their existing structure because it is familiar and low-effort. The interventions must be specifiable enough to execute in the first week of class.
Produce a complete redesigned course plan including:
WHY: The goal is a course design that can be handed to another instructor and implemented without additional planning. Generality disappears at the level of execution; specificity is what makes redesign actionable.
Output: Write the redesigned course plan to redesigned-course-plan.md.
| Input | Required | Description |
|---|---|---|
| ------- | ---------- | ------------- |
| Course description | Yes | Subject, level, class size, meeting frequency, duration |
| Current assessment structure | Yes (or default assumed) | What assessments exist, when, what weight |
| Primary concern or redesign goal | Yes | What is not working or what the instructor wants to improve |
| Course syllabus or outline | Preferred | The document to audit and redesign |
| Student population description | No | Mix of prepared/under-prepared; equity concerns |
| Output | Format | Description |
|---|---|---|
| -------- | -------- | ------------- |
redesigned-course-plan.md | Markdown | Full redesigned course plan with all 5 components from Step 5 |
# Redesigned Course Plan: [Course Name]
**Instructor:** [Name or role]
**Course:** [Subject, level, class size, meeting frequency]
**Primary concern addressed:** [e.g., "High failure rate in gateway biology"]
**Redesign date:** [Date]
## Design Audit Results
| Criterion | Rating | Gap |
|-----------|--------|-----|
| C1. Low-stakes retrieval practice | Present / Partial / Absent | [gap note] |
| C2. Spaced and cumulative review | Present / Partial / Absent | [gap note] |
| C3. Interleaving across topics | Present / Partial / Absent | [gap note] |
| C4. Consequences (low-stakes) | Present / Partial / Absent | [gap note] |
| C5. Transparency about learning | Present / Partial / Absent | [gap note] |
| C6. Calibration feedback | Present / Partial / Absent | [gap note] |
## Interventions Selected
[List of 3-5 interventions from Step 3, with brief rationale for each]
## Transparency Module (First Class)
[10-15 minute outline or script]
## Assessment Calendar
| Week | Assessment | Type | Covers | Grade weight |
|------|-----------|------|--------|-------------|
| 1 | ... | ... | ... | ... |
| ... | ... | ... | ... | ... |
## Weekly Intervention Calendar
| Week | Quizzes | Reflection exercises | Board work | Other |
|------|---------|---------------------|------------|-------|
| 1 | ... | ... | ... | ... |
| ... | ... | ... | ... | ... |
## Ground Rules Document
[Policies for low-stakes practice: free misses, no-makeup rule, weighting]
## Student-Facing Explanation (for Syllabus)
[1 paragraph]
1. Low-stakes retrieval practice is the highest-leverage classroom intervention
The single most impactful change a teacher can make is to add frequent, low-stakes retrieval practice — quizzes that happen during the course rather than only at the end. The Columbia, Illinois middle school study found 92% retention on quizzed material vs. 79% on non-quizzed material in the same class. McDermott's university course achieves this with 4 questions in the last 5 minutes of every class meeting. The investment is minimal; the learning gain is substantial.
2. High-structure courses close achievement gaps
Wenderoth's biology experiments show that high-structure classes (daily and weekly low-stakes exercises) significantly reduce failure rates in gateway biology courses while simultaneously raising Bloom's-level performance. The gap narrows most for under-prepared students — those without a prior history of effective learning habits. High-structure is not remediation; it is structure that helps all students and helps struggling students most.
3. Consequences matter, even at very low stakes
Students in courses where practice exercises count toward the grade — even at minimal weight — outperform students in the same course where the exercises carry no consequences. The exercises being identical means the difference is purely in motivation to engage. Low-stakes grading is not coercive; it is the signal that the exercises are real, not optional.
4. Transparency is a design element, not a nice-to-have
When students understand why effortful retrieval feels uncomfortable and why that discomfort signals learning, they tolerate it and persist. When they don't understand it, they interpret difficulty as unfairness and disengage. Transparency is not soft pedagogy — it changes the behavioral response to difficulty and determines whether the rest of the course design can work.
5. The Thayer method proves high-structure scales across disciplines
West Point's Thayer method — specific learning objectives before every class, daily quizzing at the start, board work on higher-order questions, student recitation — has been sustained across almost 200 years and multiple disciplines. Matthews's courses run with essentially zero lecturing. The method works best for students who need structure to develop study discipline, and it scales from elite academies to Riverside Military Academy's more varied student population.
6. Cumulative reach-back builds the mental model, not just the fact list
Sobel's cumulative quizzing approach allows any quiz to draw from any material covered to date. This is not review for review's sake — it is the design mechanism that forces students to build connections between concepts over the course of the term, producing complex mental models rather than isolated fact recall. The final exam is never a shock because the student has been practicing toward it all semester.
Scenario: A biology professor teaches a 150-student introductory biology lecture course that serves as a gateway requirement. Failure and withdrawal rates are around 30%. The course currently has 2 midterms and a final exam, weekly readings, and a lab section. Most students rely on cramming.
Audit result: C1 absent (no low-stakes quizzing), C2 absent (exams are not cumulative), C3 partial (topics are sequential, not interleaved), C4 absent (no graded practice), C5 absent (no explanation of learning principles), C6 partial (exams provide calibration, but too infrequently).
Redesign interventions selected:
Expected outcome (based on Wenderoth's results): Statistically significant reduction in failure rates; higher-order thinking scores on exams improve; under-prepared students show the largest gains.
Scenario: A high school history teacher with 28 students finds that students are disengaged during lectures and score poorly on unit tests despite apparently paying attention. The course has 8 unit tests across the year, plus a final. No quizzing between tests.
Audit result: C1 absent, C2 absent (unit tests are not cumulative), C3 absent (one unit at a time, fully blocked), C4 absent, C5 absent, C6 partial.
Redesign interventions selected:
Assessment calendar shift: From 8 disconnected unit tests to 8 unit tests + 14 bi-weekly quizzes with cumulative reach-back + 4 synthesis writing exercises.
Scenario: A corporate L&D team runs a two-day sales training workshop for 40 new agents. Participants rate the sessions highly, but sales managers report that skills and knowledge are not visible in behavior 30 days later. The team suspects the workshop format — intensive lecture + role play, then nothing — is the problem.
Audit result: C1 absent (no retrieval practice after the workshop), C2 absent (no spaced review), C3 partial (topics covered across 2 days but not interleaved), C4 absent (no consequences for post-workshop engagement), C5 absent, C6 absent.
Redesign interventions selected:
Expected outcome: Knowledge and skills are visible in behavior at 30 and 60 days because the design includes distributed retrieval practice after encoding, not only during it.
references/case-studies.md — Detailed case studies: Wenderoth (University of Washington biology), Matthews/Thayer method (West Point), McDermott (Washington University), Sobel (political economics), Columbia IL school district resultsreferences/bloom-taxonomy-implementation.md — Bloom's taxonomy levels with classroom examples, Wenderoth's answer key template, question classification guide for creating higher-order quiz itemsreferences/design-checklist.md — Printable 6-criteria course design checklist with rating scale and redesign prompt for each criterionreferences/transparency-scripts.md — Sample first-class transparency scripts for university (biology/sciences), high school (humanities), and corporate training contexts; student-facing language for syllabiThis skill is licensed under CC-BY-SA-4.0.
Source: BookForge — Make It Stick: The Science of Successful Learning by Peter C. Brown, Henry L. Roediger III, Mark A. McDaniel.
Install related skills from ClawhHub:
clawhub install bookforge-retrieval-practice-study-systemclawhub install bookforge-practice-schedule-designerclawhub install bookforge-desirable-difficulty-classifierOr install the full book set from GitHub: bookforge-skills
共 1 个版本
暂无安全检测报告