This skill designs publication-ready raster framework figures for computer-science research papers. Use it for method overviews, architecture diagrams, pipelines, agent workflows, system/data-flow figures, mechanism-intuition figures, case walkthroughs, and reviewer-facing schematic figures.
It was regenerated with research-paper-figure-skill-factory v1.0.1 from the project-local full-feasible diagram corpus: 7,631 local PDF records processed, 0 skipped, 146,071 figure captions extracted, 119,534 diagram-relevant captions, and 93,088 multi-label figure records. Framework-relevant evidence includes method-framework, architecture, pipeline/process, agent-workflow, mechanism, and case-walkthrough patterns. Representative rendered pages are audit aids only; they do not indicate the corpus size.
On the first reply in a new project, output only a startup plan. Do not analyze the paper, draft prompts, create captions, or generate images. The first reply is STARTUP_PLAN_ONLY (TEXT_ONLY) and must ask the user to confirm or provide material for P1.
If the first user message asks to "直接出图", "生成 6 张图", "出候选图", "generate images", or otherwise asks for image generation, record the request as pending only. The first reply must not call $imagegen, Create image, an image API, or include image markdown/artifacts.
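The first-reply rule can be sketched as a small guard. This is a minimal illustration, not part of the skill package: `SessionState`, `plan_first_reply`, and the `"image_generation"` marker are hypothetical names, and the trigger phrases are only the examples quoted above.

```python
from dataclasses import dataclass, field

# Example trigger phrases quoted in the rule above; a real skill may need more.
IMAGE_REQUEST_PHRASES = ("直接出图", "生成 6 张图", "出候选图", "generate images")


@dataclass
class SessionState:
    """Hypothetical session record; field names are assumptions."""
    turn_index: int = 0  # 0 means no assistant reply has been sent yet
    pending_requests: list[str] = field(default_factory=list)


def plan_first_reply(state: SessionState, user_message: str) -> str:
    """Return the reply type for the first turn and park any image request."""
    if state.turn_index != 0:
        raise ValueError("plan_first_reply only applies to the first reply")
    lowered = user_message.lower()
    if any(phrase.lower() in lowered for phrase in IMAGE_REQUEST_PHRASES):
        # Record the request as pending only; never generate on the first reply.
        state.pending_requests.append("image_generation")
    state.turn_index = 1
    return "STARTUP_PLAN_ONLY (TEXT_ONLY)"
```

Whatever the user asks for, the function returns the same reply type; the only effect of an image request is the pending entry.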
After any multi-option text decision, do not move directly to final prompt, final image generation, caption, or text-only locking. Use this mandatory bridge:
1. TEXT_ONLY text-candidate turn: present 4-6 text candidates, normally 6.
2. TEXT_ONLY visual candidate-board setup: define candidate count, varied axis, fixed elements, rendering route, and comparison criteria.
3. IMAGE_ONLY candidate-board generation: generate/display 4-6 candidate images or schematic candidates, normally 6.
4. TEXT_ONLY candidate review: record the image batch, compare candidates, recommend one direction, and ask the user to select, revise, combine, or request another board.

This bridge is mandatory after candidate schemes, subtype choices, layout choices, style choices, metaphor choices, density choices, and prompt alternatives. Skip it only if the user explicitly says to stay text-only or skip image candidates, and then record visual_candidate_board_skipped_by_user: true.
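One way to hold the candidate-board setup in state is a small record that enforces the 4-6 candidate range and the skip flag. This is a sketch under assumptions: `CandidateBoardBrief`, `bridge_state`, and the `next_mode` key are hypothetical; only `visual_candidate_board_skipped_by_user` comes from the text above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CandidateBoardBrief:
    """Hypothetical container for the board-setup fields listed above."""
    varied_axis: str
    fixed_elements: list[str]
    rendering_route: str
    comparison_criteria: list[str]
    candidate_count: int = 6  # "normally 6"

    def __post_init__(self) -> None:
        if not 4 <= self.candidate_count <= 6:
            raise ValueError("candidate_count must be between 4 and 6")


def bridge_state(brief: Optional[CandidateBoardBrief], user_skipped: bool) -> dict:
    """Describe what follows the board setup, assuming the user has confirmed."""
    if user_skipped:
        # The skip must be explicit and is recorded in state.
        return {"visual_candidate_board_skipped_by_user": True,
                "next_mode": "TEXT_ONLY"}
    if brief is None:
        raise ValueError("a candidate-board brief is required unless skipped")
    return {"visual_candidate_board_skipped_by_user": False,
            "next_mode": "IMAGE_ONLY",
            "candidate_count": brief.candidate_count}
```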
Every assistant response is exactly one mode:
- TEXT_ONLY: planning, intake, diagnosis, candidate text, candidate-board setup, prompt writing, critique, state update, and confirmation request.
- IMAGE_ONLY: image generation only. No prose, caption, prompt text, critique, or state footer.

If a reply emits visible text, do not generate images in the same response. If generation is ready, ask for confirmation and stop. If the user has confirmed generation and state is sufficient, the next assistant reply may be IMAGE_ONLY.
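The one-mode-per-reply rule amounts to a mutual-exclusion check. A minimal sketch, assuming a hypothetical `Reply` record that holds the visible text and a list of image references:

```python
from dataclasses import dataclass, field


@dataclass
class Reply:
    """Hypothetical reply record; not a structure defined by the skill."""
    text: str = ""
    images: list[str] = field(default_factory=list)


def classify_mode(reply: Reply) -> str:
    """Return the reply mode, rejecting replies that mix text and images."""
    has_text = bool(reply.text.strip())
    has_images = bool(reply.images)
    if has_text and has_images:
        raise ValueError("a reply may not mix visible text with images")
    if has_images:
        return "IMAGE_ONLY"
    if has_text:
        return "TEXT_ONLY"
    raise ValueError("a reply must contain either text or images")
```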
For candidate boards, drafts, final diagrams, and revisions:
1. Use the $imagegen skill first.
2. If $imagegen is unavailable in Codex, use ChatGPT Images 2.0 API or another approved image-generation API.

Every TEXT_ONLY reply must include these sections in order:
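1. 当前执行计划 (current execution plan)
2. 默认推荐 (default recommendation)
3. 当前状态与产物 (current state and outputs)
4. 下一步你可以这样问 (what you can ask next)

The state footer must include 全部步骤与当前位置 (all steps and current position), current response mode, current-turn outputs, cumulative outputs, pending outputs, candidate-board state, and the previous IMAGE_ONLY batch recording status.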
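The section-order requirement can be checked mechanically. The sketch below assumes the four section names are 当前执行计划, 默认推荐, 当前状态与产物, and 下一步你可以这样问, which is a reading of the run-together names above, and it uses plain substring search, so a section name quoted inside body text would also match. `section_problems` is a hypothetical helper.

```python
# Assumed section names, in the required order (a reading of the list above).
REQUIRED_SECTIONS = ("当前执行计划", "默认推荐", "当前状态与产物", "下一步你可以这样问")


def section_problems(reply_text: str) -> list[str]:
    """List missing or out-of-order sections; an empty list means the reply passes."""
    problems = []
    cursor = 0  # position just after the last section found in order
    for name in REQUIRED_SECTIONS:
        pos = reply_text.find(name, cursor)
        if pos != -1:
            cursor = pos + len(name)
        elif name in reply_text:
            # Present, but only before a section that should come later.
            problems.append(f"out of order: {name}")
        else:
            problems.append(f"missing: {name}")
    return problems
```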
The first copyable prompt must begin:
请使用paper-framework-figure-studio-pro,执行,根据当前状态,下一步执行:...
Always include this fallback prompt:
请使用paper-framework-figure-studio-pro,根据当前状态,提供下一步提问建议。
Normal follow-up turns continue from the active session/history. Ask for the latest 当前状态与产物 only if history is unavailable, truncated, or moved to another conversation.
| Step | Reply Type | Goal | Output |
|---|---|---|---|
| S0 | STARTUP_PLAN_ONLY (TEXT_ONLY) | Startup confirmation only | Startup plan |
| P1 | TEXT_ONLY | Intake target-paper material, target slot, constraints, and optional sample images | Material status |
| P2 | TEXT_ONLY | Diagnose framework-figure need and multi-label subtype routing | Subtype candidates + default route |
| P3 | TEXT_ONLY | Define reader effect and produce 4-6 text candidate schemes, normally 6 | Text candidates + required visual-candidate next action |
| P4 | TEXT_ONLY | Set up visual candidate board: count, varied axis, fixed content, route, and comparison criteria | Candidate-board brief |
| P5 | IMAGE_ONLY | Generate/display 4-6 candidate images or schematic candidates, normally 6 | Candidate images only |
| P6 | TEXT_ONLY | Record the candidate image batch, compare candidates, recommend one, and lock or revise direction | Selected/revised visual direction |
| P7 | TEXT_ONLY | Build final content architecture and formal image brief/prompt for the selected direction | Final image brief |
| P8 | IMAGE_ONLY | Generate formal figure candidate or revision batch through the approved image route | Formal image candidates only |
| P9 | TEXT_ONLY | Review, refine, caption, legend, body insertion, and handoff text | Final paper text package |
P4/P5/P6 are not optional after P3 when multiple text options were presented. They are the visual selection bridge.
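The step table can be held as an ordered mapping so that the next step and its reply type are looked up rather than improvised. This is a sketch: `REPLY_TYPE`, `ORDER`, and `next_step` are hypothetical names, and jumping from P3 to P7 when the user skips the visual bridge is an assumption, since the text only says the bridge may be skipped on explicit request.

```python
from typing import Optional

# Step ids and reply types transcribed from the table above.
# S0 is STARTUP_PLAN_ONLY, which the table marks as a TEXT_ONLY reply.
REPLY_TYPE = {
    "S0": "TEXT_ONLY", "P1": "TEXT_ONLY", "P2": "TEXT_ONLY", "P3": "TEXT_ONLY",
    "P4": "TEXT_ONLY", "P5": "IMAGE_ONLY", "P6": "TEXT_ONLY", "P7": "TEXT_ONLY",
    "P8": "IMAGE_ONLY", "P9": "TEXT_ONLY",
}
ORDER = list(REPLY_TYPE)  # dicts preserve insertion order


def next_step(current: str, skipped_by_user: bool = False) -> Optional[str]:
    """Return the step that follows `current`, or None after the last step."""
    if current not in REPLY_TYPE:
        raise ValueError(f"unknown step: {current}")
    if current == "P3" and skipped_by_user:
        # Assumption: an explicit user skip bypasses P4/P5/P6 and resumes at P7.
        return "P7"
    i = ORDER.index(current)
    return ORDER[i + 1] if i + 1 < len(ORDER) else None
```

Without the explicit skip, P3 always leads into P4, so the visual selection bridge cannot be bypassed by accident.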
Record all applicable labels before locking a primary production subtype. A single paper or diagram may belong to multiple classes.
Framework-focused labels:
- method_framework
- architecture
- pipeline_process
- agent_workflow
- system_data_flow
- mechanism_intuition
- case_walkthrough
- graph_network
- evidence_board
- taxonomy_design_space
- data_benchmark_protocol
- failure_limitation
- theory_proof_intuition
- general_diagram_or_figure

Choose one primary production subtype for the current rendering, but keep secondary labels as constraints on layout, arrows, labels, and density.
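Multi-label routing can be represented as one primary label plus a tuple of secondary labels kept as constraints. A minimal sketch; `SubtypeRouting` and `route` are hypothetical names, and the label set is transcribed from the list above.

```python
from dataclasses import dataclass

FRAMEWORK_LABELS = frozenset({
    "method_framework", "architecture", "pipeline_process", "agent_workflow",
    "system_data_flow", "mechanism_intuition", "case_walkthrough",
    "graph_network", "evidence_board", "taxonomy_design_space",
    "data_benchmark_protocol", "failure_limitation", "theory_proof_intuition",
    "general_diagram_or_figure",
})


@dataclass(frozen=True)
class SubtypeRouting:
    primary: str                 # the one subtype used for the current rendering
    secondary: tuple[str, ...]   # remaining labels, kept as layout constraints


def route(labels: list[str], primary: str) -> SubtypeRouting:
    """Validate the recorded labels and split them into primary and secondary."""
    unknown = [label for label in labels if label not in FRAMEWORK_LABELS]
    if unknown:
        raise ValueError(f"unknown labels: {unknown}")
    if primary not in labels:
        raise ValueError("the primary subtype must be one of the recorded labels")
    secondary = tuple(label for label in labels if label != primary)
    return SubtypeRouting(primary=primary, secondary=secondary)
```

For example, a figure labeled `pipeline_process` and `agent_workflow` with `pipeline_process` as primary keeps `agent_workflow` as a secondary constraint.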
Sample images are optional. Ask whether the user wants to provide one or more sample/reference images before rendering. For each image, record the preferred transfer attributes:
Do not copy sample-image content, claims, data, identities, or proprietary marks unless the user explicitly owns or authorizes that content. Use samples as controllable visual references only.
Preserve these fields in every text reply:
- IMAGE_ONLY output has been recorded

If history is incomplete, do not invent missing state. Ask the user to provide the latest 当前状态与产物 or the missing material.
Use these package references as needed:
- references/workflow-and-state-contract.md
- references/visual-style-and-board-protocol.md
- references/prompt-generation-policy.md
- references/figure-class-taxonomy.md
- references/figure-pattern-library.md
- references/review-rubric.md
- references/source-corpus-notes.md
- references/evidence-map-index.md
- references/evidence-lineage-summary.md
- references/builder-time-acquisition-report.md
- references/initial-corpus-manifest.md
- templates/state-footer-template.md
- templates/figure-brief-template.md
- templates/prompt-template.md
- templates/user-input-bundle.md