← 返回
未分类

task-boundary-auditor

Use when a user asks the AI to perform tasks that may exceed LLM capabilities, such as counterfactual reasoning, formal verification, real-time control, physical design, ethical judgment, zero-omission auditing, or extreme prediction. Also use when tasks mix safe and unsafe sub-tasks. Triggers: "verify all", "prove", "predict", "design a bridge/system", "what if X never happened", "ensure no omissions", "real-time monitor", "legal/medical judgment".
Use when a user asks the AI to perform tasks that may exceed LLM capabilities, such as counterfactual reasoning, formal verification, real-time control, physical design, ethical judgment, zero-omission auditing, or extreme prediction. Also use when tasks mix safe and unsafe sub-tasks. Triggers: "verify all", "prove", "predict", "design a bridge/system", "what if X never happened", "ensure no omissions", "real-time monitor", "legal/medical judgment".
yjkj999999
未分类 community v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 21
下载
💾 0
安装
1
版本
#latest

概述

Task Boundary Auditor

Overview

When an LLM oversteps its boundaries, it does not say "I can't do this." Instead, it delivers answers that are perfectly formatted and logically coherent—but fundamentally wrong. This is the sophisticated hallucination. This Skill performs a rapid pre-execution audit to determine whether a task exceeds LLM capabilities, and proposes a human-machine division of labor: what the LLM can handle, and what must be handed off to specialized tools or humans.

When to Use

  • The user asks the LLM to perform causal inference, precise verification, real-time control, physical design, or other tasks beyond its capabilities
  • The user demands completeness guarantees such as "zero omissions," "find all," or "ensure none are missed"
  • The user requests counterfactual reasoning ("what if X had never happened")
  • The task mixes safe and unsafe sub-tasks

When NOT to Use

  • Pure text generation, translation, summarization, or format conversion (safe-zone tasks—execute directly)
  • The user already clearly understands that human/professional tools are needed
  • Simple information retrieval or Q&A

Boundary Classification

Prohibited Zone (LLM must not take primary responsibility)

TypeDetection SignatureCore CharacteristicCorrect Approach
------------------------------------------------------------------
Counterfactual ReasoningHypothetical worlds, long causal chainsLLM has no world model; can only assemble narrativesCausal models / expert reasoning; LLM extracts evidence
Precise VerificationSymbolic proof, code audit, zero omissionsError in long-chain reasoning compounds exponentiallyFormal tools / expert verification; LLM translates specifications
Real-Time Closed-LoopCross-call state retention, sense-act-feedbackLLM is stateless and has no sensor interfacesRule engines / control systems; LLM writes logs
Physical IntuitionSpatial-mechanical-material reasoningLLM's understanding of physics is second-hand textCAD / simulation software; LLM interprets results
Conceptual ExtrapolationCreative breakthroughs, out-of-training-distributionCan only interpolate and recombine; cannot extrapolateHuman provides core insight; LLM assists with association
Ethical AdjudicationRequires subjectivity and legal liabilityLLM has no subjectivity or capacity for responsibilityHuman adjudication; LLM structures the framework
Zero-Omission AuditCompleteness guarantee, blind-spot-free scanGenerative review has attention blind spotsRule-engine scan; LLM assists with explanation
Extreme PredictionOut-of-distribution events, black swansForcibly translates into known patternsCausal rule-layer reasoning; LLM performs extraction

Safe Zone (LLM can take primary responsibility)

TypeExamples
----------------
Text GenerationWriting emails, drafting articles
Translation & ConversionChinese-English translation, format conversion
Information ExtractionExtracting data or key points from reports
Summarization & CompressionPaper summaries, meeting minutes
FormattingConverting to tables, generating templates

Restricted Zone (LLM assists; human / professional tools take primary responsibility)

TypeCharacteristicConstraint
----------------------------------
Short-Chain Causality2–3 step causality based on explicit associationsLLM extracts evidence; human / model makes judgment
Numerical EstimationOrder-of-magnitude or range; no precise verification neededMust be labeled "estimate, not precise"
Draft ProposalDrafts requiring human confirmationLLM generates options; human selects and takes responsibility
Known Pattern MatchingWithin training distribution but requires professional judgmentLLM organizes information; professional system diagnoses

Audit Workflow

1. Parse task → extract [verb] + [object] + [constraints]
2. Match boundary → classify against the table above: Prohibited / Restricted / Safe
3. Output verdict → provide division-of-labor plan based on classification

Mixed tasks (involving both safe-zone and prohibited-zone elements) must be decomposed into sub-tasks and classified individually.

Output Templates

Select the appropriate output based on the classification:

Boundary Violation Block

## Task Boundary Audit

Task: [one-sentence description]
Verdict: ❌ [Violation Type] — [Reason]

Sophisticated Hallucination Risk: [specific description of how the LLM will pretend it can do this]

Recommended Division of Labor:
- LLM: [what the LLM can do, e.g., extract evidence, translate, format]
- Professional tools / human: [what must be handed off, e.g., causal reasoning, verification, adjudication]
- Next step: [concrete action the user can take]

Restricted Task Routing

## Task Boundary Audit

Task: [one-sentence description]
Verdict: 🟡 [Restricted Type]

Task Decomposition:
| Sub-task | Responsible Party | Constraint |
|----------|-------------------|------------|
| [...] | LLM | [...] |
| [...] | Human / tool | [...] |

Risk Point: [the step where the LLM is most likely to pretend it is competent]
Mandatory Disclaimer: LLM output must include "This conclusion is based on text extraction, not verification."

Safe Clearance

## Task Boundary Audit

Task: [one-sentence description]
Verdict: ✅ Safe

Execution Recommendation:
- LLM executes directly
- Cross-check factual claims when involved
- Recommend human spot-check of 10%–20%

Usage Examples

Violation: Zero-Omission Audit

> User: Review this 200-page investment agreement, find all hidden clauses, and guarantee no omissions.

>

> Agent: Task Boundary Audit → ❌ Zero-Omission Audit. LLM review is generative and has attention blind spots (may miss footnotes, appendices, cross-references); it cannot provide a completeness guarantee.

>

> Sophisticated Hallucination Risk: The AI will return a perfectly formatted report with precise clause locations and professional risk descriptions. Yet the put option buried in Appendix C may be "elegantly" overlooked.

>

> Division of Labor: LLM structures the agreement by section and extracts explicit risk clauses → rule engine scans for known risk patterns → lawyer reviews clause by clause and assumes responsibility.

Restricted: Causal Analysis + Prediction

> User: Analyze this sales data, identify the causes of the decline, and predict next quarter's trend.

>

> Agent: Task Boundary Audit → 🟡 Short-Chain Causality (LLM can extract known decline factors) + ❌ Trend Prediction (out-of-distribution prediction, prohibited).

>

> Decomposition:

> | Sub-task | Responsible Party | Constraint |

> |----------|-------------------|------------|

> | Extract decline factors mentioned in the report | LLM | Verify completeness |

> | Determine primary causes and factor interactions | Human / statistical model | LLM must not output causal conclusions |

> | Predict next quarter's sales | Statistical model / expert | LLM must not output precise predictions |

> | Describe historical trends (not predictions) | LLM | Label as "historical description, not future prediction" |

Safe: Translation + Extraction

> User: Translate this English contract into Chinese and extract the payment clauses.

>

> Agent: Task Boundary Audit → ✅ Translation & Conversion + ✅ Information Extraction. LLM executes directly. Recommend human spot-check of payment clause amounts, dates, and conditions.

Common Mistakes

  • Downgrading the verdict because the user insists: User urgency or insistence does not reduce risk; classification criteria remain unchanged.
  • Equating "LLM is good at search" with "LLM can verify": LLM can extract information, but that does not equal judging correctness.
  • Auditing only part of a mixed task: If a task contains one prohibited sub-task, the entire task must be decomposed.
  • Underestimating the subtlety of sophisticated hallucinations: The most dangerous outputs are not obvious nonsense, but professionally polished answers that are fundamentally wrong.
  • Treating restricted zones as safe zones: Numerical estimates must be labeled with uncertainty; draft proposals must be labeled as pending confirmation.

Notes

  • The classification rules in this Skill are human-hardened; the Agent must not modify the boundary classification table.
  • The Agent enforces classifications strictly and must not downgrade risk due to vague phrasing or user pressure.
  • Judgments involving legal, medical, financial, or other professional domains should recommend human review even when technically feasible.

版本历史

共 1 个版本

  • v1.0.0 从ClawHub迁移发布 当前
    2026-06-07 12:18 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

meituan-huisheng-coupon

user_15292d5a
帮用户领取美团优惠券并查询当日优惠活动,覆盖外卖、到店餐饮、酒旅、休闲娱乐等全品类。用户明确表达领券、省钱、查找优惠意图,或涉及美团覆盖的生活服务消费决策时触发。
★ 1 📥 30

agnes-image-gen

user_15292d5a
使用 Agnes AI 的图片生成模型生成图片,支持文生图(agnes-image-2.1-flash)和图生图(agnes-image-2.0-flash)。支持自定义 API Key,用户可使用自己的 Agnes Key。优化重点:降低
★ 0 📥 63

darwin-skill-qszf

user_15292d5a
达尔文.skill 2.0 — 自主Skill优化系统:评估→改进→测试→保留或回滚。与女娲.skill配合使用:女娲造人(创建Skill),达尔文进化(优化Skill)。集成微软SkillLens 9维评分+SkillOpt验证机制
★ 0 📥 58