
Output Wrong Task

The model produces correct-looking output that addresses a different task than the one requested — typically a related but distinct interpretation of an ambi...
Author: mvogt99
Uncategorized · clawhub · v1.0.0 · 1 version · 100000 · Key: not required

Overview

output-wrong-task

The output is well-formed and internally coherent but answers the wrong question. The model resolved an ambiguous prompt toward the most common interpretation rather than the one the user intended, or it latched onto a salient keyword and addressed that instead of the full request. The result can look convincing enough to pass a quick read.

Symptoms

  • The deliverable matches the topic of the request but misses its purpose — e.g., "explain this function" gets documentation instead of the debugging analysis asked for.
  • A code task produces something runnable that solves a simpler problem than the one specified, or an adjacent one.
  • The model answers the first clause of a multi-part question and silently drops the rest.
  • The output would be correct for a different, more common prompt that shares keywords with this one.
  • Asking the model to verify what it just did reveals that it believed it was solving a different problem.

What to do

  • Restate the concrete deliverable, not just the topic. Instead of "help me with authentication," say "write a middleware function that checks for a valid JWT in the Authorization header and returns 401 if missing or invalid — nothing else."
  • Break compound tasks apart. If the prompt has multiple independent requirements, submit them one at a time and verify each before continuing.
  • Anchor the output format explicitly. Specifying the expected structure (function signature, JSON schema, number of steps, file to modify) gives the model less room to substitute a related but wrong output.
  • Before accepting the output, map it back to the original requirement: does this output satisfy the stated goal, not just a plausible-sounding version of it?
  • If the wrong-task output keeps recurring on the same prompt, the prompt likely has a latent ambiguity. Identify which interpretation the model chose and add a clause that explicitly rules it out.
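The "map it back to the original requirement" step can be made mechanical by writing each stated requirement as a small check and running the output through all of them. The following is a minimal sketch under stated assumptions, not a prescribed tool: `Requirement`, `unmet_requirements`, `is_json_object_with_keys`, and the sample request are all hypothetical names and data introduced for illustration.

```python
import json
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Requirement:
    """One stated requirement and a predicate that checks an output against it."""
    name: str
    check: Callable[[str], bool]


def unmet_requirements(output: str, requirements: list[Requirement]) -> list[str]:
    """Return the names of the requirements the output does not satisfy."""
    return [r.name for r in requirements if not r.check(output)]


def is_json_object_with_keys(text: str, keys: set[str]) -> bool:
    """True if text parses as a JSON object that contains every key in keys."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        return False
    return isinstance(data, dict) and keys.issubset(data.keys())


# Hypothetical request: "return a JSON object with 'status' and 'reason',
# where status is 401 when the token is missing".
requirements = [
    Requirement(
        "is JSON with status and reason",
        lambda out: is_json_object_with_keys(out, {"status", "reason"}),
    ),
    Requirement(
        "status is 401",
        lambda out: is_json_object_with_keys(out, {"status"})
        and json.loads(out)["status"] == 401,
    ),
]

# On-topic prose that answers a different task than the one requested.
on_topic_but_wrong = "JWTs are signed tokens used for authentication."
# An output that matches the requested structure and value.
on_task = '{"status": 401, "reason": "missing token"}'

print(unmet_requirements(on_topic_but_wrong, requirements))
print(unmet_requirements(on_task, requirements))
```

An output that is on topic but off task fails every check here, while a quick read might have accepted it. Checks like these only cover requirements that can be tested mechanically; the purpose of the request still needs a human read.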

Version history

1 version in total

  • v1.0.0 (current)
    2026-05-08 01:25 · Safe · Safe

Security checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
