← 返回
数据分析 中文

VTL Image Analysis

Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework. Detects default-mode bias (center lock, radial collaps...
利用视觉思维透镜 (VTL) 框架衡量AI生成图像的构图结构,检测默认模式偏差(如中心锁定、径向坍缩等)。
rusparrish
数据分析 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 816
下载
💾 17
安装
1
版本
#latest

概述

VTL Image Analysis

Use this skill whenever a user asks to analyze, diagnose, or improve a

generated image's composition. Also invoke it proactively after image

generation if the user has requested better compositional quality.

When to Use

  • User says "analyze this image", "why does this look generic/flat/boring"
  • User asks to improve a generated image's composition
  • After generating an image with openai-image-gen or similar skills
  • User asks why their prompts aren't producing interesting layouts

Step 1 — Measure

Run the probe script on the image:

python3 scripts/vtl_probe.py <image_path>

This returns JSON. Example:

{
  "valid": true,
  "mask_status": "PASS",
  "delta_x": -0.027,
  "delta_y": 0.008,
  "r_v": 0.875,
  "rho_r": 12.4,
  "dRC": 0.40,
  "dRC_label": "mass-dominant",
  "k_var": 1.12,
  "infl_density": 0.16,
  "flags": ["CENTER_LOCK"]
}

HARD STOP — Refusal Gate

Before reporting any results, check valid and mask_status.

If valid is false OR mask_status is "FAIL":

> "VTL measurement failed: [error message]. The image does not have sufficient

> structural signal for reliable compositional analysis. Try a different image

> or one with more defined edges and contrast."

Stop here. Do not report coordinates. Do not generate re-prompts.

If mask_status is "WARN":

> "VTL measurement returned low-confidence results (sparse structural signal).

> Coordinates are reported but treat them as indicative, not definitive."

> Then continue with the caveat attached to all outputs.

This refusal is non-negotiable. Fabricating a compositional reading from a

failed measurement produces false diagnosis. The framework is deterministic

by design — an uncertain measurement is reported as uncertain, not smoothed over.


Step 2 — Report Coordinates

Report the five coordinates plainly:

VTL ANALYSIS
────────────────────────────────
Placement   Δx={delta_x}  Δy={delta_y}
Void        rᵥ={r_v}
Packing     ρᵣ={rho_r}
Radial      dRC={dRC}  [{dRC_label}]
Tension     k_var={k_var}

FLAGS: {flags or NONE}

Step 3 — Generate Re-Prompt (if flags present)

Run the regen script with the user's original prompt and the metrics output:

python3 scripts/vtl_regen.py \
  --prompt "USER'S ORIGINAL PROMPT" \
  --metrics <path_to_metrics.json> \
  --out prompts.json

This selects operators from operators.yaml based on which flags fired and

returns up to 3 prompt variants. Report the selected variant as the primary

recommendation and offer the alternatives.

If no flags fired, report: "No default-mode patterns detected. Coordinates are

within normal range."


Operator Logic

Operators live in operators.yaml. They are rule-based — triggers are evaluated

deterministically against the metric values. The AI does not invent or modify

operators. If a trigger fires, the patch is applied. If not, it isn't.

Do not override operator logic. Do not substitute your own re-prompt language

for what the operator specifies. The operators are the prescription layer —

they are the operator's responsibility, not the AI's improvisation.

If the user wants to modify re-prompt behavior, direct them to edit operators.yaml.


Notes

  • Metrics describe compositional coordinates, not quality. CENTER_LOCK is not

"bad" — it's a signal that the model defaulted. A portrait photographer

choosing center composition is authorship. An AI doing it on every prompt

regardless of content is prior behavior. VTL measures the difference.

  • dRC requires radial eligibility. If mass centroid is very close to frame

center, dRC is labeled "dual-center" — report the label, not a number

interpretation.

  • Full metric definitions: references/vtl-metrics.md
  • Full framework: https://github.com/rusparrish/Visual-Thinking-Lens
  • Author: Russell Parrish — https://artistinfluencer.com

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 23:15 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Stock Analysis

udiedrichsen
{"answer":"基于雅虎财经数据,分析股票与加密货币。支持投资组合管理、自选股预警、股息分析、8维评分、热门趋势扫描及传闻/早期信号探测。适用于股票分析、持仓追踪、财报异动、加密监控、热门股追踪或提前发掘非主流传闻。"}
★ 269 📥 56,884
data-analysis

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 366 📥 139,952
data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 162 📥 59,666