Digital Human Identity Cultivation Studio

A digital identity cultivation skill for AI Agent characters. Separates identity definition (Character Sheet) from scene generation (Scene Card), ensuring visual consistency across unlimited scenes while allowing full creative freedom. Supports: identity initialization, standard proof generation, identity locking, tuning, and continued scene output for the same character. Triggers: 数字人, 形象, 角色, identity, avatar, character, portrait, 写真, photoshoot, 头像, 形象照, cosplay, or any request to generate

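# 🎨 Agent Identity Studio: Digital Human Identity Cultivation Studio

> **[English version →](https://github.com/amayazhao/agent-identity-studio/blob/main/README_EN.md)**

An AI-driven skill for generating digital human portraits: define a character's identity once, then generate across unlimited scenes while the character's look stays consistent.

## What is this

A Skill that runs on AI Agent platforms. It is compatible with any Agent framework that supports Skill loading (such as [OpenClaw](https://github.com/anthropics/claw) and [WorkBuddy](https://www.codebuddy.cn/docs/workbuddy/Overview)). You tell it what your character looks like, and it handles the rest:

- **Identity initialization** (describe the character → generate 3 proof images → confirm and lock the identity)
- **Scene photo generation** (one sentence produces an image: "Take a scene photo of me in a cafe")
- **Character consistency** (whatever the scene, outfit, or lighting, the character stays the same)
- **Automatic audit handling** (built-in safe-phrase library, plus automatic downgrade and retry after a rejection)
- **Cosplay mode** (change the outfit, not the person; character traits are always preserved)

**Online demo**: https://amayazhao.github.io/nami-gallery/skill.html

**GitHub**: https://github.com/amayazhao/agent-identity-studio

## Design highlights

### 🧬 Two-layer architecture: identity and scene fully decoupled

This is the core design of the Skill:

| Layer | Covers | Controlled by |
|------|--------|--------|
| Character Sheet (identity layer) | Facial features, hair color, eye color, hair accessory, body, temperament | Fixed; shared by all scenes |
| Scene Card (scene layer) | Location, outfit, lighting, expression, composition | Different for every generation |

Why this design? The traditional approach repeats the character description in every prompt, which makes it easy to omit or mistype details and introduces subtle inconsistencies between generations. This Skill fixes the character definition in a YAML file, and the scene layer is never allowed to touch identity traits: fields such as hair color, eye color, and facial features are rendered only in the identity layer.

### 📐 Body tier system: automatic adaptation per composition

| Composition | Body description | Reason |
|------|----------|------|
| Close-up / avatar | No body description | Only the face is in frame; a body description only interferes |
| Half-body | Basic body type + garment-defined silhouette | Sufficient without overdoing it |
| Full-body | Full body type + legs + stance | A full-body shot needs complete proportion information |
| Loose clothing | Compensation terms appended automatically | Prevents loose clothing from "swallowing" the body description |

No manual tuning is needed: gen.py picks the tier from the Scene Card's `composition` field.

### 🛡️ Audit safety: describe the clothing, not the body

MiniMax's content audit is sensitive to body descriptions. The Skill ships with a set of safe phrasings:

| Desired effect | ✅ Safe phrasing | ❌ Gets blocked |
|-------------|-----------|-----------|
| Show body curves | `form-fitting dress that hugs her curves` | `curvy body` |
| Attractive legs | `showing shapely legs` | `long sexy legs` |
| Neckline design | `V-neck dress with elegant draping` | `showing bust` |

Principle: **describe how the clothing is worn, not what the body looks like.** After a rejection, the prompt is automatically downgraded and retried, up to 2 times.

### 🎯 Dual reference-image strategy: lock the face or the body as needed

| Composition | Reference image used | What it locks |
|------|-------------|--------|
| Half-body / close-up | Front reference | Facial features |
| Full-body | 3/4 side reference | Body proportions |

Reference images are optional. For characters in the model's native art style, a pure prompt actually gives better consistency. Reference images are better suited to anchoring "non-native" characters (for example, a character the user drew and uploaded). Two ways of passing the image are supported:

- **Local data:URI** (`data:image/png;base64,...`): no dependency on an external URL
- **Public URL**: suitable for reference images already deployed to a CDN

⚠️ **Note**: MiniMax's `image_base64` field does not work at all; the image must be passed through the `image_file` field (either a URL or a data:URI).

## Showcase

The same character in 22+ different scenes, with character traits preserved automatically:

| Scene | Outfit | Composition |
|------|------|------|
| ☕ Cafe date | White turtleneck sweater | Half-body |
| 🌧️ Rainy night street | Beige trench coat + transparent umbrella | Full-body |
| 🌸 Cherry blossom park | Blue dress + white cardigan + straw hat | Full-body |
| 📚 Library | Round-frame glasses + cream sweater | Half-body |
| 🎪 Night market | Denim jacket + striped tee | Half-body |
| 💼 Office | Dark gray suit + white shirt | Half-body |
| 🏖️ Seaside sunset | White linen dress | Full-body |
| 🎄 Christmas fireplace | Red oversized sweater | Half-body |
| 🎭 Cosplay | School uniform / stage costume (character's hair and eye color preserved) | Full-body |
| 🏋️ Yoga | Sports tank top + leggings | Half-body |

## Configuration and installation

### Step 1: Install the skill

Place the `agent-identity-studio/` folder in your Agent platform's Skills directory. Example paths for different platforms:

```
# WorkBuddy / CodeBuddy
~/.workbuddy/skills/agent-identity-studio/

# OpenClaw
~/.claw/skills/agent-identity-studio/

# Other platforms: see the platform's documentation for the Skill install directory
```

After installation, the Agent recognizes the **agent-identity-studio** skill automatically.

### Step 2: Get an API key (either option works)

**Option A: MiniMax (default engine)**

1. Register at the [MiniMax Open Platform](https://www.minimaxi.com/)
2. Create an application and obtain an API key
3. Set it as an environment variable: `MINIMAX_API_KEY`

**Option B: Tencent Hunyuan Image 3.0** (via the Tencent Cloud SDK)

1. Enable the Hunyuan image generation service in the [Tencent Cloud console](https://console.cloud.tencent.com/)
2. Get a SecretId + SecretKey from [API key management](https://console.cloud.tencent.com/cam/capi)
3. Set them as environment variables: `TENCENT_SECRET_ID` + `TENCENT_SECRET_KEY`
4. Install the SDK: `pip install tencentcloud-sdk-python-aiart`
5. Switch the engine: `IDENTITY_STUDIO_ENGINE=hunyuan`

```powershell
# Windows PowerShell, using MiniMax as the example
[Environment]::SetEnvironmentVariable('MINIMAX_API_KEY', 'your-key', 'User')

# If you use Hunyuan
[Environment]::SetEnvironmentVariable('TENCENT_SECRET_ID', 'your-SecretId', 'User')
[Environment]::SetEnvironmentVariable('TENCENT_SECRET_KEY', 'your-SecretKey', 'User')
[Environment]::SetEnvironmentVariable('IDENTITY_STUDIO_ENGINE', 'hunyuan', 'User')
pip install tencentcloud-sdk-python-aiart
```

| Engine | Model | Cost per image | Speed | Reference images | External dependency |
|------|------|---------|------|--------|---------|
| MiniMax | image-01 | ~¥0.025 | 15-30s | 1 (data:URI / URL) | None |
| Tencent Hunyuan | HY-Image-V3.0 | ~¥0.20 | 4-7s | Up to 3 | `tencentcloud-sdk-python-aiart` |

### Step 3: Start using it

Send this in the Agent:

> Create a digital human character for me: short silver hair, amber eyes, a lively and cheerful girl

The Skill initializes the identity, generates the proof images, and waits for your confirmation. Once confirmed, send any scene description to generate an image:

> Take a scene photo of me holding an umbrella in the rain, with a cinematic feel

## Usage

| What you say | What the Skill does |
|---------|------------|
| "Create a character: a magical girl with long red hair and green eyes" | Initialize the Character Sheet → generate 3 proof images |
| "Take a photo of me in a cafe" | Design a Scene Card → assemble the prompt → call the API → return the image |
| "Give me a weekend-at-home photo set, 4 shots" | Batch-generate a multi-scene session |
| "Make the hair color a bit darker" | Modify the Character Sheet → re-verify |
| "Cosplay as Mai Sakurajima" | Outfit-swap mode (the character stays the same) |

## Technical data

| Metric | MiniMax | Tencent Hunyuan |
|------|---------|---------|
| Model | image-01 | HY-Image-V3.0 |
| Cost per image | ~¥0.025 | ~¥0.20 |
| Generation speed | 15-30s | 4-7s |
| Reference image limit | 1 | 3 |
| Call mode | Synchronous | Asynchronous (SDK, submit + poll) |
| External dependency | None | `tencentcloud-sdk-python-aiart` |
| Authentication | `MINIMAX_API_KEY` | `TENCENT_SECRET_ID` + `TENCENT_SECRET_KEY` |
| Engine switch | Default | `IDENTITY_STUDIO_ENGINE=hunyuan` |

## FAQ

| Problem | Troubleshooting |
|------|------|
| `MINIMAX_API_KEY not set` | Check the environment variable; a newly set value is only visible after the terminal is restarted |
| `TENCENT_SECRET_ID / SECRET_KEY not set` | Check the environment variables; confirm you have run `pip install tencentcloud-sdk-python-aiart` |
| Image blocked by content audit (1033) | The prompt contains sensitive words and is downgraded and retried automatically; if it keeps failing, adjust the outfit description |
| MiniMax reference image returns `unknown error` | `image_base64` does not work; use `image_file` with a data:URI or URL |
| Hunyuan generation times out | Asynchronous polling waits up to 120s; retry if the cause is network or concurrency |
| The character looks different every time | Confirm that character-sheet.yaml exists and `meta.confirmed=true` |
| Full-body proportions look odd | Avoid `tall` / `model-like` / `slender` |
| How to switch engines | Set the environment variable `IDENTITY_STUDIO_ENGINE=hunyuan` or `minimax` |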
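
The engine switch and the credential checks described above can be sketched in a few lines. This is a minimal illustration, not the logic in gen.py: the function name `resolve_engine` and the `REQUIRED_ENV` table are hypothetical, and only the environment variable names come from the README.

```python
import os

# Credentials each engine needs, as listed in the README tables.
REQUIRED_ENV = {
    "minimax": ["MINIMAX_API_KEY"],
    "hunyuan": ["TENCENT_SECRET_ID", "TENCENT_SECRET_KEY"],
}


def resolve_engine(env=None):
    """Return (engine_name, missing_variables) for an environment mapping."""
    env = os.environ if env is None else env
    # MiniMax is the documented default when the switch is unset or empty.
    engine = env.get("IDENTITY_STUDIO_ENGINE", "minimax").strip().lower() or "minimax"
    if engine not in REQUIRED_ENV:
        raise ValueError(f"unknown engine: {engine!r}")
    missing = [name for name in REQUIRED_ENV[engine] if not env.get(name)]
    return engine, missing


engine, missing = resolve_engine()
if missing:
    print(f"{engine}: missing {', '.join(missing)}")
```

Passing a plain dict instead of `os.environ` makes the check easy to exercise without touching the real environment.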

user_d763f1a5

Uncategorized · community · v1.0.0 · 1 version · API key: required


Agent Identity Studio

> Core principle: Define "who they are" first, then generate "what they're doing".

> Character Sheet (fixed) + Scene Card (variable) = consistent visual output.

Architecture Overview

┌─────────────────────────────────────────────────┐
│  Layer 0 — Engine (API / Audit / Retry)         │
│  MiniMax image-01, ~¥0.025/image, 15-30s/image  │
├─────────────────────────────────────────────────┤
│  Layer 1 — Character Sheet (Fixed)              │
│  Face + Body + Personality + Style + Reference  │
│  → render() → fixed prompt segment              │
├─────────────────────────────────────────────────┤
│  Layer 2 — Scene Card (Variable per shot)       │
│  Location + Outfit + Pose + Lighting + Comp     │
│  → render() → variable prompt segment           │
├─────────────────────────────────────────────────┤
│  Prompt Assembly                                 │
│  final = character_prompt + expression + scene   │
└─────────────────────────────────────────────────┘

Image Generation Engine — MiniMax image-01

API Configuration

API URL: https://api.minimaxi.com/v1/image_generation
Model: image-01
Auth: Bearer Token from env MINIMAX_API_KEY
Cost: ~¥0.025/image (~40 images per ¥1)
Speed: 15-30s/image (first call may be slower)
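
A minimal sketch of how a request matching this configuration might be built with the standard library. Only the URL, the model name, and the Bearer authentication come from the configuration above; the JSON keys `prompt` and `aspect_ratio` and the helper name `build_request` are assumptions for illustration, and the request is constructed but not sent.

```python
import json
import urllib.request

API_URL = "https://api.minimaxi.com/v1/image_generation"


def build_request(prompt, api_key, aspect_ratio="3:4"):
    """Build (but do not send) a POST request for one image."""
    # Assumption: "prompt" and "aspect_ratio" are illustrative payload keys.
    payload = {"model": "image-01", "prompt": prompt, "aspect_ratio": aspect_ratio}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_request("silver short hair, amber eyes", api_key="sk-example")
print(req.get_method(), req.full_url)
```

In real use the key would be read from `MINIMAX_API_KEY` and the request passed to `urllib.request.urlopen` with a timeout that allows for the 15-30s generation time.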

Gen Script

Path: scripts/gen.py
Usage: from gen import CharacterSheet, SceneCard, Engine, generate, generate_session
CLI: python gen.py proofs / python gen.py session --scenes '[...]' --output ./dir
Dependencies: None (pure Python stdlib)

API Response Handling

status_code == 0 → success, download from data.image_urls[0]
status_code == 1033/1026 → content audit rejection → auto-retry with toned-down prompt
status_code == 1000 with subject_reference → fallback to no-ref generation
status_code == 1008 → insufficient balance
Max 2 retries per image, then skip and report
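
The status-code rules above amount to a small dispatch table. The sketch below is one way to express it; the function name and the action labels are hypothetical, and the final fallback for unlisted codes is an assumption.

```python
def next_action(status_code, used_reference=False):
    """Map an API status code to the handling step listed above."""
    if status_code == 0:
        return "download"                    # fetch data.image_urls[0]
    if status_code in (1033, 1026):
        return "retry_toned_down"            # content audit rejection
    if status_code == 1000 and used_reference:
        return "retry_without_reference"     # fall back to no-ref generation
    if status_code == 1008:
        return "abort_insufficient_balance"
    return "report_error"                    # assumption: other codes are surfaced as-is
```

Keeping the decision separate from the network call makes the retry policy testable without spending API credits.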

Subject Reference (Character Anchor)

Use image_file field with data:image/png;base64,... URI (local files)
Or image_file with public URL
⚠️ image_base64 field is BROKEN on MiniMax — always use image_file
Dual-ref strategy: front ref (face lock) + side ref (body lock)
Reference images are optional — pure prompt works well for native MiniMax style characters
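
A sketch of the two pieces described above: turning a local image into a `data:` URI for `image_file`, and choosing the front or side reference by composition. The file names follow the reference/ directory layout in this document; the helper names and the `"type": "character"` key in the payload entry are assumptions.

```python
import base64


def to_data_uri(image_bytes, mime="image/png"):
    """Encode raw image bytes as a data: URI suitable for image_file."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def pick_reference(composition):
    """Side reference locks the body for full-body shots; front locks the face otherwise."""
    return "ref-side.png" if "full" in composition.lower() else "ref-front.png"


def subject_reference(image_bytes):
    # Assumption: the list-of-dicts shape and the "type" key are illustrative;
    # the source only confirms that the image travels in `image_file`.
    return [{"type": "character", "image_file": to_data_uri(image_bytes)}]
```

A public URL can be placed in `image_file` directly, in which case no encoding step is needed.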

Layer 1 — Character Sheet

File

Path: ~/.workbuddy/identity-studio/character-sheet.yaml
Format: YAML with structured identity fields
Status: meta.confirmed: true/false

Fields & Their Impact on Consistency

| Field | Impact | Notes |
|-------|--------|-------|
| reference_image | ★★★★★ | Visual anchor; critical for non-native characters |
| hair.color | ★★★★☆ | Most important text field in prompt |
| face.eyes | ★★★★☆ | Second most important text field |
| hair.signature_accessory | ★★★☆☆ | Unique identifier, disappears if omitted |
| personality.core_vibe | ★★★☆☆ | Controls maturity/age impression |
| body | ★★☆☆☆ | Only matters for full-body shots |
| art_style | ★☆☆☆☆ | MiniMax defaults to anime, minor effect |

Prompt Rendering Rules

# CharacterSheet.render(composition, garment) produces the fixed segment:
#
# Always: "beautiful mature anime girl, with {hair_color},
#          {eyes}, {signature_accessory}, {skin},
#          {art_base}, {quality}, {maturity_suffix}"
#
# + medium: adds half_body_type + curve_template(garment)
# + full_body: adds full_body_type + curve + legs + stance
# + loose outfit detected: adds loose_outfit_boost

CRITICAL RULES:

The identity part of the Character Sheet render output is identical across ALL scenes; only the body description varies, and only with composition
Maturity suffix is always appended automatically
Audit-safe body descriptions are built into curve/legs templates
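
The rendering rules can be illustrated with a small dataclass. This is a sketch under assumptions, not the `CharacterSheet` class in gen.py: the class name `SheetSketch`, the body of `curve_template`, the explicit `loose` flag (the source does not show how loose outfits are detected), and the sample field values are all hypothetical. The field names and the fixed template come from the rules above.

```python
from dataclasses import dataclass


@dataclass
class SheetSketch:
    hair_color: str
    eyes: str
    signature_accessory: str
    skin: str
    art_base: str
    quality: str
    maturity_suffix: str
    half_body_type: str = ""
    full_body_type: str = ""
    legs: str = ""
    stance: str = ""
    loose_outfit_boost: str = ""

    def curve_template(self, garment):
        # Assumption: the real audit-safe template text is not shown in the source.
        return f"wearing {garment}"

    def render(self, composition, garment, loose=False):
        # Identity segment: always present and always in the same order.
        parts = [
            f"beautiful mature anime girl, with {self.hair_color}",
            self.eyes, self.signature_accessory, self.skin,
            self.art_base, self.quality, self.maturity_suffix,
        ]
        # Body segment: depends only on composition.
        if composition == "medium":
            parts += [self.half_body_type, self.curve_template(garment)]
        elif composition == "full_body":
            parts += [self.full_body_type, self.curve_template(garment),
                      self.legs, self.stance]
        if loose:
            parts.append(self.loose_outfit_boost)
        return ", ".join(p for p in parts if p)


sheet = SheetSketch(
    "silver short hair", "amber eyes", "star hairpin", "fair skin",
    "anime style", "high quality", "adult woman",
    half_body_type="slim build", full_body_type="balanced proportions",
    legs="showing shapely legs", stance="standing relaxed",
    loose_outfit_boost="silhouette still visible",
)
print(sheet.render("medium", "white sweater"))
```

Because the identity segment is built before any composition branch, every composition starts with the same text, which is the consistency property the rules call for.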

Layer 2 — Scene Card

Structure

SceneCard:
  name: str          # Scene name
  location: str      # Environment/setting
  time_of_day: str   # Time context
  atmosphere: str    # Mood/atmosphere
  garment: str       # Outfit (injected into curve_template)
  accessories: str   # Props/accessories
  pose: str          # Action/posture
  expression_override: str  # Expression (empty = use character default)
  lighting: str      # Lighting setup
  composition: str   # Shot type (close-up / medium / full body)
  aspect_ratio: str  # Image ratio
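
The structure above maps naturally onto a dataclass. In this sketch the class name `SceneCardSketch`, the default values, and the field order inside `render()` are assumptions; the field names are the ones listed above. `garment` and `expression_override` are left out of the scene segment because the document routes them through the character render and the expression step.

```python
from dataclasses import dataclass


@dataclass
class SceneCardSketch:
    name: str = ""
    location: str = ""
    time_of_day: str = ""
    atmosphere: str = ""
    garment: str = ""
    accessories: str = ""
    pose: str = ""
    expression_override: str = ""
    lighting: str = ""
    composition: str = "medium shot"
    aspect_ratio: str = "3:4"

    def render(self):
        """Join the non-empty scene fields into the variable prompt segment."""
        # Assumption: this ordering is illustrative only.
        order = ["pose", "accessories", "location", "time_of_day",
                 "atmosphere", "lighting", "composition"]
        return ", ".join(getattr(self, key) for key in order if getattr(self, key))


card = SceneCardSketch(
    name="cafe",
    location="cozy cafe by window",
    pose="resting chin on hand",
    lighting="golden afternoon light",
)
print(card.render())
```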

❌ Scene Card MUST NOT contain:

| Forbidden | Reason |
|-----------|--------|
| Hair color/style | Managed by Character Sheet |
| Eye color/shape | Managed by Character Sheet |
| Facial features | Managed by Character Sheet |
| Body type/figure | Auto-injected by Character Sheet |
| Age descriptors | Managed by Character Sheet |
| Art style | Managed by Character Sheet |
| Character name | Managed by Character Sheet |
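
One way to catch identity traits leaking into a Scene Card is a pattern check before prompt assembly. The sketch below is an assumption about how such a guard could look, not a description of gen.py: the pattern list is deliberately small, covers only three of the forbidden categories, and the function name `identity_leaks` is hypothetical. Patterns require a color word so that expressions such as "warm tender eyes" are not flagged.

```python
import re

COLOR = r"(?:silver|blonde|black|brown|red|pink|blue|green|amber|golden|violet)"

# Assumption: an illustrative subset of the forbidden categories.
IDENTITY_PATTERNS = {
    "hair color": rf"\b{COLOR}\s+(?:\w+\s+)?hair\b",
    "eye color": rf"\b{COLOR}\s+eyes?\b",
    "age": r"\b(?:teenage|\d+[- ]year[- ]old)\b",
}


def identity_leaks(scene_fields):
    """Return {field_name: [violated categories]} for a dict of Scene Card strings."""
    leaks = {}
    for field_name, text in scene_fields.items():
        hits = [category for category, pattern in IDENTITY_PATTERNS.items()
                if re.search(pattern, text, re.IGNORECASE)]
        if hits:
            leaks[field_name] = hits
    return leaks


print(identity_leaks({"pose": "tossing her silver long hair"}))
```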

Expression System

Scene has expression_override → use override
Scene has no expression_override → use CharacterSheet.default_expression

Expression levels (shallow → deep):

# Level 1 — Casual/Cute
giving the viewer a gentle warm smile
looking over her shoulder at the viewer with a playful wink

# Level 2 — Intimate/Romantic
gazing at the viewer with a mysterious confident smile
looking warmly at the viewer with tender loving eyes

# Level 3 — Emotional/Deep
gazing at the viewer with warm tender eyes
eyes slightly moist with emotion, lips parted

# Level 4 — Alluring/Bold
looks over her shoulder with a confident half-smile
leaning forward with a happy surprised smile and flushed cheeks

Workflow — Three Phases

Phase 1: Identity Initialization (Required)

> ❌ No confirmed identity = no scene generation

1. Load or create character-sheet.yaml
2. Generate 3 standard proof images (gen.py proofs):
   - portrait-front.png — Front close-up
   - portrait-3quarter.png — 3/4 medium shot
   - full-body.png — Full body standing
   → White background, focus on character identity
3. User review:
   - ✅ Approved → meta.confirmed = true, lock sheet
   - ❌ Rejected → adjust fields, regenerate
   - 🔄 Tweak → modify specific fields
4. Output: character-sheet.yaml + reference/ directory

Phase 2: Scene Generation (Requires confirmed identity)

Prerequisites: character-sheet.yaml meta.confirmed == true

1. Design scene → create SceneCard (DO NOT touch identity traits)
2. Auto-assemble prompt:
   char_prompt = sheet.render(composition, garment)
   expression  = scene.expression_override or sheet.default_expression
   scene_prompt = scene.render()
   final = f"{char_prompt}. {expression}. {scene_prompt}"
3. Call Engine → if audit fails, only downgrade SceneCard (NEVER touch CharacterSheet)
4. Save to sessions/YYYY-MM-DD-theme/
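
The Phase 2 steps can be sketched as three small helpers: the confirmation gate, the prompt assembly, and the session directory name. The helper names are hypothetical, and `require_confirmed` assumes the parsed character-sheet.yaml is available as a plain dict; the assembly format string and the `sessions/YYYY-MM-DD-theme/` layout are taken from the steps above.

```python
from datetime import date
from pathlib import Path


def require_confirmed(sheet):
    """Refuse scene generation unless meta.confirmed is true."""
    if not sheet.get("meta", {}).get("confirmed"):
        raise RuntimeError("identity not confirmed: generate and approve proofs first")


def assemble_prompt(char_prompt, scene_prompt, expression_override, default_expression):
    """final = character segment + expression + scene segment."""
    expression = expression_override or default_expression
    return f"{char_prompt}. {expression}. {scene_prompt}"


def session_dir(root, theme, day=None):
    """Build the sessions/YYYY-MM-DD-theme/ path under the runtime data root."""
    day = day or date.today()
    return Path(root) / "sessions" / f"{day.isoformat()}-{theme}"


require_confirmed({"meta": {"confirmed": True}})
print(assemble_prompt("silver hair, amber eyes", "cozy cafe by window",
                      "", "gentle warm smile"))
```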

Phase 3: Identity Tuning (Optional)

When user wants adjustments:
- "Not mature enough" → adjust personality.core_vibe + maturity_suffix
- "Proportions wrong" → adjust body fields
- "Expression too stiff" → adjust default_expression
→ Re-run proofs → user confirms → update reference images

Content Safety — Audit Rules

❌ BANNED keywords (trigger rejection)

bikini, swimsuit, swimwear, lingerie, underwear, bra, backless,
spaghetti strap, naked, nude, topless, sexy body, busty, voluptuous,
bust, chest, hips, hourglass
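
A pre-flight check for the banned keywords can be sketched with a single compiled pattern. The function name `find_banned` is hypothetical and gen.py may handle this differently. Word boundaries matter here: without them, `bra` would match inside "bracelet" and `bust` inside "robust".

```python
import re

BANNED = [
    "bikini", "swimsuit", "swimwear", "lingerie", "underwear", "bra", "backless",
    "spaghetti strap", "naked", "nude", "topless", "sexy body", "busty",
    "voluptuous", "bust", "chest", "hips", "hourglass",
]

# Longest alternatives first, each anchored on word boundaries.
_PATTERN = re.compile(
    r"\b(?:" + "|".join(re.escape(w) for w in sorted(BANNED, key=len, reverse=True)) + r")\b",
    re.IGNORECASE,
)


def find_banned(prompt):
    """Return the banned keywords found in a prompt, lowercased, in order of appearance."""
    return [match.group(0).lower() for match in _PATTERN.finditer(prompt)]


print(find_banned("Backless dress showing bust"))
```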

✅ Safe body description: "describe the clothing, not the body"

| Want | Safe Phrasing | Dangerous Phrasing |
|------|---------------|--------------------|
| Curves | `form-fitting dress that hugs her curves` | `curvy body` |
| Neckline | `V-neck dress with elegant draping` | `showing bust` |
| Legs | `showing shapely legs` | `long sexy legs` |
| Collarbone | `delicate collarbone showing` | None; safe to write directly |

Audit-failure Handling

Engine auto-retries up to 2 times with toned-down prompt
Only Scene Card gets modified (outfit/pose toned down)
Character Sheet NEVER changes due to audit
If 2 retries fail, skip and report
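
The retry policy above can be sketched as a loop that only ever rewrites the scene. The function and parameter names are hypothetical: `call_engine` stands in for the API call and returns a status code, and `tone_down` stands in for whatever softening gen.py applies to the outfit and pose.

```python
def generate_with_retries(scene, call_engine, tone_down, max_retries=2):
    """Try once, then retry up to max_retries times with a toned-down scene.

    The Character Sheet is never passed in, so it cannot be modified here.
    """
    attempt_scene = dict(scene)  # work on a copy; the caller's scene stays intact
    for attempt in range(max_retries + 1):
        status = call_engine(attempt_scene)
        if status == 0:
            return {"ok": True, "attempts": attempt + 1, "scene": attempt_scene}
        if status not in (1033, 1026):
            # Not an audit rejection: toning down would not help.
            return {"ok": False, "attempts": attempt + 1, "reason": f"status {status}"}
        if attempt < max_retries:
            attempt_scene = tone_down(attempt_scene)
    return {"ok": False, "attempts": max_retries + 1,
            "reason": "audit rejected after retries"}
```

A caller that gets `ok: False` skips the image and reports the reason, matching the last rule above.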

Cosplay Mode

When generating cosplay images, the result must be "[Character] cosplaying as [Target]".

Core Rules

ALWAYS keep: All Character Sheet identity traits (hair color, eye color, accessory)
CAN change: Outfit, props, expression, scene, hairstyle (NOT color)
NEVER change: Hair color, eye color, facial features
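
The core rules can be sketched as a guard that rejects any override touching a locked trait and builds the "[Character] cosplaying as [Target]" subject phrase. The key names in `LOCKED`, the function name, and the shape of the overrides dict are all hypothetical.

```python
# Assumption: hypothetical key names for the locked identity traits.
LOCKED = {"hair_color", "eye_color", "facial_features", "signature_accessory"}


def build_cosplay(character_name, target, overrides):
    """Return (subject_phrase, scene_overrides); raise if a locked trait is overridden."""
    blocked = LOCKED & overrides.keys()
    if blocked:
        raise ValueError(f"cosplay may not change: {', '.join(sorted(blocked))}")
    subject = f"{character_name} cosplaying as {target}"
    return subject, dict(overrides)


subject, scene = build_cosplay(
    "Character A", "Mai Sakurajima",
    {"garment": "school uniform", "hairstyle": "long straight"},
)
print(subject)
```

Hairstyle is accepted as an override while hair color is not, which mirrors the CAN change / NEVER change split above.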

File Structure

skills/agent-identity-studio/
├── SKILL.md                     ← This file (AI reads this)
├── README.md                    ← User-facing documentation
├── scripts/
│   └── gen.py                   ← Generation engine (zero dependencies)
├── references/
│   ├── audit-guide.md           ← Content safety rules
│   └── prompt-templates.md      ← Scene & expression templates
└── assets/

~/.workbuddy/identity-studio/    ← Runtime data (per-user)
├── character-sheet.yaml         ← Identity definition
├── reference/                   ← Proof images (Phase 1 output)
│   ├── ref-front.png            ← Front reference (face lock)
│   ├── ref-side.png             ← Side reference (body lock)
│   ├── portrait-front.png
│   ├── portrait-3quarter.png
│   └── full-body.png
└── sessions/                    ← Generated image sessions
    └── YYYY-MM-DD-theme/

Quick Reference

Generate Identity Proofs

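from pathlib import Path

from gen import CharacterSheet, generate_character_proofs

# Load the Character Sheet defined in Layer 1.
sheet = CharacterSheet.load()
# Write the three standard proof images into the reference/ directory.
results = generate_character_proofs(Path.home() / ".workbuddy/identity-studio/reference")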

Generate Photo Session

from pathlib import Path

from gen import generate_session

# Each dict holds Scene Card fields only; identity traits stay in the Character Sheet.
scenes = [
    {"name": "cafe", "location": "cozy cafe by window", "garment": "cream knit cardigan",
     "pose": "resting chin on hand", "expression_override": "warm smile at viewer",
     "lighting": "golden afternoon light", "composition": "medium shot"},
]
# Images are saved under sessions/YYYY-MM-DD-theme/.
results = generate_session(scenes, Path.home() / ".workbuddy/identity-studio/sessions/2026-04-07-cafe")

Version history

1 version in total

v1.0.0 Initial release (current)

2026-06-04 15:09
