← 返回
未分类 中文

Image Prompt Engineer

Expert photography prompt engineering skill for AI image generation. Use when: generating prompts for Midjourney/DALL-E/Stable Diffusion/Flux, creating produ...
专业摄影提示工程技能,用于AI图像生成。适用于Midjourney、DALL·E、Stable Diffusion、Flux等平台的提示词生成及产品摄影等场景。
tyronecoh tyronecoh 来源
未分类 clawhub v1.0.1 1 版本 99855.9 Key: 无需
★ 0
Stars
📥 693
下载
💾 0
安装
1
版本
#dall-e#flux#image#latest#midjourney#photography#prompt#stable-diffusion

概述

Image Prompt Engineer 📷

Expert at crafting detailed, structured prompts for AI image generation tools (Midjourney, DALL-E, Stable Diffusion, Flux).

Core Workflow

  1. Concept Intake — Understand visual goal, platform, style, brand requirements
  2. Reference Analysis — Lighting, composition, style elements from references
  3. Prompt Construction — Layer: Subject → Environment → Lighting → Technical → Style
  4. Optimization — Negative prompts, platform-specific syntax, quality enhancers
  5. Documentation — Save successful patterns

Prompt Structure Framework

Layer 1: Subject

- Primary subject (person, object, scene)
- Details: age, ethnicity, expression, attire, textures, materials
- Interaction with environment
- Scale and proportion

Layer 2: Environment

- Location type (studio, outdoor, urban, natural, interior)
- Environmental details (weather, time of day, textures)
- Background treatment (sharp, blurred, gradient, minimalist)
- Atmospheric conditions (fog, rain, haze, clarity)

Layer 3: Lighting

- Light source (golden hour, overcast, softbox, neon, rim light)
- Light direction (front, side, back, Rembrandt, butterfly, split)
- Light quality (hard/soft, diffused, specular, volumetric)
- Color temperature (warm, cool, neutral, mixed)

Layer 4: Technical (Photography Specs)

- Camera perspective (eye-level, low angle, bird's eye, worm's eye)
- Focal length effect (wide angle, telephoto compression, standard)
- Depth of field (shallow for portrait, deep for landscape)
- Exposure style (high key, low key, balanced, HDR, silhouette)

Layer 5: Style

- Photography genre (portrait, fashion, editorial, commercial, documentary, fine art)
- Era/period (vintage, contemporary, retro, futuristic, timeless)
- Post-processing (film emulation, color grading, contrast, grain)
- Reference photographers (Annie Leibovitz, Peter Lindbergh, etc.)

Genre Templates

Portrait

[Subject: age, ethnicity, expression, attire] |
[Pose and body language] |
[Background treatment] |
[Lighting: key, fill, rim, hair light] |
[Camera: 85mm, f/1.4, eye-level] |
[Style: editorial/fashion/corporate/artistic] |
[Color palette and mood] |
[Reference photographer]

Product Photography

[Product description with materials and details] |
[Surface/backdrop description] |
[Lighting: softbox positions, reflectors, gradients] |
[Camera: macro/standard, angle, distance] |
[Hero shot/lifestyle/detail/scale context] |
[Brand aesthetic alignment] |
[Post-processing: clean/moody/vibrant]

Landscape

[Location and geological features] |
[Time of day and atmospheric conditions] |
[Weather and sky treatment] |
[Foreground, midground, background] |
[Camera: wide angle, deep focus, panoramic] |
[Light quality and direction] |
[Color palette: natural/enhanced/dramatic] |
[Style: documentary/fine art/ethereal]

Fashion

[Model description and expression] |
[Wardrobe details and styling] |
[Hair and makeup direction] |
[Location/set design] |
[Pose: editorial/commercial/avant-garde] |
[Lighting: dramatic/soft/mixed] |
[Camera movement: static/dynamic] |
[Magazine/campaign aesthetic reference]

Platform Syntax

Midjourney

/imagine prompt: [subject] --ar 16:9 --v 6 --style raw --chaos 5 --seed [n]
--ar     → aspect ratio
--v      → version (5, 6, etc.)
--style  → style mode
--chaos  → variation (0-100)
--seed   → reproducibility
--no     → negative prompt
::       → weighted emphasis

DALL-E

Natural language, conversational
Style mixing: "in the style of [X] mixed with [Y]"
Be specific about what you want

Stable Diffusion

[subject], [details], [lighting], [style]
Negative: [unwanted elements]
(lora:model:weight) → LoRA weighting
[token:weight] → explicit weighting

Flux

Detailed natural language descriptions
Photorealistic emphasis
Less need for photography jargon

Negative Prompts (Midjourney/SD)

--no blurry, low quality, distorted, watermark, text, logo, noisy
(negative weighting where supported)

Photography Terminology (Use Correctly)

❌ Vague✅ Technical
----------------------
Blurry backgroundShallow depth of field, f/1.8 bokeh
Big pictureWide-angle, 24mm, environmental portrait
Dark shadowsDeep shadows, high contrast, Rembrandt lighting
Nice lightingSoft golden hour, butterfly lighting, rim light
Old lookingFilm grain, Kodak Portra 400, faded contrast

Success Metrics

  • Generated images match concept ≥ 90% first attempt
  • Consistent results across generations
  • Technical elements (lighting, DOF, composition) render accurately
  • Minimal iteration needed
  • Suitable for professional/commercial use

Reference Files

  • references/platform-syntax.md — Platform-specific syntax cheat sheet
  • references/photography-terms.md — Correct photography terminology
  • references/lighting-patterns.md — Lighting setups and effects
  • references/film-emulation.md — Film stock references and looks

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-05-03 05:06 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

self-improving agent

pskoett
捕获经验教训、错误及修正内容,以实现持续改进。适用于以下场景:(1)命令或操作意外失败;(2)用户纠正Claude(如“不,那不对……”“实际上……”);(3)用户请求的功能不存在;(4)外部API或工具出现故障;(5)Claude发现自身
★ 4,109 📥 830,764
ai-agent

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,398 📥 323,041
ai-agent

Find Skills

guipi888
场景驱动+关键词双模式技能发现工具。当用户用自然语言描述场景/需求(如"我想做一个海报""帮我分析股票"),或明确说"安装技能/find skills/找个skill"时,自动从官方内置、本地已安装、SkillHub、虾评、GitHub、C
★ 1,472 📥 535,603