← 返回
内容创作 中文

Nano Banana

Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
基于结构化创意简报(Gemini 3 风格)的推理驱动图像生成,通过 muapi.ai 以逻辑化提示生成高保真图像
anil-matcha
内容创作 clawhub v0.1.0 1 版本 99864.3 Key: 无需
★ 0
Stars
📥 736
下载
💾 29
安装
1
版本
#latest

概述

🍌 Nano-Banana Expert Skill (Gemini 3 Style)

A specialized skill for AI Agents to leverage "Reasoning-Driven" image generation.

Based on the advanced prompting architecture of Google's Gemini 3 (Nano Banana Pro), this skill moves beyond keyword stuffing to structured, logic-based creative briefs.

Core Competencies

  1. Reasoning-Driven Prompting: Using natural language logic to define physics, lighting, and spatial relationships.
  2. Structured Creative Briefs: Implementing the "Perfect Prompt" formula: Subject + Action + Context + Composition + Lighting.
  3. Text Rendering Precision: Explicitly defining typography and signifiers for legible text integration.
  4. Contextual Grounding: Using "Search Grounding" logic (simulated) to anchor generations in real-world accuracy.

🏗️ Technical Specification

1. The "Perfect Prompt" Formula

ComponentDescriptionExample
:---:---:---
SubjectDetailed entity description"A stoic robot barista with exposed copper wiring"
ActionDynamic interaction"Pouring a latte art leaf with mechanical precision"
ContextEnvironment & Atmosphere"Inside a neon-lit cyberpunk cafe at midnight"
CompositionCamera & Lens choice"Close-up, 85mm lens, f/1.8 aperture"
LightingMood & Direction"Volumetric blue rim light, warm cafe glow"
StyleAesthetic anchor"Cinematic, photorealistic, 4K production value"

2. Advanced Features

  • Negative Constraint Logic: Instead of "no blurry," use "Ensure sharp focus on the subject's eyes."
  • Identity Consistency: (Simulated) "Maintain consistent facial structure across variations."
  • Text Integration: Use double quotes for specific text: The sign reads "OPEN 24/7".

🧠 Prompt Optimization Protocol (Agent Instruction)

Before calling the script, the Agent MUST rewrite the user's prompt into a logic-driven Reasoning Brief:

  1. NO KEYWORD SOUP: Remove "8k, masterpiece, ultra-detailed." Use full, descriptive sentences.
  2. PHYSICAL CONSISTENCY: Describe how elements interact (e.g., "The light from the crystal shards casts caustic patterns across the obsidian floor").
  3. TEXT PRECISION: If the user wants text, define it precisely: featuring a sign that says "STORE NAME" in a weathered serif font.
  4. OPTICAL DIRECTIVES: Specify lens behavior: Shallow Depth of Field (f/1.8), Macro Lens, Anamorphic Flare.

🚀 Protocol: Using Nano-Banana

Step 1: Define the Creative Logic

Provide the agent with a subject and a specific scenario.

Step 2: Invoke the Script

The generate-nano-art.sh script translates the logic into a structured Gemini 3-style prompt.

# Generating a reasoning-driven image
bash scripts/generate-nano-art.sh \
  --subject "a glass chess piece" \
  --action "shattering into liquid shards" \
  --context "on a obsidian table" \
  --style "macro photography"

⚠️ Constraints & Guardrails

  • No Keyword Soup: MANDATORY - Do not use "trending on artstation, masterpiece, 8k". Use natural language descriptions.
  • Physics Logic: Ensure the prompt describes physically possible lighting and reflection interactions.
  • Full Sentences: The model parses relationships; use "light reflecting off the water" instead of "water, reflection".

⚙️ Implementation Details

This skill applies a "Logic Wrapper" around the core/media/generate-image.sh primitive, converting fragmented inputs into a coherent, reasoning-ready narrative prompt.

版本历史

共 1 个版本

  • v0.1.0 当前
    2026-03-19 17:00 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,873
content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,494
ai-intelligence

Workflow

anil-matcha
构建、运行和可视化多步AI生成工作流。AI架构师将自然语言描述转为连接节点图。
★ 0 📥 724