← 返回
内容创作 Key 中文

Nano Banana 2 Image Generation&Editing

Generate and edit images using Google's Nano Banana 2 (Imagen) model — the latest high-quality image generation AI. Supports text-to-image generation and ima...
{"answer":"使用 Google Nano Banana 2 (Imagen) 模型生成与编辑图像——最新高质量图像生成AI。支持文生图及图..."}
xixihhhh
内容创作 clawhub v1.0.9 3 版本 99831.1 Key: 需要
★ 2
Stars
📥 1,142
下载
💾 146
安装
3
版本
#latest

概述

Nano Banana 2 Image Generation

Generate and edit images using Google's Nano Banana 2 (Imagen) model via two provider options.

> Privacy & data note: This skill sends text prompts and image data to third-party APIs (Atlas Cloud at api.atlascloud.ai or Google AI Studio at generativelanguage.googleapis.com) for image generation. For image editing via Atlas Cloud, local files are uploaded to Atlas Cloud's temporary storage to obtain a URL — the agent MUST ask the user for explicit confirmation before uploading any local file. Uploaded files are temporary and may be cleaned up periodically. No data is stored locally beyond the downloaded output files.

Required Environment Variables

VariableRequiredDescription
----------:--------:-------------
ATLASCLOUD_API_KEYIf using Atlas CloudAtlas Cloud API key for image generation
GEMINI_API_KEYIf using Google AI StudioGoogle AI Studio API key

At least one of the above must be set. If both are set, ask the user which provider to use.

Provider Selection

  1. If ATLASCLOUD_API_KEY is set → use Atlas Cloud
  2. If GEMINI_API_KEY is set → use Google AI Studio
  3. If both are set → ask the user which provider to use
  4. If neither is set → ask the user to configure one:
    • Atlas Cloud: Sign up at https://www.atlascloud.ai, Console → API Keys → Create key, then export ATLASCLOUD_API_KEY="your-key"
    • Google AI Studio: Get key from https://aistudio.google.com/apikey, then export GEMINI_API_KEY="your-key"

Atlas Cloud

  • Async API with polling workflow
  • Flat-rate pricing regardless of resolution
  • Supports 300+ models through one API key

Google AI Studio

  • Direct access via Google's Gemini API
  • Synchronous response with base64 image output

Pricing Comparison

ResolutionGoogle AI StudioAtlas CloudSavings
:----------::----------------::-----------::-------:
1K (default)$0.080/image$0.072/image10% off
2K$0.080/image$0.072/image10% off
4K$0.080/image$0.072/image10% off

Atlas Cloud is 10% cheaper than Google AI Studio across all resolutions, with flat-rate pricing regardless of resolution.

Available Models

Text-to-Image Models

Model ID (Atlas Cloud)PriceDescription
--------------------------------------------
google/nano-banana-2/text-to-image$0.072/imageStable, production-ready

Image Editing Models

Model ID (Atlas Cloud)PriceDescription
--------------------------------------------
google/nano-banana-2/edit$0.072/imageStable image editing

Google AI Studio model: gemini-3.1-flash-image-preview (handles both generation and editing)


Mode 1: Atlas Cloud API

Setup

The user needs an Atlas Cloud API key. Guide them to:

  1. Sign up at https://www.atlascloud.ai
  2. Go to Console → API Keys → Create new key
  3. Set environment variable: export ATLASCLOUD_API_KEY="your-key"

Script Usage

This skill includes a Python script for image generation. Zero external dependencies required.

List available image models

python scripts/generate_image.py list-models

Generate an image

python scripts/generate_image.py generate \
  --model "MODEL_ID" \
  --prompt "Your prompt here" \
  --output ./output

Upload a local image (for editing)

python scripts/generate_image.py upload ./local-image.jpg

Edit an image

python scripts/generate_image.py generate \
  --model "MODEL_ID" \
  --prompt "Edit instruction" \
  --image "https://...uploaded-url..."

Run python scripts/generate_image.py generate --help for all options. Extra model params can be passed as key=value (e.g. aspect_ratio=16:9 resolution=2k).


Text-to-Image Generation

Parameters:

ParameterTypeRequiredDefaultOptions
---------------------------------------------
promptstringYes-Text description of the image
aspect_ratiostringNo1:11:1, 3:2, 2:3, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
resolutionstringNo1k1k, 2k, 4k
output_formatstringNopngpng, jpeg
seedintegerNorandomFor reproducible results

Workflow — submit, poll, download:

# Step 1: Submit generation request
curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/nano-banana-2/text-to-image",
    "prompt": "A serene Japanese garden with cherry blossoms",
    "aspect_ratio": "16:9",
    "resolution": "2k"
  }'
# Response: { "code": 0, "data": { "id": "prediction-id" } }

# Step 2: Poll for result (repeat until status is "completed" or "succeeded")
curl -s "https://api.atlascloud.ai/api/v1/model/prediction/{prediction-id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"
# Response when done: { "code": 0, "data": { "status": "completed", "outputs": ["https://...image-url..."] } }

# Step 3: Download the image
curl -o output.png "IMAGE_URL_FROM_OUTPUTS"

When implementing this workflow programmatically:

  • Poll every 2-3 seconds
  • Check for status: "completed" or "succeeded" means done
  • Check for status: "failed" means error — read the error field
  • Image URLs are in data.outputs[] array

Uploading Local Images

To use local images for editing, first upload them to get a URL. The agent MUST confirm with the user before uploading any local file (e.g., "I'll upload /path/to/image.jpg to Atlas Cloud for editing. Proceed?").

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/uploadMedia" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -F "file=@/path/to/local/image.jpg"
# Returns: { "code": 200, "data": { "download_url": "https://...url...", "filename": "image.jpg", "size": 123456 } }

Use the returned download_url as the image URL in the images array for editing requests.

> Note: Uploaded files are for temporary use with Atlas Cloud generation tasks only. URLs may expire after a period of time.

Image Editing

Same workflow as text-to-image, but with additional images parameter:

ParameterTypeRequiredDefaultOptions
---------------------------------------------
promptstringYes-Editing instruction
imagesarray of stringsYes-1-14 image URLs to edit
aspect_ratiostringNo-Same options as above
resolutionstringNo1k1k, 2k, 4k
curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateImage" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/nano-banana-2/edit",
    "prompt": "Change the sky to a dramatic sunset",
    "images": ["https://example.com/photo.jpg"],
    "resolution": "2k"
  }'

Using Atlas Cloud MCP Tools (if available)

If the user has the Atlas Cloud MCP server configured, use the built-in tools directly:

# Quick generate
atlas_quick_generate(model_keyword="nano banana 2", type="Image", prompt="...")

# Or with specific model
atlas_generate_image(model="google/nano-banana-2/text-to-image", params={...})

# Check result
atlas_get_prediction(prediction_id="...")

Mode 2: Google AI Studio (Official)

Setup

  1. Get API key from https://aistudio.google.com/apikey
  2. Set environment variable: export GEMINI_API_KEY="your-key"

Text-to-Image Generation

curl -s -X POST \
  "https://generativelanguage.googleapis.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{
      "parts": [{"text": "A serene Japanese garden with cherry blossoms"}]
    }],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"],
      "imageConfig": {
        "aspectRatio": "16:9",
        "imageSize": "2K"
      }
    }
  }'

Parameters for Google AI Studio:

ParameterLocationOptions
------------------------------
aspectRatiogenerationConfig.imageConfig1:1, 1:4, 1:8, 2:3, 3:2, 3:4, 4:1, 4:3, 4:5, 5:4, 8:1, 9:16, 16:9, 21:9
imageSizegenerationConfig.imageConfig512px, 1K, 2K, 4K (uppercase K required)
responseModalitiesgenerationConfig["TEXT", "IMAGE"] for image output

Response handling:

The response contains base64-encoded image data in candidates[0].content.parts[]. Loop through parts — text parts have .text, image parts have .inline_data.mime_type and .inline_data.data (base64).

Image Editing (Google AI Studio)

Include the source image as base64 inline_data alongside the text prompt:

curl -s -X POST \
  "https://generativelanguage.googleapis.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [{
      "parts": [
        {"text": "Change the sky to a dramatic sunset"},
        {"inline_data": {
          "mime_type": "image/png",
          "data": "BASE64_ENCODED_IMAGE"
        }}
      ]
    }],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"]
    }
  }'

Python Example (Google AI Studio)

from google import genai
from google.genai import types
import base64

client = genai.Client()

# Text-to-Image
response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents="A serene Japanese garden with cherry blossoms",
    config=types.GenerateContentConfig(
        response_modalities=['TEXT', 'IMAGE'],
        image_config=types.ImageConfig(
            aspect_ratio="16:9",
            image_size="2K"
        ),
    )
)

for part in response.parts:
    if part.text:
        print(part.text)
    elif image := part.as_image():
        image.save("output.png")

Implementation Guide

When the user asks to generate an image, follow this workflow:

  1. Determine provider: Check which API key is available (see Provider Selection above).
  1. Extract parameters from user request:
    • Prompt: the image description
    • Aspect ratio: infer from context (banner→16:9, portrait→9:16, square→1:1, phone wallpaper→9:16, desktop wallpaper→16:9)
    • Resolution: default 1k unless user wants high quality (then 2k or 4k)
    • For editing: identify source image(s)
  1. Choose model (Atlas Cloud only):
    • Use google/nano-banana-2/text-to-image for generation
    • Use google/nano-banana-2/edit for editing tasks
  1. Execute the API call using bash with curl
  1. For Atlas Cloud: Poll the prediction endpoint every 3 seconds until complete, then download the image
  1. For Google AI Studio: Parse the response, extract base64 image data, save to file
  1. Present the result: Show the saved file path and offer to open it

Prompt Engineering Tips

Share these with users to get better results:

  • Be specific about style: "oil painting", "photorealistic", "anime style", "watercolor"
  • Describe lighting: "golden hour", "studio lighting", "neon glow"
  • Mention composition: "close-up", "wide angle", "bird's eye view"
  • Include mood: "serene", "dramatic", "whimsical"
  • For text in images: Nano Banana 2 handles text rendering well — just include the text in quotes in your prompt

版本历史

共 3 个版本

  • v1.0.9 当前
    2026-03-29 04:29 安全 安全
  • v1.0.1
    2026-03-26 21:41
  • v1.0.5
    2026-03-18 13:51

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

YouTube

byungkyu
使用托管OAuth集成YouTube Data API,支持搜索视频、管理播放列表、获取频道数据及评论互动,适用于用户需要时使用此技能。
★ 142 📥 41,074
content-creation

AdMapix

fly0pants
广告情报与应用数据分析助手,支持搜索广告素材、分析应用排名、下载量、收入及市场洞察,用于广告素材和竞品分析。
★ 295 📥 136,490
content-creation

Humanizer

biostartechnology
消除AI写作痕迹,使文本更自然真实。基于维基百科"AI写作特征"指南,识别并修正夸张象征、宣传用语、肤浅-ing分析、模糊归因、破折号滥用、三项排比、AI词汇、负面平行结构及冗长连接词等模式。
★ 860 📥 199,843