Nano Banana 2 is the flash-tier text-to-image model in Google's Gemini family. This skill generates images with Nano Banana 2 hosted on the RunComfy Model API: run runcomfy run google/nano-banana-2/text-to-image from your terminal, with no Google API key and no GPU rental.
Nano Banana 2 is Google's second-generation flash-tier image model, the iteration-speed-first variant in the Nano Banana line. Three properties make Nano Banana 2 distinct:
Quote "AURA" in the Nano Banana 2 prompt and the literal word appears in the image; Nano Banana doesn't paraphrase or scramble.
Nano Banana 2 also exposes an optional web-grounded context flag (enable_web_search) for image generation that references current events or real entities. Web grounding adds latency and cost, so it is off by default.
Pick Nano Banana 2 when any of these is true:
The prompt references current events or real entities and needs enable_web_search.
The Nano Banana family has two production tiers. Pick the right Nano Banana variant based on intent:
If the user said just "Nano Banana" without specifying 2 vs Pro, default to Nano Banana 2 for ideation, batches, and typography work; default to Nano Banana Pro for portrait fidelity.
npm i -g @runcomfy/cli
runcomfy login opens a browser device-code flow.
Set RUNCOMFY_TOKEN= instead of running runcomfy login.
google/nano-banana-2/text-to-image
This is the Nano Banana 2 text-to-image endpoint. The Nano Banana 2 edit endpoint runs on a separate template not covered here.
| Field | Type | Required | Default | Notes |
| --- | --- | --- | --- | --- |
| prompt | string | yes | none | Subject-first description for Nano Banana 2. |
| num_images | int | no | 1 | 1–4. Use 4 for Nano Banana ideation rounds. |
| seed | int | no | 0 | Reuse for reproducible Nano Banana 2 output. |
| aspect_ratio | enum | no | auto | 11 values: auto, 21:9, 16:9, 3:2, 4:3, 5:4, 1:1, 4:5, 3:4, 2:3, 9:16. |
| resolution | enum | no | 1K | 0.5K (Nano Banana drafts), 1K (default), 2K (final), 4K (max). |
| output_format | enum | no | png | png, jpeg, webp. |
| safety_tolerance | int | no | 4 | 1 (strict) – 6 (permissive). |
| limit_generations | bool | no | true | Limit each Nano Banana prompt round to one generation. |
| enable_web_search | bool | no | false | Adds Nano Banana web grounding (extra cost + latency). |
For Nano Banana 2 image edit (preserve subject + apply changes), see the sibling nano-banana-edit skill.
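The field table above can be turned into a small pre-flight check before anything is sent. The following is a minimal Python sketch, assuming the constraints are exactly as listed in the table; validate_input is a hypothetical helper, not part of the RunComfy CLI, and the server-side schema remains the authority.

```python
# Hypothetical pre-flight validator mirroring the field table above.
# Not part of the RunComfy CLI; the authoritative schema lives server-side.

ASPECT_RATIOS = {"auto", "21:9", "16:9", "3:2", "4:3", "5:4",
                 "1:1", "4:5", "3:4", "2:3", "9:16"}
RESOLUTIONS = {"0.5K", "1K", "2K", "4K"}
OUTPUT_FORMATS = {"png", "jpeg", "webp"}
BOOL_FIELDS = {"limit_generations", "enable_web_search"}
KNOWN_FIELDS = {"prompt", "num_images", "seed", "aspect_ratio", "resolution",
                "output_format", "safety_tolerance"} | BOOL_FIELDS


def _is_int(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_input(payload):
    """Return a list of problems; an empty list means the payload looks valid."""
    errors = []
    for key in sorted(set(payload) - KNOWN_FIELDS):
        errors.append(f"unknown field: {key}")
    prompt = payload.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        errors.append("prompt is required and must be a non-empty string")
    if "num_images" in payload:
        n = payload["num_images"]
        if not _is_int(n) or not 1 <= n <= 4:
            errors.append("num_images must be an int from 1 to 4")
    if "seed" in payload and not _is_int(payload["seed"]):
        errors.append("seed must be an int")
    if "safety_tolerance" in payload:
        t = payload["safety_tolerance"]
        if not _is_int(t) or not 1 <= t <= 6:
            errors.append("safety_tolerance must be an int from 1 to 6")
    for field, allowed in (("aspect_ratio", ASPECT_RATIOS),
                           ("resolution", RESOLUTIONS),
                           ("output_format", OUTPUT_FORMATS)):
        if field in payload:
            value = payload[field]
            if not (isinstance(value, str) and value in allowed):
                errors.append(f"{field} must be one of {sorted(allowed)}")
    for field in sorted(BOOL_FIELDS):
        if field in payload and not isinstance(payload[field], bool):
            errors.append(f"{field} must be a bool")
    return errors
```

Returning a list instead of raising on the first problem lets a caller report every issue in one pass, which is handy when a payload is assembled from user-supplied options.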
Default Nano Banana 2 draft (1K, auto aspect ratio, png):
runcomfy run google/nano-banana-2/text-to-image \
  --input '{"prompt": "<Nano Banana prompt>"}' \
  --output-dir <absolute/path>
Nano Banana 2 vertical 4-up batch for ideation:
runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<Nano Banana prompt>",
    "num_images": 4,
    "aspect_ratio": "9:16",
    "resolution": "0.5K"
  }' \
  --output-dir <absolute/path>
Final Nano Banana 2 at 2K with seed lock:
runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<Nano Banana prompt>",
    "resolution": "2K",
    "aspect_ratio": "16:9",
    "seed": 42
  }' \
  --output-dir <absolute/path>
Web-grounded Nano Banana 2 (current event / real entity):
runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<Nano Banana prompt referencing a real-world event from this week>",
    "enable_web_search": true
  }' \
  --output-dir <absolute/path>
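When the invocations above are driven from a script, hand-quoting JSON inside a shell string is easy to get wrong. The sketch below, in Python, builds the argument list with json.dumps and leaves execution to the caller. build_run_command is a hypothetical helper; it uses only the runcomfy run subcommand and the --input and --output-dir flags shown on this page.

```python
import json
import os


def build_run_command(payload, output_dir,
                      model="google/nano-banana-2/text-to-image"):
    """Build the argv list for `runcomfy run` without executing it."""
    # The examples on this page use an absolute output path, so enforce that.
    if not os.path.isabs(output_dir):
        raise ValueError("output_dir should be an absolute path")
    return [
        "runcomfy", "run", model,
        "--input", json.dumps(payload),  # JSON-encode instead of hand-quoting
        "--output-dir", output_dir,
    ]


# Example (requires the CLI to be installed and signed in to actually run):
#   import subprocess
#   cmd = build_run_command({"prompt": "<Nano Banana prompt>", "seed": 42},
#                           "/tmp/nano-banana-out")
#   subprocess.run(cmd, check=True)
```

Passing a list to subprocess.run avoids a shell entirely, so quotes or apostrophes inside the prompt need no extra escaping.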
Nano Banana 2 responds to specific prompting patterns better than naive prose. Apply these for sharper Nano Banana output.
Subject-first declarative grammar. "A cinematic close-up portrait of an American woman standing under neon lights in rainy Tokyo, shallow depth of field, reflective wet streets, ultra-detailed, realistic skin texture" — primary subject first, then action, environment, style, camera. Nano Banana 2 reads early tokens with more weight; front-load the subject.
Exact text quoting for in-image typography. "The label reads 'AURA' in clean bold sans-serif, centered, white on black" — quote the literal characters Nano Banana should render. Specify placement and font style. Don't say "with the brand name on it" and hope Nano Banana figures it out.
Consistent seeds for Nano Banana refinement. Lock seed when iterating a single Nano Banana prompt across small variants — keeps Nano Banana composition stable so you can compare apples to apples.
Web-grounding sparingly. Turn on enable_web_search only when the Nano Banana prompt names current events or real entities. Adds latency + cost; off by default.
Don't mix conflicting styles. A prompt like "minimalist + ornate + retro + cyberpunk" asks for styles that cancel each other out in Nano Banana output. Pick 1–2 anchors.
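The patterns above (subject first, quoted text, at most two style anchors) can be encoded in a small prompt builder. This is an illustrative Python sketch only: build_prompt and its parameters are hypothetical names, and the phrase used for the text clause is one possible wording, not a format the model requires.

```python
def build_prompt(subject, details=(), style_anchors=(), text=None,
                 text_style=None):
    """Assemble a subject-first prompt with optional quoted in-image text."""
    if len(style_anchors) > 2:
        # The guidance above recommends 1-2 style anchors.
        raise ValueError("pick at most two style anchors")
    parts = [subject.strip()]                                # subject first
    parts.extend(d.strip() for d in details if d.strip())    # action, environment, camera
    parts.extend(s.strip() for s in style_anchors if s.strip())
    if text is not None:
        clause = f'the text reads "{text}"'                  # literal characters, quoted
        if text_style:
            clause += f" in {text_style.strip()}"
        parts.append(clause)
    return ", ".join(parts)
```

For example, build_prompt("A matte black ceramic mug on warm-grey paper", ["clean studio lighting"], ["minimalist"], text="Brewed Quietly", text_style="clean bold sans-serif, top-right") puts the mug first and the quoted headline last.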
Nano Banana 2 anti-patterns:
| Use case | Why Nano Banana 2 |
| --- | --- |
| Marketing draft thumbnails (batch of 4) | Fast iteration at 0.5K, then promote the winner to 2K |
| Social-platform-native formats | Wide aspect-ratio support, including 9:16, 4:5, and 21:9 |
| In-image typography for posters / cards | Predictable text rendering when the characters are quoted |
| Web-grounded current-event imagery | enable_web_search integrates fresh information |
| Reproducible variant testing | A fixed seed keeps framing consistent across variants |
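The draft-then-promote workflow in the first row can be sketched as a payload transformation: keep the prompt, aspect ratio, and seed, and change only the resolution and batch size. promote_to_final is a hypothetical helper, and the page does not state how closely a higher-resolution rerun matches a given 0.5K draft, so treat this as a starting point.

```python
def promote_to_final(draft, resolution="2K"):
    """Copy a draft payload, changing only resolution and batch size."""
    if resolution not in ("2K", "4K"):
        raise ValueError("final resolution is expected to be 2K or 4K")
    final = dict(draft)           # shallow copy; the draft stays untouched
    final["resolution"] = resolution
    final["num_images"] = 1       # one final image instead of a batch
    return final
```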
Cinematic Nano Banana portrait (page example):
A cinematic close-up portrait of an American woman standing under neon
lights in rainy Tokyo, shallow depth of field, reflective wet streets,
ultra-detailed, realistic skin texture
Brand-asset card with quoted text (Nano Banana typography):
A minimalist 16:9 product card: a matte black ceramic mug centered on a
soft warm-grey paper background, rim highlight from upper-left, the
headline "Brewed Quietly" in clean bold sans-serif top-right, balanced
negative space below, e-commerce ready, clean studio lighting
Vertical Nano Banana platform-native:
A 9:16 vertical hero for a wellness brand: a single ceramic teacup on a
linen runner, soft morning side-light, the words "Slow Down" in
hand-drawn serif large at the top, gentle steam rising, neutral color
palette, uncluttered
What's the difference between Nano Banana 2 and Nano Banana Pro? Nano Banana 2 is the flash tier (fast, predictable, drafts and typography). Nano Banana Pro is the pro tier (slower, higher fidelity, portraits). This skill is the Nano Banana 2 variant.
What resolutions does Nano Banana 2 support? Four: 0.5K (drafts), 1K (default), 2K (final), 4K (max). 2K and 4K Nano Banana cost more.
Can Nano Banana 2 generate multiple images per call? Yes. Set num_images to a value from 1 to 4; 4 is useful for Nano Banana ideation batches.
Does Nano Banana 2 do image edit? Not on this endpoint. For Nano Banana edit (preserve subject + apply changes), use the nano-banana-edit skill.
Why is enable_web_search off by default? Nano Banana web grounding adds latency and cost. Only enable it when the prompt explicitly references current events or real entities.
What languages does Nano Banana 2 understand in the prompt? English is the most reliable for Nano Banana. Multilingual prompts work for layout/style but in-image text is best in Latin-script languages.
How do I reproduce a Nano Banana 2 output? Pass seed as a fixed int. Same prompt + same seed = same Nano Banana generation.
| code | meaning |
| --- | --- |
| 0 | Nano Banana generation succeeded |
| 64 | bad CLI args |
| 65 | bad input JSON for Nano Banana / schema mismatch |
| 69 | upstream 5xx |
| 75 | retryable: timeout / 429 |
| 77 | not signed in or token rejected |
Full reference: docs.runcomfy.com/cli/troubleshooting.
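The exit-code table lends itself to a small retry wrapper. The Python sketch below retries only code 75, the one the table marks as retryable; the backoff schedule, the attempt limit, and the names run_with_retry and describe are assumptions made for illustration, not CLI behavior.

```python
import time

# Exit codes as listed in the table above.
EXIT_CODES = {
    0: "generation succeeded",
    64: "bad CLI args",
    65: "bad input JSON / schema mismatch",
    69: "upstream 5xx",
    75: "retryable: timeout / 429",
    77: "not signed in or token rejected",
}
RETRYABLE = {75}


def describe(code):
    """Map an exit code to its meaning from the table."""
    return EXIT_CODES.get(code, f"unknown exit code {code}")


def run_with_retry(run_once, max_attempts=3, base_delay=2.0, sleep=time.sleep):
    """Call run_once() until it returns a non-retryable exit code.

    run_once is any zero-argument callable returning an int exit code,
    for example: lambda: subprocess.run(cmd).returncode
    """
    code = run_once()
    attempt = 1
    while code in RETRYABLE and attempt < max_attempts:
        sleep(base_delay * 2 ** (attempt - 1))  # exponential backoff: 2s, 4s, ...
        code = run_once()
        attempt += 1
    return code
```

Taking the runner and the sleep function as parameters keeps the wrapper testable without the CLI installed.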
The skill invokes runcomfy run google/nano-banana-2/text-to-image with a JSON body matching the Nano Banana 2 schema. The CLI POSTs to https://model-api.runcomfy.net/v1/models/google/nano-banana-2/text-to-image, polls the Nano Banana request, fetches the Nano Banana result, and downloads any .runcomfy.net / .runcomfy.com URL into --output-dir. Ctrl-C cancels the remote Nano Banana request before exit.
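The download restriction described above amounts to a host allow-list. The following Python sketch shows one way such a check could look; it is not the CLI's actual code, the https requirement is an added assumption, and is_allowed_download is a hypothetical name.

```python
from urllib.parse import urlparse

# Host suffixes named in the paragraph above.
ALLOWED_SUFFIXES = (".runcomfy.net", ".runcomfy.com")


def is_allowed_download(url):
    """True if url is https and its host is a subdomain of an allowed suffix."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    # Matching on the leading dot rejects look-alikes such as evilruncomfy.net.
    return parsed.scheme == "https" and host.endswith(ALLOWED_SUFFIXES)
```

Comparing the parsed hostname rather than the raw URL string keeps an allowed domain that appears only in a path or query from passing the check.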
runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600.
The prompt is passed as JSON via --input. No shell injection.
Network traffic goes to model-api.runcomfy.net and .runcomfy.net / .runcomfy.com.
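The 0600 mode on the token file can be verified from a script on POSIX systems. The Python sketch below checks that a file grants no permissions to group or others; token_file_is_private is a hypothetical helper, and the path in the usage comment is the one stated above.

```python
import os
import stat


def token_file_is_private(path):
    """True if the file grants no permissions to group or others (e.g. 0600)."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return (mode & 0o077) == 0


# Example:
#   token_file_is_private(os.path.expanduser("~/.config/runcomfy/token.json"))
```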