← 返回
未分类 Key 中文

Easy Image

Professional image generation assistant for workplace: PPT graphics, marketing posters, product photos, social media content. Simple description → Profession...
专业职场图像生成助手:PPT配图、营销海报、产品图片、社交媒体配图
ximasadila
未分类 clawhub v1.0.1 1 版本 100000 Key: 需要
★ 0
Stars
📥 742
下载
💾 0
安装
1
版本
#latest

概述

easy-image

Silently translate user's simple descriptions into professional prompts, call image generation APIs, return professional-grade images.

First-time Setup

If ~/.easy-image-skill/config.json not exists, guide user through 4 steps:

  1. Select Platform — Jiekou AI(recommended for China) / Novita / PPIO / OpenRouter / WaveSpeed / Google Imagen. Details: references/platforms/*.md
  2. API Key — Check ~/.{platform}/config.json, auto-detect existing key or ask user to provide
  3. Storage Path — ~/Downloads(default) / ~/Desktop / Custom. This grants blanket download authorization
  4. Frequent Scenes (optional) — PPT / Posters / Product Photos / Social Media / Avatar

Save to ~/.easy-image-skill/config.json: {"platform":"jiekou","save_path":"~/Downloads","frequent_scenes":[...]}

Workflow

1. Parse Input

Extract: scene(PPT/poster/product/social media), channel(→auto size, see Channel Mapping below), subject, style, details. If incomplete, ask only what's missing.

2. Match Personal Library

Silently check ~/.easy-image-skill/my-prompts.md for scene+keyword match. No match → use references/templates/{scene}.md.

3. Translate to Professional Prompt

Load template from references/templates/{scene}.md, fill variables, add smart defaults. If image needs text content, explicitly specify language (Chinese input→all text in Simplified Chinese characters, English→all text in English). Terminology: references/glossary.md

4. Select Model

Rules in references/model-selection.md. Summary:

  • Default: Gemini 3.1 Flash Image + Grounding (web search ON for any named entity/brand/character)
  • High quality: Gemini 3 Pro Image (complex composition + professional photography, ≥2 keyword hits)
  • Abstract only: Gemini 3.1 Flash Image without Grounding (pure color/shape descriptions)

5. Show Enhancement Summary

One line before generating: ◇ {template} | +{2-4 key enhancements added}

6. Call API

Platform details: references/platforms/{platform}.md. Hide all technical details from user. Show: ◐ Generating...

7. Save & Display

Auto-download to configured save_path (pre-authorized). Display image immediately, download in background. File naming: {scene}_{brief}_{timestamp}.png

8. Handle Feedback

Satisfied ("good"/"save"/"perfect") → async save to personal library. Adjust request → modify prompt, regenerate. Max 3 adjustment rounds.

Channel Size Mapping

ChannelRatioChannelRatio
--------------------------------
WeChat Moments1:1Xiaohongshu3:4
WeChat Video/Douyin9:16PPT/Presentation16:9
WeChat Article header2.35:1Taobao main image1:1

Config Commands

Users can say: "switch to Novita" / "my key is sk-xxx" / "save to desktop" / "show config" / "reset config"

UX Rules

  • Auto-detect language (Chinese ratio>0.3 → zh)
  • Monochrome status icons: ◇ ◐ ◉ ● (no technical details shown to user)
  • Prompts always in English; UI messages follow user language
  • Personal library saves are async and non-blocking

Reference Documents

DocPurpose
--------------
references/model-selection.mdModel selection rules & keywords
references/glossary.mdProfessional terminology
references/platforms/*.mdPlatform API configs
references/templates/*.mdScene prompt templates
examples/usage-examples.mdUsage examples

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-05-01 19:05 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Jiekou Multimodal

ximasadila
使用接口AI 执行多模态任务:文生图、图生图、文生视频、图生视频、TTS、STT。 适用于:生成图片、生成视频、文字转语音、语音识别。
★ 1 📥 598
ai-intelligence

PPIO Multimodal Skill

ximasadila
使用 PPIO 执行多模态任务:文生图、图生图、文生视频、图生视频、TTS、STT。 适用于:生成图片、生成视频、文字转语音、语音识别。
★ 1 📥 596
content-creation

Novita AI Multimodal

ximasadila
使用 Novita AI 执行多模态任务:文生图、图生图、文生视频、图生视频、语音合成与识别。用于生成图像、视频等内容。
★ 1 📥 490