image-generation

Solid

Generate or edit images from text prompts. Use when the user asks to create, draw, design, or edit an image, illustration, photo, icon, poster, or any visual content.

AI & Automation 44,972 stars 10156 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Image Generation Generate and edit images using AI models. The script automatically picks a backend based on which API keys are configured — **you don't need to specify a model unless the user explicitly names one**. Supported models (passed via `model` only when the user asks for a specific one): - **OpenAI** — `gpt-image-2`, `gpt-image-1` - **Gemini Nano Banana** — `nano-banana-2`, `nano-banana-pro`, `nano-banana` - **Seedream (Volcengine Ark)** — `seedream-5.0-lite`, `seedream-4.5` - **Qwen (DashScope)** — `qwen-image-2.0`, `qwen-image-2.0-pro` - **MiniMax** — `image-01` ## Usage Run `scripts/generate.py` with a JSON argument. The path is relative to this skill's `base_dir`. ```bash python <base_dir>/scripts/generate.py '<json_args>' ``` **Set bash timeout to at least 600 seconds**, as image generation can take 30–200s per provider, and the script may try multiple providers sequentially. ### Parameters | Parameter | Type | Required | Default | Description | |-----------|------|----------|---------|-------------| | `prompt` | string | yes | — | Image description | | `image_url` | string / list | no | null | Input image(s) for editing: local file path or URL. Multi-image fusion is supported (pass a list) | | `quality` | string | no | auto | `low` / `medium` / `high` (only some backends honour this) | | `size` | string | no | auto | `512` / `1K` / `2K` / `3K` / `4K`, or pixel value (`1024x1024`) | | `aspect_ratio` | string | no | null | `1:1` / `3:2` / `2:3` / `16:...

Details

Author
zhayujie
Repository
zhayujie/CowAgent
Created
3 years ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

1 Updated today
HGGodhand33
AI & Automation Solid

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

27,681 Updated today
davila7
AI & Automation Solid

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

2,202 Updated 1 weeks ago
foryourhealth111-pixel
AI & Automation Listed

image-generation

Generate images from text prompts using Google's Gemini model. Creates photorealistic images, illustrations, concept art, and more via the generate_image tool.

21 Updated 4 days ago
project-nomos
AI & Automation Solid

gemini-image-generator

Generate images from text prompts via Google Gemini.

391 Updated today
notque