ai-image-generator

Solid

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

AI & Automation 809 stars 82 forks Updated 2 weeks ago MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
97
Recency 20%
90
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# AI Image Generator Generate images using AI APIs (Google Gemini and OpenAI GPT). This skill teaches the prompting patterns and API mechanics for producing professional images directly from Claude Code. > **Managed alternative**: If you don't want to manage API keys, [ImageBot](https://imagebot.au) provides a managed image generation service with album templates and brand kit support. ## Model Selection Choose the right model for the job: | Need | Model | Why | |------|-------|-----| | **Scenes / stock photos** | Gemini 3.1 Flash Image | Best depth, complexity, environmental context | | **Transparent icons / logos** | GPT Image 1.5 | Native RGBA alpha channel (`background: "transparent"`) | | **Text on images** | GPT Image 1.5 | 90% accurate text rendering | | **Drafts / iteration** | Gemini 2.5 Flash Image | Free tier (~500/day) | | **Final client assets** | Gemini 3 Pro Image | Higher detail, better style consistency | ### Model IDs | Model | API ID | Provider | |-------|--------|----------| | Gemini 3.1 Flash Image | `gemini-3.1-flash-image-preview` | Google AI | | Gemini 3 Pro Image | `gemini-3-pro-image-preview` | Google AI | | Gemini 2.5 Flash Image | `gemini-2.5-flash-image` | Google AI | | GPT Image 1.5 | `gpt-image-1.5` | OpenAI | **Verify model IDs before use** — they change frequently: ```bash curl -s "https://generativelanguage.googleapis.com/v1beta/models?key=$GEMINI_API_KEY" | python3 -c "import sys,json; [print(m['name']) for m in json.load(sys.stdin)[...

Details

Author
jezweb
Repository
jezweb/claude-skills
Created
7 months ago
Last Updated
2 weeks ago
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category