gemini-image-generator

Solid

Generate images from text prompts via Google Gemini.

AI & Automation 391 stars 36 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
86
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
80
License 10%
100
Description 5%
100

Skill Content

# Gemini Image Generator Generate images from text prompts via CLI using Google Gemini APIs. Supports model selection between fast (`gemini-2.5-flash-image`) and quality (`gemini-3-pro-image-preview`) models, batch generation, watermark removal, and background transparency. ## Reference Loading Table | Signal | Load These Files | Why | |---|---|---| | tasks related to this reference | `prompts.md` | Loads detailed guidance from `prompts.md`. | ## Instructions ### Step 1: Validate Environment Verify the API key exists before any generation attempt -- a missing key produces confusing errors that waste time debugging. ```bash echo "GEMINI_API_KEY is ${GEMINI_API_KEY:+set}" ``` Expect: `GEMINI_API_KEY is set`. If not set, instruct user to configure it and stop. Verify Python dependencies are available: ```bash python3 -c "from google import genai; from PIL import Image; print('OK')" ``` If missing, install: ```bash pip install google-genai Pillow ``` Determine the output path. Always use absolute paths for output files -- relative paths break when scripts run in different working directories. Verify the parent directory exists or will be created. **Proceed only when**: API key is set, dependencies installed, output path is valid. ### Step 2: Select Model and Compose Prompt Choose the model based on the use case: | Scenario | Model | Why | |----------|-------|-----| | Iterating on prompt, drafts | `gemini-2.5-flash-image` | Fast feedback (2-5s) | | Final quality as...

Details

Author
notque
Repository
notque/vexjoy-agent
Created
2 months ago
Last Updated
today
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

image-generation

Generate images from text prompts using Google's Gemini model. Creates photorealistic images, illustrations, concept art, and more via the generate_image tool.

21 Updated 4 days ago
project-nomos
AI & Automation Listed

gemini-image-gen

Generate or edit images with Google Gemini. Alternative to DALL-E for image generation. Requires GEMINI_API_KEY.

0 Updated today
yourkenike
AI & Automation Listed

gemini-imagegen

This skill should be used when generating and editing images using the Gemini API (Nano Banana Pro). It applies when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

0 Updated today
CodeWithBehnam
AI & Automation Solid

ai-image-generator

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

809 Updated 2 weeks ago
jezweb
Data & Documents Listed

invoking-gemini

Invokes Google Gemini models for structured outputs, image generation, multi-modal tasks, and Google-specific features. Use when users request Gemini, image generation, structured JSON output, Google API integration, or cost-effective parallel processing.

124 Updated yesterday
oaustegard