gemini-image-generator

Solid

Generate images from text prompts via Google Gemini.

AI & Automation | 391 stars | 36 forks | Updated today | MIT

Quality Score: 93/100

Stars 20%
Recency 20%: 100
Frontmatter 20%
Documentation 15%: 100
Issue Health 10%
License 10%: 100
Description 5%: 100

Skill Content

# Gemini Image Generator

Generate images from text prompts via CLI using Google Gemini APIs. Supports model selection between fast (`gemini-2.5-flash-image`) and quality (`gemini-3-pro-image-preview`) models, batch generation, watermark removal, and background transparency.

## Reference Loading Table

| Signal | Load These Files | Why |
|---|---|---|
| tasks related to this reference | `prompts.md` | Loads detailed guidance from `prompts.md`. |

## Instructions

### Step 1: Validate Environment

Verify the API key exists before any generation attempt -- a missing key produces confusing errors that waste time debugging.

```bash
echo "GEMINI_API_KEY is ${GEMINI_API_KEY:+set}"
```

Expect: `GEMINI_API_KEY is set`. If not set, instruct user to configure it and stop.

Verify Python dependencies are available:

```bash
python3 -c "from google import genai; from PIL import Image; print('OK')"
```

If missing, install:

```bash
pip install google-genai Pillow
```

Determine the output path. Always use absolute paths for output files -- relative paths break when scripts run in different working directories. Verify the parent directory exists or will be created.

**Proceed only when**: API key is set, dependencies installed, output path is valid.

### Step 2: Select Model and Compose Prompt

Choose the model based on the use case:

| Scenario | Model | Why |
|----------|-------|-----|
| Iterating on prompt, drafts | `gemini-2.5-flash-image` | Fast feedback (2-5s) |
| Final quality as...
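
The three Step 1 checks (API key, dependencies, output path) could also be gathered into a single preflight script. The following is a minimal Python sketch and not part of the skill itself. `preflight` and `module_available` are hypothetical helper names. It assumes the import names `google.genai` and `PIL` match the `google-genai` and `Pillow` packages in the install command above.

```python
import importlib.util
import os
from pathlib import Path


def module_available(name: str) -> bool:
    """Return True if the module `name` can be located on this interpreter."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # A missing parent package (e.g. `google`) raises ModuleNotFoundError.
        return False


def preflight(output_path: str, env=os.environ) -> tuple[Path, list[str]]:
    """Hypothetical helper: collect all Step 1 problems before generating."""
    problems = []

    # Check 1: the API key must be present and non-empty.
    if not env.get("GEMINI_API_KEY"):
        problems.append("GEMINI_API_KEY is not set")

    # Check 2: both Python dependencies must be importable.
    for module, package in (("google.genai", "google-genai"), ("PIL", "Pillow")):
        if not module_available(module):
            problems.append(f"missing dependency: pip install {package}")

    # Check 3: turn the output path into an absolute one so it does not
    # depend on the working directory of the calling script.
    resolved = Path(output_path).expanduser().resolve()
    return resolved, problems


if __name__ == "__main__":
    path, problems = preflight("out/image.png")
    print(f"output path: {path}")
    for problem in problems:
        print(f"problem: {problem}")
```

An empty `problems` list corresponds to the "Proceed only when" condition. The sketch only reports problems: it does not install anything, and it leaves creating the parent directory to the caller.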

Details

Author: notque
Repository: notque/vexjoy-agent
Created: 2 months ago
Last Updated: today
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

image-generation

Generate images from text prompts using Google's Gemini model. Creates photorealistic images, illustrations, concept art, and more via the generate_image tool.

21 Updated 4 days ago

project-nomos

AI & Automation Listed

gemini-image-gen

Generate or edit images with Google Gemini. Alternative to DALL-E for image generation. Requires GEMINI_API_KEY.

0 Updated today

yourkenike

AI & Automation Listed

gemini-imagegen

This skill should be used when generating and editing images using the Gemini API (Nano Banana Pro). It applies when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

0 Updated today

CodeWithBehnam

AI & Automation Solid

ai-image-generator

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

809 Updated 2 weeks ago

jezweb

Data & Documents Listed

invoking-gemini

Invokes Google Gemini models for structured outputs, image generation, multi-modal tasks, and Google-specific features. Use when users request Gemini, image generation, structured JSON output, Google API integration, or cost-effective parallel processing.

124 Updated yesterday

oaustegard