invoking-gemini

Solid

Invokes Google Gemini models for structured outputs, image generation, multi-modal tasks, and Google-specific features. Use when users request Gemini, image generation, structured JSON output, Google API integration, or cost-effective parallel processing.

Data & Documents 134 stars 7 forks Updated yesterday MIT

Install

View on GitHub

Quality Score: 84/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Invoking Gemini Delegate tasks to Google's Gemini models when they offer advantages over Claude. ## When to Use Gemini **Image generation:** - Blog header images, illustrations, diagrams - Style-guided image creation (risograph, editorial, etc.) - Text rendering in images **Structured outputs:** - JSON Schema validation with property ordering guarantees - Pydantic model compliance - Strict schema adherence (enum values, required fields) **Cost optimization:** - Parallel batch processing (Gemini 3 Flash is lightweight) - High-volume simple tasks **Multi-modal tasks:** - Image analysis with JSON output - Video processing - Audio transcription with structure ## Setup ```bash uv pip install requests pydantic ``` **Credentials — Option A (recommended): Cloudflare AI Gateway** Source `/mnt/project/proxy.env` with `CF_ACCOUNT_ID`, `CF_GATEWAY_ID`, `CF_API_TOKEN`. Requests route through Cloudflare AI Gateway, bypassing IP blocks. Google API key stored in gateway via BYOK. **Credentials — Option B: Direct Google API** If no `proxy.env`, falls back to direct: `GOOGLE_API_KEY.txt` or `API_CREDENTIALS.json`. ## Image Generation Generate images using Gemini's native image models. This is the primary way to create illustrations, blog headers, diagrams, and visual content. ### Quick Start ```python import sys sys.path.append('/mnt/skills/user/invoking-gemini/scripts') from gemini_client import generate_image # One call — returns {"path": "...", "caption": "..."} or None r...

Details

Author: oaustegard
Repository: oaustegard/claude-skills
Created: 9 months ago
Last Updated: yesterday
Language: Python
License: MIT

Integrates with

Cloudflare · Cloud

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

gemini-image

Generate and edit images with Google's Gemini models (Nano Banana / Gemini 3 Pro Image / 3.1 Flash Image) from a zero-dependency Python CLI. Use whenever the user wants to create, generate, edit, restyle, or compose an image with Gemini / Google AI; produce listing photos, mockups, banners, icons, or marketing visuals via Gemini; or attach reference images for image-to-image editing. Also handles text/conversation/model-listing — it is the maintained successor to the older `gemini-client` plugin. Triggers: "generate an image with Gemini", "use Google AI to make a picture", "Gemini image", "nano banana", "edit this image with Gemini", "Imagen".

0 Updated today

BryceEWatson

AI & Automation Listed

gemini-image-gen

Generate or edit images with Google Gemini. Alternative to DALL-E for image generation. Requires GEMINI_API_KEY.

1 Updated yesterday

yourkenike

AI & Automation Listed

gemini-cli

Wield Google's Gemini CLI as a powerful auxiliary tool for code generation, review, analysis, and web research. Use when tasks benefit from a second AI perspective, current web information via Google Search, codebase architecture analysis, or parallel code generation. Also use when user explicitly requests Gemini operations.

4 Updated yesterday

Junayedahmedd