vertex-ai-media-master

Solid

Automatic activation for ALL Google Vertex AI multimodal operations - video processing, audio generation, image creation, and marketing campaigns. **TRIGGER PHRASES:** - "vertex ai", "gemini multimodal", "process video", "generate audio", "create images", "marketing campaign" - "imagen", "video understanding", "multimodal", "content generation", "media assets" **AUTO-INVOKES FOR:** - Video processing and understanding (up to 6 hours) - Audio generation and transcription - Image generation with Imagen 4 - Marketing campaign automation - Social media content creation - Ad creative generation - Multimodal content workflows

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Vertex AI Media Master - Comprehensive Multimodal AI Operations This Agent Skill provides comprehensive mastery of Google Vertex AI multimodal capabilities for video, audio, image, and text processing with focus on marketing applications. ## Core Capabilities ### ๐ŸŽฅ Video Processing (Gemini 2.0/2.5) - **Video Understanding**: Process videos up to 6 hours at low resolution or 2 hours at default resolution - **2M Context Window**: Gemini 2.5 Pro handles massive video content - **Audio Track Processing**: Automatic audio transcription from video - **Multi-video Analysis**: Process multiple videos in single request - **Video Summarization**: Extract key moments, scenes, and insights - **Marketing Use Cases**: - Analyze competitor video ads - Extract highlights from long-form content - Generate video summaries for social media - Transcribe and caption video content - Identify brand mentions and product placements ### ๐ŸŽต Audio Generation & Processing - **Lyria Model (2025)**: Native audio and music generation - **Speech-to-Text**: Transcribe audio with speaker diarization - **Text-to-Speech**: Generate natural voiceovers - **Music Composition**: Background music for campaigns - **Audio Enhancement**: Noise reduction and quality improvement - **Marketing Use Cases**: - Generate podcast scripts and voiceovers - Create audio ads and radio spots - Produce background music for video campaigns - Transcribe customer interviews - Generate multilingual voiceovers ...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content โ€” not just same category

Data & Documents Listed

media-processor

Process multimedia content โ€” audio transcription, video analysis, PDF data extraction, image generation. Use for deeper image analysis when implementing from UI designs, analyzing charts for data, reading dense screenshots, or studying artworks and visual references.

64 Updated 2 weeks ago
avibebuilder
AI & Automation Solid

vertex-ai-pipeline-creator

Create vertex ai pipeline creator operations. Auto-activating skill for GCP Skills. Triggers on: vertex ai pipeline creator, vertex ai pipeline creator Part of the GCP Skills skill category. Use when working with vertex ai pipeline creator functionality. Trigger with phrases like "vertex ai pipeline creator", "vertex creator", "vertex".

2,266 Updated today
jeremylongshore
AI & Automation Featured

vertex-agent-builder

Build and deploy production-ready generative AI agents using Vertex AI, Gemini models, and Google Cloud infrastructure with RAG, function calling, and multi-modal capabilities. Use when appropriate context detected. Trigger with relevant phrases based on skill purpose.

2,266 Updated today
jeremylongshore
AI & Automation Solid

vertex-infra-expert

Terraform infrastructure specialist for Vertex AI services and Gemini deployments. Provisions Model Garden, endpoints, vector search, pipelines, and enterprise AI infrastructure. Triggers: "vertex ai terraform", "gemini deployment terraform", "model garden infrastructure", "vertex ai endpoints"

2,266 Updated today
jeremylongshore
AI & Automation Listed

super-claudiomedia-content-creation

Media content creation skill. Use when the user wants to create, generate, or produce any kind of video, audio, or image. This is the main skill for all media generation tasks. Trigger on video: "I want to make a video", "create a TikTok video", "generate a realistic video", "make a promo video", "animate my photo", "create a video ad", "Remotion", "Higgsfield", "Kling", "Seedance", "Weavy AI", "Hailuo". Trigger on audio: "read this article aloud", "create a voiceover", "text to speech", "TTS", "generate narration in Portuguese", "background music", "create a jingle", "ElevenLabs", "Francisca Neural", "Suno", "Udio", "audio summary". Trigger on image: "generate an image", "create a graphic", "make a diagram", "draw X", "generate a photo of Y", "make an infographic", "Midjourney", "DALL-E", "Flux", "Napkin.ai", "Nano Banana 2", "animate a static image". Also triggers for: marketing creatives, social media visuals, product photos, content creator tools.

1 Updated today
toolbox-playground