groq-core-workflow-b

Featured

Execute Groq secondary workflows: audio transcription (Whisper), vision, text-to-speech, and batch model evaluation. Trigger with phrases like "groq whisper", "groq transcription", "groq audio", "groq vision", "groq TTS", "groq speech".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Groq Core Workflow B: Audio, Vision & Speech ## Overview Beyond chat completions, Groq provides ultra-fast audio transcription (Whisper at 216x real-time), multimodal vision (Llama 4 Scout/Maverick), and text-to-speech. These endpoints use the same `groq-sdk` client. ## Prerequisites - `groq-sdk` installed, `GROQ_API_KEY` set - For audio: audio files in supported formats - For vision: image URLs or base64 images ## Audio Models | Model ID | Languages | Speed | Best For | |----------|-----------|-------|----------| | `whisper-large-v3` | 100+ | 164x real-time | Best accuracy, multilingual | | `whisper-large-v3-turbo` | 100+ | 216x real-time | Best speed/accuracy balance | **Supported audio formats**: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm ## Instructions ### Step 1: Audio Transcription (Whisper) ```typescript import Groq from "groq-sdk"; import fs from "fs"; const groq = new Groq(); // Transcribe audio file async function transcribe(filePath: string): Promise<string> { const transcription = await groq.audio.transcriptions.create({ file: fs.createReadStream(filePath), model: "whisper-large-v3-turbo", response_format: "json", // or "text" or "verbose_json" language: "en", // Optional: ISO 639-1 code }); return transcription.text; } // With timestamps (verbose mode) async function transcribeWithTimestamps(filePath: string) { const transcription = await groq.audio.transcriptions.create({ file: fs.createReadS...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

groq-core-workflow-a

Execute Groq primary workflow: chat completions with tool use and JSON mode. Use when implementing chat interfaces, function calling, structured output, or building AI features with Groq's fast inference. Trigger with phrases like "groq chat completion", "groq tool use", "groq function calling", "groq JSON mode".

2,266 Updated today
jeremylongshore
AI & Automation Listed

groqcloud-automation

Automate AI inference, chat completions, audio translation, and TTS voice management through GroqCloud's high-performance API via Composio

62,564 Updated 1 weeks ago
ComposioHQ
AI & Automation Listed

video-transcribe

Video/audio transcription, visual frame analysis, Groq Whisper long-form transcripts, timestamped Obsidian notes, and keyframe-based visual summaries. Use for video links, audio links, 字幕/转录/视频总结/画面分析/图文笔记, especially when the result must replace watching the video. Keywords: video, transcribe, 转录, 视频, 音频, audio, subtitle, 字幕, summary, 总结, 图文笔记, 视频内容, 画面分析, visual analysis, keyframe, whisper, groq, yt-dlp

7 Updated today
codingSamss
AI & Automation Listed

speech-to-text

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation, multi-language, timestamps. Use for: meeting transcription, subtitles, podcast transcripts, voice notes. Triggers: speech to text, transcription, whisper, audio to text, transcribe audio, voice to text, stt, automatic transcription, subtitles generation, transcribe meeting, audio transcription, whisper ai

335 Updated today
aiskillstore
AI & Automation Featured

groq-performance-tuning

Optimize Groq API performance with model selection, caching, streaming, and parallel requests. Use when experiencing slow responses, implementing caching strategies, or optimizing request throughput for Groq integrations. Trigger with phrases like "groq performance", "optimize groq", "groq latency", "groq caching", "groq slow", "groq speed".

2,266 Updated today
jeremylongshore