deepgram-performance-tuning

Featured

Optimize Deepgram API performance for faster transcription and lower latency. Use when improving transcription speed, reducing latency, or optimizing audio processing pipelines. Trigger: "deepgram performance", "speed up deepgram", "optimize transcription", "deepgram latency", "deepgram faster", "deepgram throughput".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Deepgram Performance Tuning ## Overview Optimize Deepgram transcription performance through audio preprocessing with ffmpeg, model selection for speed vs accuracy, streaming for large files, parallel processing, result caching, and connection reuse. Targets: <2s latency for short files, 100+ files/minute batch throughput. ## Performance Levers | Factor | Impact | Default | Optimized | |--------|--------|---------|-----------| | Audio format | High | Any format | 16kHz mono WAV | | Model | High | nova-3 | base (speed) or nova-3 (accuracy) | | File size | High | Full file sync | Stream >60s, callback >5min | | Concurrency | Medium | Sequential | 50 parallel (p-limit) | | Caching | Medium | None | Redis hash by audio+options | | Features | Medium | All enabled | Disable unused (diarize, utterances) | ## Instructions ### Step 1: Audio Preprocessing with ffmpeg ```bash # Optimal format for Deepgram: 16kHz, 16-bit, mono, WAV ffmpeg -i input.mp3 \ -ar 16000 \ # 16kHz sample rate (ideal for speech) -ac 1 \ # Mono channel -acodec pcm_s16le \ # 16-bit signed LE PCM -f wav \ output.wav # Remove silence (saves API cost + processing time) ffmpeg -i input.wav \ -af "silenceremove=stop_periods=-1:stop_duration=0.5:stop_threshold=-30dB" \ -ar 16000 -ac 1 -acodec pcm_s16le \ trimmed.wav # Noise reduction + normalization ffmpeg -i input.wav \ -af "highpass=f=200,lowpass=f=3000,loudnorm=I=-16:TP=-1.5:LRA=11" \ -ar 16000 -ac 1 -acodec pcm_s16l...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

deepgram-core-workflow-a

Implement production pre-recorded speech-to-text with Deepgram. Use when building audio transcription, batch processing, or implementing diarization and intelligence features. Trigger: "deepgram transcription", "speech to text", "transcribe audio", "batch transcription", "deepgram nova", "diarize audio".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-cost-tuning

Optimize Deepgram costs and usage for budget-conscious deployments. Use when reducing transcription costs, implementing usage controls, or optimizing pricing tier utilization. Trigger: "deepgram cost", "reduce deepgram spending", "deepgram pricing", "deepgram budget", "optimize deepgram usage", "deepgram billing".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-reference-architecture

Implement Deepgram reference architecture for scalable transcription systems. Use when designing transcription pipelines, building production architectures, or planning Deepgram integration at scale. Trigger: "deepgram architecture", "transcription pipeline", "deepgram system design", "deepgram at scale", "enterprise deepgram", "deepgram queue".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-migration-deep-dive

Deep dive into migrating to Deepgram from other transcription providers. Use when migrating from AWS Transcribe, Google Cloud STT, Azure Speech, OpenAI Whisper, AssemblyAI, or Rev.ai to Deepgram. Trigger: "deepgram migration", "switch to deepgram", "migrate transcription", "deepgram from AWS", "deepgram from Google", "replace whisper with deepgram".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-hello-world

Create a minimal working Deepgram transcription example. Use when starting a new Deepgram integration, testing your setup, or learning basic Deepgram API patterns. Trigger: "deepgram hello world", "deepgram example", "deepgram quick start", "simple transcription", "transcribe audio".

2,266 Updated today
jeremylongshore