deepgram-core-workflow-b

Featured

Implement real-time streaming transcription with Deepgram WebSocket. Use when building live transcription, voice interfaces, real-time captioning, or voice AI applications. Trigger: "deepgram streaming", "real-time transcription", "live transcription", "websocket transcription", "voice streaming", "deepgram live".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Deepgram Core Workflow B: Live Streaming Transcription ## Overview Real-time streaming transcription using Deepgram's WebSocket API. The SDK manages the WebSocket connection via `listen.live()`. Covers microphone capture, interim/final result handling, speaker diarization, UtteranceEnd detection, auto-reconnect, and building an SSE endpoint for browser clients. ## Prerequisites - `@deepgram/sdk` installed, `DEEPGRAM_API_KEY` configured - Audio source: microphone (via Sox/`rec`), file stream, or WebSocket audio from browser - For mic capture: `sox` installed (`apt install sox` / `brew install sox`) ## Instructions ### Step 1: Basic Live Transcription ```typescript import { createClient, LiveTranscriptionEvents } from '@deepgram/sdk'; const deepgram = createClient(process.env.DEEPGRAM_API_KEY!); const connection = deepgram.listen.live({ model: 'nova-3', language: 'en', smart_format: true, punctuate: true, interim_results: true, // Show in-progress results utterance_end_ms: 1000, // Silence threshold for utterance end vad_events: true, // Voice activity detection events encoding: 'linear16', // 16-bit PCM sample_rate: 16000, // 16 kHz channels: 1, // Mono }); // Connection lifecycle events connection.on(LiveTranscriptionEvents.Open, () => { console.log('WebSocket connected to Deepgram'); }); connection.on(LiveTranscriptionEvents.Close, () => { console.log('WebSocket closed'); }); connection.on(Li...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

deepgram-core-workflow-a

Implement production pre-recorded speech-to-text with Deepgram. Use when building audio transcription, batch processing, or implementing diarization and intelligence features. Trigger: "deepgram transcription", "speech to text", "transcribe audio", "batch transcription", "deepgram nova", "diarize audio".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-webhooks-events

Implement Deepgram callback and webhook handling for async transcription. Use when implementing callback URLs, processing async transcription results, or handling Deepgram event notifications. Trigger: "deepgram callback", "deepgram webhook", "async transcription", "deepgram events", "deepgram notifications", "deepgram async".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-hello-world

Create a minimal working Deepgram transcription example. Use when starting a new Deepgram integration, testing your setup, or learning basic Deepgram API patterns. Trigger: "deepgram hello world", "deepgram example", "deepgram quick start", "simple transcription", "transcribe audio".

2,266 Updated today
jeremylongshore
AI & Automation Featured

assemblyai-core-workflow-b

Execute AssemblyAI streaming transcription and LeMUR workflows. Use when implementing real-time speech-to-text, live captions, voice agents, or LLM-powered audio analysis with LeMUR. Trigger with phrases like "assemblyai streaming", "assemblyai real-time", "assemblyai live transcription", "assemblyai LeMUR", "assemblyai summarize audio".

2,266 Updated today
jeremylongshore
AI & Automation Featured

deepgram-reference-architecture

Implement Deepgram reference architecture for scalable transcription systems. Use when designing transcription pipelines, building production architectures, or planning Deepgram integration at scale. Trigger: "deepgram architecture", "transcription pipeline", "deepgram system design", "deepgram at scale", "enterprise deepgram", "deepgram queue".

2,266 Updated today
jeremylongshore