azure-ai-transcription-py

Solid

Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization. Triggers: "transcription", "speech to text", "Azure AI Transcription", "TranscriptionClient".

AI & Automation · 2,418 stars · 270 forks · Updated yesterday · MIT

Quality Score: 94/100

Stars (weight 20%): 100
Recency (weight 20%): 100
Frontmatter (weight 20%): 70
Documentation (weight 15%): 74
Issue Health (weight 10%): 50
License (weight 10%): 100
Description (weight 5%): 100

Skill Content

# Azure AI Transcription SDK for Python

Client library for Azure AI Transcription (speech-to-text) with real-time and batch transcription.

## Installation

```bash
pip install azure-ai-transcription
```

## Environment Variables

```bash
TRANSCRIPTION_ENDPOINT=https://<resource>.cognitiveservices.azure.com
TRANSCRIPTION_KEY=<your-key>
```

## Authentication

Use subscription key authentication (DefaultAzureCredential is not supported for this client). Wrap the key in `AzureKeyCredential` rather than passing the raw string:

```python
import os

from azure.ai.transcription import TranscriptionClient
from azure.core.credentials import AzureKeyCredential

# Read the endpoint and key from the environment variables defined above.
client = TranscriptionClient(
    endpoint=os.environ["TRANSCRIPTION_ENDPOINT"],
    credential=AzureKeyCredential(os.environ["TRANSCRIPTION_KEY"]),
)
```

## Transcription (Batch)

```python
# Start a batch job for audio stored at a URL, with speaker diarization enabled.
job = client.begin_transcription(
    name="meeting-transcription",
    locale="en-US",
    content_urls=["https://<storage>/audio.wav"],
    diarization_enabled=True,
)

# Block until the job finishes, then inspect its final status.
result = job.result()
print(result.status)
```

## Transcription (Real-time)

```python
# Open a streaming session and send a local audio file to it.
stream = client.begin_stream_transcription(locale="en-US")
stream.send_audio_file("audio.wav")

# Print recognized text as events arrive.
for event in stream:
    print(event.text)
```

## Best Practices

1. **Enable diarization** when multiple speakers are present
2. **Use batch transcription** for long files stored in blob storage
3. **Capture timestamps** for subtitle generation
4. **Specify language** to improve recognition accuracy
5. **Handle streaming backpressure** for real-time transcription
6. **Close transcription sessions** when complete
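
The timestamp best practice above can be illustrated with a small, SDK-independent sketch that turns timestamped phrases into SubRip (SRT) subtitle text. The `Segment` type, its field names, and the millisecond units are assumptions made for illustration; they are not part of the SDK, and the fields of a real transcription result would need to be mapped onto them.

```python
from typing import Iterable, NamedTuple


class Segment(NamedTuple):
    """Hypothetical timestamped phrase (not an SDK type)."""
    start_ms: int  # assumed: offset from the start of the audio, in milliseconds
    end_ms: int    # assumed: end offset, in milliseconds
    text: str


def format_srt_time(ms: int) -> str:
    """Format a millisecond offset as an SRT timestamp, HH:MM:SS,mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_srt_time(seg.start_ms)} --> {format_srt_time(seg.end_ms)}\n"
            f"{seg.text}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Segment(0, 1500, "Hello"), Segment(1500, 3000, "World")]
    print(to_srt(demo))
```

Keeping the conversion separate from the client call means it can be unit-tested without network access and reused for either batch or streaming results once they are mapped to `Segment`.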

Details

Author: microsoft
Repository: microsoft/skills
Created: 4 months ago
Last Updated: yesterday
Language: TypeScript
License: MIT
