audio-transcriber

Featured

Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration

AI & Automation 39,227 stars 6374 forks Updated today MIT

Install

Quality Score: 99/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

70

Documentation 15%

100

Issue Health 10%

50

License 10%

100

Description 5%

100

Skill Content

## Purpose This skill automates audio-to-text transcription with professional Markdown output, extracting rich technical metadata (speakers, timestamps, language, file size, duration) and generating structured meeting minutes and executive summaries. It uses Faster-Whisper or Whisper with zero configuration, working universally across projects without hardcoded paths or API keys. Inspired by tools like Plaud, this skill transforms raw audio recordings into actionable documentation, making it ideal for meetings, interviews, lectures, and content analysis. ## When to Use Invoke this skill when: - User needs to transcribe audio/video files to text - User wants meeting minutes automatically generated from recordings - User requires speaker identification (diarization) in conversations - User needs subtitles/captions (SRT, VTT formats) - User wants executive summaries of long audio content - User asks variations of "transcribe this audio", "convert audio to text", "generate meeting notes from recording" - User has audio files in common formats (MP3, WAV, M4A, OGG, FLAC, WEBM) ## Workflow ### Step 0: Discovery (Auto-detect Transcription Tools) **Objective:** Identify available transcription engines without user configuration. **Actions:** Run detection commands to find installed tools: ```bash # Check for Faster-Whisper (preferred - 4-5x faster) if python3 -c "import faster_whisper" 2>/dev/null; then TRANSCRIBER="faster-whisper" echo "✅ Faster-Whisper detected (op...

Details

Author: sickn33
Repository: sickn33/antigravity-awesome-skills
Created: 4 months ago
Last Updated: today
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

audio-transcriber

Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration

335 Updated today

Data & Documents Listed

whisper-extract

Extract, transcribe, and summarize audio or video files using OpenAI Whisper. Use this skill whenever the user wants to transcribe audio or video, extract what was said in a recording, get a transcript of a meeting/interview/lecture/podcast, or generate a summary of spoken content. Also trigger when the user mentions files like .mp3, .mp4, .wav, .m4a, .ogg, .flac, .webm, .mkv, .mov and wants text out of them. Generates a .md file with an AI summary followed by the full literal transcript. También se activa en castellano: "transcribir", "transcripción", "extraer audio", "qué dice este audio", "transcribir reunión", "transcribir entrevista", "pasar audio a texto", "resumir grabación", "transcribir este video", "extraer texto de audio", "transcript", "whisper", "grabar y resumir".

0 Updated 1 weeks ago

Data & Documents Listed

meeting-auto-summary

Transcribe a meeting audio/video file, optionally separate speakers, generate subtitle/transcript files, optionally translate outputs, and produce polished summary.md/report.md files in the requested output folder. Use when the user wants meeting media converted into subtitles, transcript, and meeting notes.

1 Updated 4 days ago

AI & Automation Solid

meeting-analyzer

Analyzes meeting transcripts and recordings to surface behavioral patterns, communication anti-patterns, and actionable coaching feedback. Use this skill whenever the user uploads or points to meeting transcripts (.txt, .md, .vtt, .srt, .docx), asks about their communication habits, wants feedback on how they run meetings, requests speaking ratio analysis, mentions filler words or conflict avoidance, or wants to compare their communication across time periods. Also trigger when users mention tools like Granola, Otter, Fireflies, or Zoom transcripts. Even if the user just says "look at my meetings" or "how do I come across in meetings" — use this skill.

16,642 Updated yesterday

Data & Documents Listed

transcribing-meeting-recordings

Use when converting a meeting recording (mp4/mov/m4a/wav audio or video) into a timestamped SRT/VTT transcript, especially Chinese or multilingual audio. Triggers include 'transcribe this meeting', 'generate captions', 'convert recording to subtitles', '生成字幕', '会议转文字', 'speaker labels', 'who said what', or requests for SRT/transcript files. Supports automatic speaker identification via pyannote diarization combined with video-frame analysis of Microsoft Teams or Zoom active-speaker highlighting to map anonymized SPEAKER_XX clusters onto real participant names. Do NOT use for live/streaming transcription, lecture summarization without a transcript artifact, or text-to-speech.

0 Updated 3 days ago