server

Solid

Start/stop Kokoro TTS HTTP server. TRIGGERS - start tts server, kokoro server, tts http, stop tts server.

AI & Automation 32 stars 3 forks Updated 1 month ago MIT

Quality Score: 73/100

Stars (weight 20%): 51
Recency (weight 20%): 75
Frontmatter (weight 20%): 70
Documentation (weight 15%): 100
Issue Health (weight 10%): 50
License (weight 10%): 100
Description (weight 5%): 100

Skill Content

# Kokoro TTS Server

Manage the Kokoro TTS HTTP server, an OpenAI-compatible `/v1/audio/speech` endpoint on localhost:8779.

> **Self-Evolving Skill**: This skill improves through use. If instructions are wrong, parameters drifted, or a workaround was needed, fix this file immediately, don't defer. Only update for real, reproducible issues.

## Server Overview

The server provides:

- `GET /health`: Health status JSON
- `GET /v1/models`: List available models
- `POST /v1/audio/speech`: Synthesize text to audio (WAV, MP3, Opus, PCM)

## Quick Start

### Start (foreground, for testing)

```bash
~/.local/share/kokoro/.venv/bin/python ~/.local/share/kokoro/tts_server.py
```

### Start (launchd, for production)

Per the macOS launchd policy, the launchd plist must launch a compiled Swift binary (not a bash script). Guide the user through:

1. Compile Swift launcher binary
2. Create launchd plist at `~/Library/LaunchAgents/com.terryli.kokoro-tts-server.plist`
3. Bootstrap: `launchctl bootstrap gui/$(id -u) ~/Library/LaunchAgents/com.terryli.kokoro-tts-server.plist`

### Stop

```bash
launchctl bootout gui/$(id -u) ~/Library/LaunchAgents/com.terryli.kokoro-tts-server.plist
```

### Verify

```bash
curl -s http://127.0.0.1:8779/health | python3 -m json.tool
```

Expected: `{"status": "ok", "provider": "kokoro-tts-mlx", "model": "mlx-community/Kokoro-82M-bf16", "device": "mlx-metal"}`

## Environment Variables

| Variable | Default | Purpose |
| -...
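
The health check and speech endpoint listed in the skill could be exercised from Python roughly as follows. This is a minimal sketch, not part of the skill file: it assumes the server accepts the OpenAI-style request body (`model`, `input`, `voice`, `response_format`), and the helper names, the default model id (borrowed from the expected `/health` output), and the example voice `af_heart` are illustrative assumptions that may not match the actual server.

```python
import json
import urllib.request

# Host and port as stated in the skill text.
BASE_URL = "http://127.0.0.1:8779"


def health_ok(body: str) -> bool:
    """Return True when a /health response body reports status "ok"."""
    try:
        return json.loads(body).get("status") == "ok"
    except (json.JSONDecodeError, AttributeError):
        # Not JSON, or JSON that is not an object.
        return False


def build_speech_request(text, voice, response_format="wav",
                         model="mlx-community/Kokoro-82M-bf16"):
    """Build a POST request for /v1/audio/speech.

    Field names follow the OpenAI speech API; whether this server
    honors each of them is an assumption.
    """
    payload = {
        "model": model,
        "input": text,
        "voice": voice,  # hypothetical voice name supplied by the caller
        "response_format": response_format,
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, voice, out_path, response_format="wav", timeout=60.0):
    """Send the request and write the returned audio bytes to out_path."""
    req = build_speech_request(text, voice, response_format)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return len(audio)
```

With the server running, a call such as `synthesize("Hello", "af_heart", "hello.wav")` would be expected to write the synthesized audio to `hello.wav`. Keeping request construction separate from the network call makes the payload easy to inspect without a live server.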

Details

Author: terrylica
Repository: terrylica/cc-skills
Created: 5 months ago
Last Updated: 1 month ago
Language: TypeScript
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

audiomind

Tired of juggling multiple audio APIs? This skill gives you one-command access to TTS, music generation, sound effects, and voice cloning. Use when you want to generate any audio without managing multiple API keys.

25 Updated 2 months ago
wells1137
AI & Automation Featured

narrator-ai-cli

Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 93 movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.

667 Updated 1 month ago
GridLtd-ProductDev
Data & Documents Listed

speak

Speak a message out loud in the Star Trek Computer Voice through an external TTS service

17 Updated 3 months ago
jarombouts
AI & Automation Solid

opengork

Uncensored AI agent with dual backend support. Use Local Heretic mode (Ollama) for 100% uncensored responses, or xAI API for cloud-based inference with partial filtering. Modes: heretic (uncensored), savage (brutal honesty), based (raw opinions), genius (expert analysis), chaos (wild creativity).

24 Updated 2 months ago
diagnostikon
AI & Automation Solid

tlive

IM bridge for AI coding tools — chat with Claude Code / Codex from Telegram, Discord, or Feishu. Approve permissions, get streaming responses, manage sessions from your phone. Use for: starting IM bridge, configuring IM platforms, checking status, diagnosing issues. Trigger phrases: "tlive", "IM bridge", "消息桥接", "手机交互", "启动桥接", "连接飞书", "连接Telegram", "诊断", "查看日志", "配置". Do NOT use for: building bots, webhook integrations, or general coding tasks.

199 Updated 1 month ago
y49