together-deploy-integration

Solid

Together AI deploy integration for inference, fine-tuning, and model deployment. Use when working with Together AI's OpenAI-compatible API. Trigger: "together deploy integration".

AI & Automation 2,266 stars 315 forks Updated today MIT


Quality Score: 99/100

Stars (weight 20%): 100
Recency (weight 20%): 100
Frontmatter (weight 20%): 70
Documentation (weight 15%): 100
Issue Health (weight 10%): 50
License (weight 10%): 100
Description (weight 5%): 100

Skill Content

# Together AI Deploy Integration

## Overview

Deploy a containerized Together AI inference integration service with Docker. This skill covers building a production image that connects to Together's OpenAI-compatible API for running completions, embeddings, and image generation across 100+ open-source models. It includes environment configuration for model selection and batch processing, health checks that verify API key validity and model availability, and rolling update strategies for zero-downtime deployments serving real-time inference requests.

## Docker Configuration

```dockerfile
FROM python:3.12-slim AS builder
WORKDIR /app
COPY requirements.txt ./
RUN pip install --no-cache-dir -r requirements.txt

FROM python:3.12-slim
RUN groupadd -r app && useradd -r -g app app
WORKDIR /app
COPY --from=builder /usr/local/lib/python3.12/site-packages /usr/local/lib/python3.12/site-packages
COPY --from=builder /usr/local/bin /usr/local/bin
COPY src/ ./src/
USER app
EXPOSE 8000
# python:3.12-slim does not ship curl, so the probe uses the Python standard library.
HEALTHCHECK --interval=30s --timeout=5s --retries=3 \
  CMD python -c "import urllib.request; urllib.request.urlopen('http://localhost:8000/health', timeout=4)" || exit 1
CMD ["python", "src/server.py"]
```

## Environment Variables

```bash
export TOGETHER_API_KEY="tog_xxxxxxxxxxxx"
export TOGETHER_BASE_URL="https://api.together.xyz/v1"
export TOGETHER_DEFAULT_MODEL="meta-llama/Llama-3.1-8B-Instruct"
export TOGETHER_MAX_TOKENS="2048"
export LOG_LEVEL="info"
export PORT="8000"
```

## Health Check Endpoint

```typescript
import express from 'express';
const app = express()...
```
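
The overview says the health check verifies API key validity and model availability. Since the image runs `src/server.py`, here is a minimal Python sketch of that logic rather than the author's implementation. The names `Settings`, `load_settings`, `fetch_models`, and `check_health` are hypothetical, as are the status strings. It assumes the base URL exposes a bearer-authenticated `GET /models` listing in the OpenAI-compatible style, and that the body is either a bare list of models or an object with a `data` list.

```python
import json
import os
import urllib.error
import urllib.request
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    api_key: str
    base_url: str
    default_model: str


def load_settings(env=os.environ) -> Settings:
    """Read the TOGETHER_* variables; only the API key is treated as required."""
    api_key = env.get("TOGETHER_API_KEY", "")
    if not api_key:
        raise RuntimeError("TOGETHER_API_KEY is not set")
    return Settings(
        api_key=api_key,
        base_url=env.get("TOGETHER_BASE_URL", "https://api.together.xyz/v1").rstrip("/"),
        default_model=env.get("TOGETHER_DEFAULT_MODEL", "meta-llama/Llama-3.1-8B-Instruct"),
    )


def fetch_models(settings: Settings, timeout: float = 4.0):
    """GET {base_url}/models with a bearer token and return the parsed JSON body."""
    request = urllib.request.Request(
        f"{settings.base_url}/models",
        headers={"Authorization": f"Bearer {settings.api_key}"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def check_health(settings: Settings, fetch=fetch_models) -> dict:
    """Report whether the key is accepted and the default model is listed."""
    try:
        payload = fetch(settings)
    except urllib.error.HTTPError as exc:
        # 401/403 means the key was rejected; other codes are upstream problems.
        reason = "invalid_api_key" if exc.code in (401, 403) else f"http_{exc.code}"
        return {"status": "unhealthy", "reason": reason}
    except (urllib.error.URLError, TimeoutError) as exc:
        return {"status": "unhealthy", "reason": f"unreachable: {exc}"}

    # Assumption: the body is either a bare list of models or {"data": [...]}.
    models = payload.get("data", []) if isinstance(payload, dict) else payload
    ids = {m.get("id") for m in models if isinstance(m, dict)}
    if settings.default_model not in ids:
        return {"status": "degraded", "reason": "default_model_unavailable"}
    return {"status": "healthy", "model": settings.default_model}
```

A `/health` route could call `check_health(load_settings())` and return HTTP 200 only for `"healthy"`, so the Docker `HEALTHCHECK` fails when the key is rejected. Passing `fetch` as a parameter keeps the check testable without network access.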

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT
