model-deployment

Featured

Generates a Jupyter notebook that deploys fine-tuned models from SageMaker Serverless Model Customization to SageMaker endpoints or Bedrock. Use when the user says "deploy my model", "create an endpoint", "make it available", or asks about deployment options. Identifies the correct deployment pathway (Nova vs OSS), generates deployment code, and handles endpoint configuration.

DevOps & Infrastructure 753 stars 107 forks Updated 4 days ago Apache-2.0

Install

View on GitHub

Quality Score: 95/100

Stars 20%
96
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Model Deployment Identifies the correct deployment pathway based on model characteristics and generates deployment code. ## Scope This skill supports deploying Nova and OSS models that were fine-tuned through **SageMaker Serverless Model Customization** only. **Not supported:** - Base models (not fine-tuned) - Models fine-tuned through other processes - Full Fine-Tuning (FFT) — only LoRA fine-tuned models are supported ## Principles 1. **One thing at a time.** Each response advances exactly one decision. 2. **Confirm before proceeding.** Wait for the user to agree before moving on. But don't re-ask questions already answered in the conversation — use what you know. 3. **Don't read files until you need them.** Only read pathway references after the pathway is confirmed. 4. **Use what you know.** If conversation history or artifacts already answer a question, confirm your understanding instead of asking again. ## Workflow ### Step 1: Identify the Training Job You need the training job name or ARN. Check the conversation history first — the user may have already mentioned it, or it may be available from earlier steps in the workflow (e.g., fine-tuning). If not, ask the user. Once you have the training job name or ARN, use the AWS MCP tool to look it up: 1. Use the AWS MCP tool `describe-training-job` and extract: - **S3 output path** (from `ModelArtifacts.S3ModelArtifacts` or `OutputDataConfig.S3OutputPath`) - **IAM role ARN** (from `RoleArn`) - **Region**...

Details

Author
awslabs
Repository
awslabs/agent-plugins
Created
3 months ago
Last Updated
4 days ago
Language
Shell
License
Apache-2.0

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

ml-ops-engineer

Expert MLOps engineering covering model deployment, ML pipelines, model monitoring, feature stores, and infrastructure automation. Use when deploying models to production, building training pipelines, setting up drift detection, configuring feature stores, or automating ML CI/CD workflows.

183 Updated 3 days ago
borghei
DevOps & Infrastructure Solid

agent-skill-deploy

Deploys agent skill collections from any GitHub repository with a /skills folder to one or more distribution surfaces: GitHub releases, Claude Code marketplace, VS Code plugin marketplace, and Copilot CLI plugin marketplace. Handles pre-flight validation, conventional commit analysis, version bumping across surface configs, and surface-specific publishing with dry-run support. Use when releasing, publishing, or deploying a skills collection to any supported marketplace or creating a GitHub release for a skills repository. Don't use for deploying non-skill packages, npm modules, Docker images, or Azure resources.

29 Updated today
webmaxru
AI & Automation Featured

kubernetes

Kubernetes workflow skill. Use this skill when a user needs workload manifests, rollout strategy, service exposure, or cluster operations guidance.

50 Updated 2 days ago
diegosouzapw
DevOps & Infrastructure Solid

openclaw-it-team-deploy

为 openclaw-it-team 仓库生成和维护部署型工作流。用于把当前仓库的多 Agent 配置安装到 OpenClaw、本地化 `openclaw.json`、填充模型与飞书账号配置,并完成启动前验收。

31 Updated 1 months ago
jefferyjob
AI & Automation Featured

bedrock

AWS Bedrock foundation models for generative AI. Use when invoking foundation models, building AI applications, creating embeddings, configuring model access, or implementing RAG patterns.

1,108 Updated today
itsmostafa