edge-deployment-skill

Solid

ML model optimization and deployment on robot edge devices (Jetson, embedded)

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
58
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Edge Deployment Skill ## Overview Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems. ## Capabilities - Configure TensorRT optimization for NVIDIA Jetson - Set up ONNX model conversion and optimization - Implement INT8 and FP16 quantization - Configure DeepStream for video analytics - Set up CUDA graph optimization - Implement model pruning and distillation - Configure DLA (Deep Learning Accelerator) deployment - Set up multi-stream inference - Implement ROS2 inference nodes - Profile and benchmark on target hardware ## Target Processes - nn-model-optimization.js - object-detection-pipeline.js - rl-robot-control.js - field-testing-validation.js ## Dependencies - TensorRT - ONNX Runtime - NVIDIA Jetson SDK - DeepStream ## Usage Context This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance. ## Output Artifacts - TensorRT engine files - ONNX optimized models - Quantization configurations - DeepStream pipeline configs - Inference benchmark reports - ROS2 inference node implementations

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Related Skills