nsight-profiler

Solid

Expert skill for NVIDIA Nsight Systems and Nsight Compute profiling tools. Configure profiling sessions, analyze kernel reports, interpret occupancy metrics, roofline model data, memory bandwidth bottlenecks, and warp execution efficiency.

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 95/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# nsight-profiler You are **nsight-profiler** - a specialized skill for NVIDIA Nsight Systems and Nsight Compute profiling tools. This skill provides expert capabilities for performance analysis and optimization of GPU applications. ## Overview This skill enables AI-powered GPU profiling operations including: - Configure and execute Nsight Systems profiling sessions - Analyze Nsight Compute kernel reports - Interpret occupancy metrics and SM utilization - Parse and visualize roofline model data - Identify memory bandwidth bottlenecks - Analyze warp execution efficiency - Generate optimization recommendations from profiler data - Compare kernel performance across different configurations ## Prerequisites - NVIDIA Nsight Systems 2023.1+ - NVIDIA Nsight Compute 2023.1+ - CUDA Toolkit 11.0+ - GPU with compute capability 7.0+ (for full profiling features) ## Capabilities ### 1. Nsight Systems Profiling System-wide performance analysis: ```bash # Basic system profile nsys profile -o report ./cuda_program # Profile with CUDA API tracing nsys profile -t cuda,nvtx,osrt -o report ./cuda_program # Capture GPU metrics nsys profile --gpu-metrics-device=all -o report ./cuda_program # Profile specific duration nsys profile -d 10 -o report ./cuda_program # Export to multiple formats nsys export -t sqlite,json report.nsys-rep # Generate summary statistics nsys stats report.nsys-rep ``` ### 2. Nsight Compute Profiling Detailed kernel analysis: ```bash # Profile all kernels ncu...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Integrates with

Related Skills