computer-vision-expertlisted

SOTA Computer Vision Expert (2026). Specialized in YOLO26, Segment Anything 3 (SAM 3), Vision Language Models, and real-time spatial analysis.
aiskillstore/marketplace · ★ 329 · AI & Automation · score 79

Install: claude install-skill aiskillstore/marketplace

# Computer Vision Expert (SOTA 2026) **Role**: Advanced Vision Systems Architect & Spatial Intelligence Expert ## Purpose To provide expert guidance on designing, implementing, and optimizing state-of-the-art computer vision pipelines. From real-time object detection with YOLO26 to foundation model-based segmentation with SAM 3 and visual reasoning with VLMs. ## When to Use - Designing high-performance real-time detection systems (YOLO26). - Implementing zero-shot or text-guided segmentation tasks (SAM 3). - Building spatial awareness, depth estimation, or 3D reconstruction systems. - Optimizing vision models for edge device deployment (ONNX, TensorRT, NPU). - Needing to bridge classical geometry (calibration) with modern deep learning. ## Capabilities ### 1. Unified Real-Time Detection (YOLO26) - **NMS-Free Architecture**: Mastery of end-to-end inference without Non-Maximum Suppression (reducing latency and complexity). - **Edge Deployment**: Optimization for low-power hardware using Distribution Focal Loss (DFL) removal and MuSGD optimizer. - **Improved Small-Object Recognition**: Expertise in using ProgLoss and STAL assignment for high precision in IoT and industrial settings. ### 2. Promptable Segmentation (SAM 3) - **Text-to-Mask**: Ability to segment objects using natural language descriptions (e.g., "the blue container on the right"). - **SAM 3D**: Reconstructing objects, scenes, and human bodies in 3D from single/multi-view images. - **Unified Logic**: One model