High-performance C++ engine for Second-Order Hessian Pruning. The surgical foundation of the Tensorbit Labs P-D-Q pipeline for ultra-efficient LLM and Vision Transformers edge inference.
-
Updated
May 6, 2026 - C++
High-performance C++ engine for Second-Order Hessian Pruning. The surgical foundation of the Tensorbit Labs P-D-Q pipeline for ultra-efficient LLM and Vision Transformers edge inference.
VisionicAI: Ultra-lightweight object detection model optimized for NPU. 1인 주도로 아키텍처 설계 및 학습 파이프라인 전 과정 개발.
Add a description, image, and links to the npu-optimization topic page so that developers can more easily learn about it.
To associate your repository with the npu-optimization topic, visit your repo's landing page and select "manage topics."