npu-optimization

Here are 2 public repositories matching this topic...

Tensorbit-Labs / tensorbit-core

High-performance C++ engine for Second-Order Hessian Pruning. The surgical foundation of the Tensorbit Labs P-D-Q pipeline for ultra-efficient LLM and Vision Transformer edge inference.

sparsity cpp inference-engine model-compression edge-ai llm llm-optimization llm-infrastructure npu-optimization hessian-pruning tensorbit

Updated May 6, 2026
C++

sonjuhy / VisionicAI

VisionicAI: Ultra-lightweight object detection model optimized for NPU. Architecture design and the entire training pipeline were developed by a single person.

pytorch object-detection computer-visio reparameterization lightweight-model npu-optimization

Updated Jun 2, 2026
Python

