-
Peking University
- Beijing
-
19:44
(UTC +08:00)
Highlights
- Pro
Pinned Loading
-
VecAttention
VecAttention Public[CVPR2026] VecAttention: Vector-wise Sparse Attention for Accelerating Long-Context Inference
-
open-compass/VLMEvalKit
open-compass/VLMEvalKit PublicOpen-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
-
Tencent/AngelSlim
Tencent/AngelSlim PublicModel compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
-
microsoft/MInference
microsoft/MInference Public[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…
-
asterinas
asterinas PublicForked from asterinas/asterinas
Asterinas is a safe, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.
Rust
If the problem persists, check the GitHub status page or contact support.

