Vector compression with TurboQuant codecs for embeddings, retrieval, and KV-cache. 10x compression, pure NumPy, no GPU required.
retrieval quantization vector-database kv-cache llm-inference embedding-compression turboquant vector-compression qjl semafold
-
Updated
Mar 31, 2026 - Python