Skip to content
@d-matrix-ai

d-Matrix

d-Matrix.ai

Transforming AI from unsustainable to attainable. d-Matrix powers next generation compute for generative AI inference.

Pinned Loading

  1. dmx-compressor dmx-compressor Public

    d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise function approximations.

    Python 21 3

  2. keyformer-llm keyformer-llm Public

    Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning

    Python 57 5

  3. comet comet Public

    COMET is a framework for modeling and optimizing dataflow for compound operations on machine learning accelerators

    C++ 2

  4. rlquant rlquant Public

    Reinforcement Learning algorithms (GRPO, drGRPO, etc) under quantization (QAT, PTQ)

    Python 1 1

  5. pt2-bfp pt2-bfp Public

    Implementation of Block Floating Point supporting the Pytorch 2.0 export quantization route.

    Python

  6. bigcode-evaluation-harness bigcode-evaluation-harness Public

    Forked from bigcode-project/bigcode-evaluation-harness

    A framework for the evaluation of autoregressive code generation language models.

    Python

Repositories

Showing 10 of 10 repositories
  • dmx-compressor Public

    d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise function approximations.

    d-matrix-ai/dmx-compressor’s past year of commit activity
    Python 21 Apache-2.0 3 0 0 Updated Mar 5, 2026
  • .github Public
    d-matrix-ai/.github’s past year of commit activity
    0 0 0 0 Updated Jan 10, 2026
  • comet Public

    COMET is a framework for modeling and optimizing dataflow for compound operations on machine learning accelerators

    d-matrix-ai/comet’s past year of commit activity
    C++ 0 Apache-2.0 2 0 0 Updated Dec 20, 2025
  • rlquant Public

    Reinforcement Learning algorithms (GRPO, drGRPO, etc) under quantization (QAT, PTQ)

    d-matrix-ai/rlquant’s past year of commit activity
    Python 1 1 0 0 Updated Sep 15, 2025
  • lm-evaluation-harness-nikita Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    d-matrix-ai/lm-evaluation-harness-nikita’s past year of commit activity
    Python 0 MIT 3,184 0 0 Updated Nov 15, 2024
  • pt2-bfp Public

    Implementation of Block Floating Point supporting the Pytorch 2.0 export quantization route.

    d-matrix-ai/pt2-bfp’s past year of commit activity
    Python 0 Apache-2.0 0 1 0 Updated Aug 7, 2024
  • bigcode-evaluation-harness Public Forked from bigcode-project/bigcode-evaluation-harness

    A framework for the evaluation of autoregressive code generation language models.

    d-matrix-ai/bigcode-evaluation-harness’s past year of commit activity
    Python 0 Apache-2.0 260 0 0 Updated Jun 19, 2024
  • keyformer-llm Public

    Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning

    d-matrix-ai/keyformer-llm’s past year of commit activity
    Python 57 Apache-2.0 5 4 0 Updated Mar 26, 2024
  • gptq Public Forked from IST-DASLab/gptq

    Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers"

    d-matrix-ai/gptq’s past year of commit activity
    Python 0 201 0 0 Updated Feb 5, 2024
  • mlperf-hf Public Forked from guschmue/mlperf-hf

    Benchmark huggingface models the mlperf way

    d-matrix-ai/mlperf-hf’s past year of commit activity
    Python 0 1 0 0 Updated Dec 20, 2022

Top languages

Python C++

Most used topics

Loading…