jiayus-nvidia

Follow

jiayus-nvidia

Follow

3 followers · 0 following

Achievements

Achievements

Popular repositories Loading

flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 3 3
FBGEMM FBGEMM Public

Forked from pytorch/FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 2 4
atten atten Public

Forked from NVIDIA-JerryChen/atten

Python
TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python
cudnn-frontend cudnn-frontend Public

Forked from NVIDIA/cudnn-frontend

cuDNN Frontend is NVIDIA's modern, open-source entry point to the cuDNN library and a growing collection of high-performance open-source kernels.

Python