NVIDIA Corporation
- 25k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Repositories
- TensorRT-LLM Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
- aicr Public
Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes
- OSMO Public
The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
- TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
- NeMo-Agent-Toolkit-UI Public
The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application.
- NeMo-Retriever Public
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.