```bash
# Install compiler
pip install -i https://pypi.rbln.ai/simple rebel-compiler==0.10.2

# Navigate to model directory
cd huggingface/transformers/text2text-generation/llama/llama3.1-8b

# Install dependencies
pip install -r requirements.txt

# Compile and run
python compile.py && python inference.py
```

Note
The versions pinned above match what this repo was tested with as of 2026-03-27; a newer stable release may already be available. For current versions and install steps, see the RBLN installation guide.
Tip
For models that support configuration presets, use --model_name <preset> to specify model-specific configurations. See each model's README for available presets.
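To illustrate the preset mechanism, here is a minimal sketch of how a `--model_name` flag can map a preset to model-specific settings. The preset names and configuration fields below are hypothetical, chosen for illustration only; the actual presets and options are defined in each model's README and `compile.py`:

```python
# Hypothetical sketch: map a --model_name preset to compile settings.
# PRESETS keys/fields are illustrative, not real rebel-compiler options.
import argparse

PRESETS = {
    "llama3.1-8b": {"batch_size": 1, "max_seq_len": 131072},
    "llama3.1-70b": {"batch_size": 1, "max_seq_len": 131072},
}

def resolve_config(argv):
    """Parse --model_name and return the matching preset dict."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--model_name", required=True, choices=sorted(PRESETS))
    args = parser.parse_args(argv)
    return PRESETS[args.model_name]

if __name__ == "__main__":
    print(resolve_config(["--model_name", "llama3.1-8b"]))
```

Restricting `--model_name` with `choices` makes an unknown preset fail fast with a clear argparse error rather than a downstream compile failure.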
Important
An RBLN portal account is required to install rebel-compiler from PyPI.
Select the ecosystem or API for your AI serving workload on RBLN NPUs.
| Ecosystem | # Models | Key packages |
|---|---|---|
| Hugging Face | 150+ | transformers, diffusers |
| PyTorch | 250+ | torch |
| TensorFlow | 75+ | keras, tensorflow |
C API — C/C++ inference bindings. Install via APT, then build from source.
Compile a model from the Model Zoo, then deploy with:
```bash
# Compile
python compile.py

# Install vLLM-RBLN
pip3 install \
    --extra-index-url https://download.pytorch.org/whl/cpu \
    --extra-index-url https://wheels.vllm.ai/0.13.0/cpu \
    vllm-rbln==0.10.2
```

Note
The versions pinned above match what this repo was tested with as of 2026-03-27; a newer stable release may already be available. For current versions and install steps, see the RBLN installation guide.
```python
# Import
from vllm import LLM, SamplingParams

# Load model and generate
llm = LLM(model="Llama-3.1-8B-Instruct")
out = llm.generate(["Hello"], SamplingParams(max_tokens=64))
print(out[0].outputs[0].text)
```

- vLLM-RBLN — LLM serving on RBLN NPUs
- Triton — Triton Inference Server
- TorchServe — PyTorch model serving
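Beyond the offline `LLM.generate` example above, upstream vLLM also ships an OpenAI-compatible HTTP server. A minimal sketch, assuming the standard `vllm serve` entrypoint behaves the same under vllm-rbln and that the model path matches the one used in the example (check the vLLM-RBLN docs for the supported invocation on your setup):

```bash
# Start an OpenAI-compatible server (standard vLLM entrypoint;
# assumed to work identically under vllm-rbln)
vllm serve Llama-3.1-8B-Instruct --port 8000

# Query it from another shell
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "Llama-3.1-8B-Instruct", "prompt": "Hello", "max_tokens": 64}'
```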
