A high-throughput and memory-efficient inference and serving engine for LLMs
vLLM (Internal Mirror)

This repository is an internal mirror of vLLM, adapted for our company's environment.

Purpose

  • Internal Customization: Adapt vLLM to our infrastructure, e.g., modifying the Dockerfile for our cluster.
  • Experimental Features: Add prototypes and experimental changes that are not in the official vLLM.

Usage

Use the prebuilt Docker images hosted on GitHub Container Registry:

# Choose the image according to your environment (CUDA or ROCm)
docker pull ghcr.io/MotifTechnologies/vllm:cuda-latest  # for CUDA
docker pull ghcr.io/MotifTechnologies/vllm:rocm-latest  # for ROCm

# Run the container
docker run --rm -it ghcr.io/MotifTechnologies/vllm:<tag>

  • For a full list of available images and tags, see GHCR.
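If you want to script the CUDA-vs-ROCm choice, one option is to probe for each stack's CLI tool. This is only a sketch: the detection heuristics, the `pick_image` helper, and falling back to the CUDA tag are assumptions, not something this repository provides.

```python
import shutil


def pick_image(default_tag: str = "cuda-latest") -> str:
    """Pick a GHCR image by probing for GPU-stack tools on PATH.

    Hypothetical helper: detection via nvidia-smi/rocm-smi and the
    CUDA default are assumptions; adjust for your cluster.
    """
    if shutil.which("nvidia-smi"):
        tag = "cuda-latest"  # NVIDIA stack detected
    elif shutil.which("rocm-smi"):
        tag = "rocm-latest"  # AMD ROCm stack detected
    else:
        tag = default_tag    # assumed fallback when neither tool is found
    return f"ghcr.io/MotifTechnologies/vllm:{tag}"


if __name__ == "__main__":
    print(pick_image())
```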

Examples

SkyPilot multi-node DP with external load balancing

...
envs:
  PROXY_PORT: 8000
  MY_VLLM_PORT: 8080 # Can't use VLLM_PORT due to conflict with vLLM's internal usage

run: |
  if [ "$SKYPILOT_NODE_RANK" == "0" ]; then
    pkill haproxy || true  # don't fail if no haproxy is running yet
    # $SKYPILOT_NODE_IPS is automatically provided by SkyPilot
    python /app/vllm/examples/proxy/generate_haproxy_cfg.py \
      --proxy-port $PROXY_PORT \
      --vllm-port $MY_VLLM_PORT \
      --nodes "$SKYPILOT_NODE_IPS" \
      --output haproxy.cfg
    haproxy -f haproxy.cfg
  fi
  # Run a vLLM server on each node as a single-node instance
  vllm serve --port $MY_VLLM_PORT ...
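For intuition, the config produced by generate_haproxy_cfg.py presumably amounts to an HAProxy frontend on PROXY_PORT that round-robins requests across one vLLM server per node. Below is a minimal Python sketch of such a renderer; it is hypothetical (the real script's flags, output format, and timeouts may differ), but it shows how the node IP list expands into backend entries.

```python
def render_haproxy_cfg(proxy_port: int, vllm_port: int, nodes: str) -> str:
    """Render a minimal HAProxy config for round-robin load balancing.

    Hypothetical sketch of what generate_haproxy_cfg.py might emit.
    `nodes` is a whitespace-separated IP list, in the shape SkyPilot
    provides via $SKYPILOT_NODE_IPS.
    """
    # One "server" line per node, pointing at that node's vLLM port.
    servers = "\n".join(
        f"    server vllm{i} {ip}:{vllm_port} check"
        for i, ip in enumerate(nodes.split())
    )
    return (
        "defaults\n"
        "    mode http\n"
        "    timeout connect 5s\n"
        "    timeout client  300s\n"
        "    timeout server  300s\n"
        "\n"
        "frontend vllm_in\n"
        f"    bind *:{proxy_port}\n"
        "    default_backend vllm_nodes\n"
        "\n"
        "backend vllm_nodes\n"
        "    balance roundrobin\n"
        f"{servers}\n"
    )


if __name__ == "__main__":
    print(render_haproxy_cfg(8000, 8080, "10.0.0.1 10.0.0.2"))
```

With the ports from the example above, the frontend binds *:8000 and each backend entry targets a node's port 8080, matching PROXY_PORT and MY_VLLM_PORT.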
