Kubernetes scanner that discovers LLMs running on vLLM and extracts their deployment and runtime facts.
-
Updated
Jan 29, 2026 - Python
Kubernetes scanner that discovers LLMs running on vLLM and extracts their deployment and runtime facts.
ModelSpec is an open, declarative specification for describing how AI models especially LLMs are deployed, served, and operated in production. It captures execution, serving, and orchestration intent to enable validation, reasoning, and automation across modern AI infrastructure.
Add a description, image, and links to the modelspec topic page so that developers can more easily learn about it.
To associate your repository with the modelspec topic, visit your repo's landing page and select "manage topics."