A lightweight, config-driven framework for unified vision-language model inference across local and cloud backends.
transformers vlm huggingface inference-framework vision-language-model vllm ollama multimodal-ai mlx-vlm multimodal-inference
-
Updated
Apr 2, 2026 - Python