MorphServe

MorphServe is a dynamic, workload-aware serving framework for Large Language Models (LLMs). It enables elastic adaptation of both transformer layer weight precision and KV cache capacity at runtime, allowing systems to maintain high throughput and stable latency under varying workloads.

🚧 Code Release Status

We are preparing the codebase for public release. The repository is being cleaned, documented, and organized for reproducibility, and additional examples and usage guidelines are in progress.

👉 The full code will be released soon. Stay tuned!

🔍 Overview

Modern LLM serving systems face significant challenges under dynamic and bursty workloads, where fixed resource allocation strategies often lead to suboptimal performance or degraded quality.

MorphServe addresses this by introducing a joint adaptation mechanism that coordinates:

Layer Weight Precision (via selective quantization)
KV Cache Capacity (via dynamic memory management)

This allows the system to respond effectively to runtime resource pressure without requiring model retraining or offline reconfiguration.
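Until the code is released, the toy sketch below illustrates what coordinating the two knobs can look like. It is not MorphServe's implementation or policy. Every name in it (`ToyServerState`, `adapt`), the thresholds, the layer ordering, and the memory sizes are assumptions made up for illustration. The idea it shows: when KV cache demand approaches the available capacity, swap some layers to a smaller quantized copy so the freed memory can grow the KV cache, then restore full precision once demand drops.

```python
from dataclasses import dataclass, field


@dataclass
class ToyServerState:
    """Hypothetical memory accounting for one GPU; all sizes in MiB."""
    total_mem: int
    layer_fp16_size: int    # assumed size of one full-precision layer
    layer_quant_size: int   # assumed size of the same layer when quantized
    num_layers: int
    quantized_layers: set = field(default_factory=set)

    def weight_mem(self) -> int:
        q = len(self.quantized_layers)
        return q * self.layer_quant_size + (self.num_layers - q) * self.layer_fp16_size

    def kv_capacity(self) -> int:
        # In this toy model, memory not held by weights is KV cache capacity.
        return self.total_mem - self.weight_mem()


def adapt(state: ToyServerState, kv_demand: int,
          high: float = 0.9, low: float = 0.5) -> ToyServerState:
    """One adaptation step with hysteresis (thresholds are assumptions)."""
    per_layer_gain = state.layer_fp16_size - state.layer_quant_size

    # Under pressure: quantize one more layer at a time, which grows KV capacity.
    while (kv_demand > high * state.kv_capacity()
           and len(state.quantized_layers) < state.num_layers):
        # Hypothetical ordering: start from the last layer and move backwards.
        state.quantized_layers.add(state.num_layers - 1 - len(state.quantized_layers))

    # Pressure gone: restore layers while demand stays well below the
    # capacity that would remain after the restore.
    while state.quantized_layers:
        capacity_after_restore = state.kv_capacity() - per_layer_gain
        if kv_demand > low * capacity_after_restore:
            break
        state.quantized_layers.remove(min(state.quantized_layers))

    return state


if __name__ == "__main__":
    state = ToyServerState(total_mem=1000, layer_fp16_size=20,
                           layer_quant_size=5, num_layers=32)
    for demand in (200, 400, 100):  # a quiet period, a burst, then recovery
        adapt(state, demand)
        print(demand, sorted(state.quantized_layers), state.kv_capacity())
```

The gap between the `high` and `low` thresholds keeps the toy controller from flipping a layer back and forth when demand hovers near a single cutoff. A real system would also have to account for swap latency and the accuracy impact of each quantized layer, which this sketch ignores.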

✨ Features

Dynamic adaptation to workload fluctuations
No retraining required
Compatible with existing LLM serving optimizations (e.g., FlashAttention, PagedAttention, dynamic batching)
Designed for real-world deployment scenarios

📄 Paper

More details can be found in our paper:

MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing

📌 https://mlsys.org/virtual/2026/oral/3816

📌 Citation

If you find our work useful, please consider citing:

@article{su2025morphserve,
  title={MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing},
  author={Su, Zhaoyuan and Zhang, Zeyu and Lan, Tingfeng and Wang, Zirui and Shen, Haiying and Yang, Juncheng and Cheng, Yue},
  journal={arXiv preprint arXiv:2506.02006},
  year={2025}
}
