Added inference serving with DMA slicing for KV handoff

fe974d0
Open

KV handoff with DMA slicing APIs to avoid KV input/output copies. #1039

DCO / DCO succeeded Jun 4, 2026 in 0s

DCO

All commits are signed off!