Skip to content

feat(llama-cpp-localai-paged): paged KV cache llama.cpp backend + cross-request prefix sharing + GB10 decode optimization [WIP]#10462

Open
localai-bot wants to merge 194 commits into
masterfrom
worktree-feat+paged-attention
Open

feat(llama-cpp-localai-paged): paged KV cache llama.cpp backend + cross-request prefix sharing + GB10 decode optimization [WIP]#10462
localai-bot wants to merge 194 commits into
masterfrom
worktree-feat+paged-attention

docs(paged): record MoE-prefill engine-gap decomposition + GEMM-port …

be65438
Select commit
Loading
Failed to load commit list.

Select a check to view from the sidebar