kvcache: skip multi-turn cache reads in decode-only mode by LouisDDN · Pull Request #284 · mlcommons/storage

LouisDDN · 2026-03-20T15:52:56Z

Skip multi-turn conversation cache reads when running in decode-only mode, since previous turn cache entries are never written in this mode.

This change:

Prevents wasteful cache lookups that always miss
Cleans up multi_turn_cache_misses metrics (no longer polluted)
Improves code correctness by not checking cache that was never written

The multi-turn cache read (Step 2) is now guarded by the same if not self.decode_only check as the prefill write (Step 3), since both operations are meaningless in decode-only mode.

Performance impact: negligible (<0.01%), but improves code clarity.

github-actions · 2026-03-20T15:53:06Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Skip multi-turn conversation cache reads when running in decode-only mode, since previous turn cache entries are never written in this mode. This change: - Prevents wasteful cache lookups that always miss - Cleans up multi_turn_cache_misses metrics (no longer polluted) - Improves code correctness by not checking cache that was never written The multi-turn cache read (Step 2) is now guarded by the same `if not self.decode_only` check as the prefill write (Step 3), since both operations are meaningless in decode-only mode. Performance impact: negligible (<0.01%), but improves code clarity.

LouisDDN requested a review from a team March 20, 2026 15:52

LouisDDN force-pushed the ld/skip-multiturn-decode-only branch from 984ee49 to f45de66 Compare March 20, 2026 15:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kvcache: skip multi-turn cache reads in decode-only mode#284

kvcache: skip multi-turn cache reads in decode-only mode#284
LouisDDN wants to merge 1 commit intomlcommons:mainfrom
LouisDDN:ld/skip-multiturn-decode-only

LouisDDN commented Mar 20, 2026

Uh oh!

github-actions bot commented Mar 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

LouisDDN commented Mar 20, 2026

Uh oh!

github-actions bot commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot commented Mar 20, 2026 •

edited

Loading