Enforced kv layer ordering #2312

oandreeva-nv · 2025-08-05T19:29:26Z

Overview:

Besides enforcing kv layer ordering, this PR makes the following changes:

Eliminated redundant operations:

Replaced list(kv_caches.values())[0] with next(iter(kv_caches.values())), which fetches the first tensor without building an intermediate list
Removed the unnecessary tensors = list(kv_caches.values()) variable

The original code iterated over kv_caches.values() three times; it now iterates only once (in the validation check).
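A minimal sketch of the pattern described above, not the code from this PR. It assumes kv_caches is a dict mapping layer names to tensors and that the layer index is the first integer in each name. FakeTensor, layer_index, ordered_kv_caches, and the "layers.N.attn" naming scheme are all hypothetical stand-ins chosen for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class FakeTensor:
    """Hypothetical stand-in for a real tensor; only the shape matters here."""
    shape: tuple


def layer_index(name: str) -> int:
    """Return the first integer found in a layer name (assumed naming scheme)."""
    match = re.search(r"\d+", name)
    if match is None:
        raise ValueError(f"no layer index in {name!r}")
    return int(match.group())


def ordered_kv_caches(kv_caches: dict) -> list:
    """Validate shapes, then return (name, tensor) pairs sorted by layer index."""
    if not kv_caches:
        raise ValueError("kv_caches is empty")
    # Fetch the first tensor without materializing list(kv_caches.values()).
    first = next(iter(kv_caches.values()))
    # Single pass over the values for the validation check.
    if any(t.shape != first.shape for t in kv_caches.values()):
        raise ValueError("all kv cache tensors must share one shape")
    # Sort numerically so that layer 10 comes after layer 2.
    return sorted(kv_caches.items(), key=lambda item: layer_index(item[0]))


caches = {
    "layers.10.attn": FakeTensor((2, 16, 8)),
    "layers.2.attn": FakeTensor((2, 16, 8)),
    "layers.0.attn": FakeTensor((2, 16, 8)),
}
names = [name for name, _ in ordered_kv_caches(caches)]
```

Sorting on the numeric index rather than the raw string avoids the lexicographic trap where "layers.10" would sort before "layers.2". Whether the actual connector derives the order this way is an assumption.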

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

copy-pr-bot · 2025-08-05T19:29:29Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

oandreeva-nv added 2 commits August 5, 2025 10:32

Added enforced order in kv_cache passed tensors

7ea5bfe

Added enforced order in kv_cache passed tensors

8f4e662

oandreeva-nv requested review from grahamking, nnshah1, piotrm-nvidia, ptarasiewiczNV, ryanolson and tanmayv25 as code owners August 5, 2025 19:29

oandreeva-nv requested review from GuanLuo, PeaBrane, alec-flowers, biswapanda, ishandhanani, jthomson04, kkranen, paulhendricks, rmccorm4, tedzhouhk and tmonty12 as code owners August 5, 2025 19:29

pull-request-size bot added the size/M label Aug 5, 2025

oandreeva-nv requested review from a team as code owners August 5, 2025 19:29

oandreeva-nv removed request for a team, biswapanda, grahamking, kkranen, paulhendricks and tedzhouhk August 5, 2025 19:29

oandreeva-nv removed request for GuanLuo, PeaBrane, alec-flowers, ishandhanani, jthomson04, nnshah1, piotrm-nvidia, ptarasiewiczNV, rmccorm4, tanmayv25 and tmonty12 August 5, 2025 19:29

ryanolson approved these changes Aug 6, 2025

ryanolson merged commit d52b42e into ryan/connector-dev Aug 6, 2025
6 of 12 checks passed

ryanolson deleted the oandreeva/kvbm/connector/kv_layer_ordering branch August 6, 2025 05:23

3 participants
