
Conversation

@MasterJH5574
Contributor

This PR introduces the initial KV cache interface setup for multi-head latent attention (MLA) in DeepSeek models.

Some interface implementations are marked as TODO and will be implemented in the near future.
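For readers unfamiliar with MLA, the sketch below illustrates the general idea behind such a cache: instead of storing full per-head key/value tensors, MLA caches a low-rank compressed KV latent plus a small decoupled RoPE key per token. This is a hypothetical, simplified Python sketch for illustration only; the class name, fields, and use of NumPy are assumptions and do not reflect the actual TVM KV cache interface added by this PR.

```python
# Hypothetical sketch (NOT the TVM API): the core idea of an MLA-style KV cache.
from dataclasses import dataclass, field
from typing import List, Tuple

import numpy as np


@dataclass
class MLAKVCacheSketch:
    """Caches the compressed KV latent and the decoupled RoPE key per layer."""

    num_layers: int
    kv_lora_rank: int      # dimension of the compressed KV latent (e.g. 512 in DeepSeek-V2)
    qk_rope_head_dim: int  # dimension of the decoupled RoPE key (e.g. 64)
    # One growing buffer per layer; a production cache would use paged, preallocated storage.
    latent: List[np.ndarray] = field(init=False)
    rope_key: List[np.ndarray] = field(init=False)

    def __post_init__(self) -> None:
        self.latent = [np.empty((0, self.kv_lora_rank), dtype=np.float16)
                       for _ in range(self.num_layers)]
        self.rope_key = [np.empty((0, self.qk_rope_head_dim), dtype=np.float16)
                         for _ in range(self.num_layers)]

    def append(self, layer: int, kv_latent: np.ndarray, k_rope: np.ndarray) -> None:
        """Append the latent and RoPE key for newly processed tokens of one layer."""
        self.latent[layer] = np.concatenate([self.latent[layer], kv_latent], axis=0)
        self.rope_key[layer] = np.concatenate([self.rope_key[layer], k_rope], axis=0)

    def get(self, layer: int) -> Tuple[np.ndarray, np.ndarray]:
        """Return everything cached so far for this layer, for use in attention."""
        return self.latent[layer], self.rope_key[layer]
```

Because only the latent and a single small RoPE key are cached per token (rather than per-head K and V), the cache footprint is substantially smaller than a conventional multi-head KV cache.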
@MasterJH5574
Contributor Author

@tvm-bot rerun

@jinhongyii jinhongyii merged commit 8b4df72 into apache:main Jan 31, 2025
19 checks passed
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025
