Skip to content

Conversation

@AKKamath
Copy link
Contributor

Passes the threadId to the prefix mask call manually. Shouldn't change existing code, but is necessary since POD remaps threadIds and blockIds.

@yzh119 yzh119 merged commit a9935ea into flashinfer-ai:main May 14, 2025
2 checks passed
@Edenzzzz
Copy link
Contributor

Edenzzzz commented May 17, 2025

I will try to add BatchedPrefill for POD in the meantime. It seems mostly about setting up the params and page indices in wrapper.plan and pod_with_kv_cache_tensor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants