Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add batched Llama model definition using vLLM paged attention (#1134)
* Add batched Llama model with vllm paged attention * update core.py * doc * minor * add e2e test * mv file * clean * Check if TVM has been built with USE_VLLM * update BuildArgs docstring
- Loading branch information