Skip to content

Conversation

@bringlein
Copy link
Collaborator

No description provided.

Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
@bringlein
Copy link
Collaborator Author

python3 /scripts/benchmark.py prefix -x /scripts/setups/prefix_correctnes.conf
python3 /scripts/benchmark.py prefix -x /scripts/setups/prefix_correctnes_rocm.conf

passes on H100/MI300

Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
@bringlein
Copy link
Collaborator Author

inside the docker container, just

vllm serve meta-llama/Llama-3.1-8B-Instruct

will use vLLM V1 with the vllm-triton-backend plugin and jitcache etc. from triton-dejavu

@bringlein bringlein changed the title Load test setup from config files, fix correctness Load test setup from config files, fix correctness, enable plugin in V1 Jun 26, 2025
@bringlein bringlein requested review from jvlunteren and tdoublep June 26, 2025 13:27
Signed-off-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Copy link
Collaborator

@jvlunteren jvlunteren left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@bringlein bringlein merged commit 3422877 into main Jun 27, 2025
1 check passed
@bringlein bringlein deleted the ngl_updates_06-2025_2 branch June 27, 2025 09:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants