You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SUMMARY:
Added e2e testing for block quantization.
TEST PLAN:
Tested locally with the following command:
```
python -m pytest tests/e2e/vLLM/test_vllm.py -vv -s
```
log:
```
================= vLLM GENERATION =================
PROMPT:
The capital of France is
GENERATED TEXT:
Paris, which is located in the Île-de-France region. The
PROMPT:
The president of the US is
GENERATED TEXT:
paying for the protests against him. The White House has reportedly cut
PROMPT:
My name is
GENERATED TEXT:
[insert name], and I am a [insert job title]. I am excited
PASSED
===================================================================================================================== 1 passed in 130.10s (0:02:10) =====================================================================================================================
```
---------
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
0 commit comments