Skip to content

Commit

Permalink
fix model update (vllm-project#116)
Browse files Browse the repository at this point in the history
  • Loading branch information
Sara Adkins authored Jul 31, 2024
1 parent 622f721 commit 3976ea7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tests/test_quantization/lifecycle/test_apply.py
Original file line number Diff line number Diff line change
Expand Up @@ -110,7 +110,7 @@ def test_apply_quantization_config_tinyllama():
# sanity check correct number of layers targeted
assert num_linears == 154 # 155 Linear layers - 1 that gets ignored
assert num_embeddings == 1
assert num_rotary_embeddings == 22
assert num_rotary_embeddings == 23 # model updated, now has model.rotary_embedding

# test quantization compression
# sample forward pass to fill scales, zps
Expand Down

0 comments on commit 3976ea7

Please sign in to comment.