1 parent 7c49180 commit 8654736
torchtune/modules/kv_cache.py
@@ -17,7 +17,7 @@ class KVCache(nn.Module):
     Args:
         batch_size (int): batch size model will be run with
         max_seq_len (int): maximum sequence length model will be run with
-        num_kv_heads (int): number key/value heads.
+        num_kv_heads (int): number of key/value heads.
         head_dim (int): per-attention head embedding dimension
         dtype (torch.dtype): dtype for the caches
     """