Commits
Commits on Dec 30, 2022
initial commit of sub-quadratic attention source from https://github.com/AminRezaei0x443/memory-efficient-attention.
committed- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Revert "move kv_chunk_size_min concern to callsite (1c4f107)" because equivalent fast-path for 1 query chunk, 1 kv chunk is already supported inside
committed- committed
- committed