llama : custom attention mask + parallel decoding + no context swaps #3228
Changes from all commits
c5df72e
3b4bab6
1fb033f
fad5693
d29e769
58bb511
9f42e75
6952a46
4d76d76
f015b26
86c90e3
0cbf3bf
7c1bdd0
1f17ea6
0161372
466b513
897cacc
fa0e677
daf4c6d
7e2b997
25bd254
467e307
36714e1
ddad227
806d397
16090a5
d37081a
82e20e9
4b5f3cd
8a9aca3
eed3fd4
6028879
7b7472e
e1067ef
a1327c7
addae65
b377bf2
db0fc2d
e04dc51
5420696
2f3a46f
1be2b8c
ee1d670
ded9b43
b2debf6
5a3369d
8845160
c1596f6
2585690
4ad0676
e946379
4c72ab1
d008733
a207561
2b8830a
ce2d995
c5650ed