Continuous batching refactor #40426
Merged
27 commits
9495e0e  Rework of the CB example (remi-or)
cce99f0  Further rework of CB example (remi-or)
74a0d73  Refactor PA cache, slice on tokens, add debug prints -- WIP (remi-or)
79118c5  Slice cache -- WIP (remi-or)
dc53ad6  Added a mechanism to check batched outputs in CB script (remi-or)
b107785  Less logging, debug flag for slice, !better reset! -- WIP (remi-or)
bababa4  QOL and safety margins (remi-or)
f01e9db  Refactor and style (remi-or)
3cffe20  Better saving of cb example (remi-or)
7cd70ac  Fix (remi-or)
2933099  Fixes and QOL (remi-or)
bfcf611  More information about metrics (remi-or)
f000b17  Further logging (remi-or)
042e87d  Style (remi-or)
604fe6e  Licenses (remi-or)
ef63547  Removed some comments (remi-or)
d403b02  Add a slice input flag (remi-or)
023774f  Fix in example (remi-or)
c327f08  Added back some open-telemetry deps (remi-or)
173b497  Removed some aux function (remi-or)
fff2ee8  Added FA2 option to example script (remi-or)
7353aef  Fixed math (all of it) (remi-or)
0de06e3  Added a simple example (remi-or)
8325b37  Renamed core to classes (remi-or)
7dee44e  Made allocation of attention mask optional (remi-or)
3f17daf  Style (remi-or)
463ea91  Merge branch 'main' into conbat (remi-or)