[RLlib] Initial design for Ray-Data based offline RL Algos (on new API stack). #44969
Merged
sven1977 merged 74 commits into ray-project:master from simonsays1980:offline-data-new-stack on Jul 22, 2024 (+5,433 −196)
3f13747
Sketched a simple offline data class with usage in 'Algorithm'.
simonsays1980 f33e595
LINTER.
simonsays1980 1496723
Merged master
simonsays1980 801911d
Implemented test for 'OfflineData' class and stored intermediate work.
simonsays1980 ff46fa2
LINTER.
simonsays1980 caa48d0
Added a basic workflow to convert batches into list of episodes for L…
simonsays1980 c748df8
Changed comment.
simonsays1980 6409007
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 d2f9030
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 a3416a8
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 8582ad9
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 b565f34
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 c0eed1f
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 341cb95
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 b76807f
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 c84fab8
Merged master.
simonsays1980 85cf954
Initial commit for BC with offline data API in new stack.
simonsays1980 69157f5
Implemented BC in new API stack with Ray Data API and Learner API usi…
simonsays1980 af9c9e9
Merge branch 'master' of https://github.com/ray-project/ray
simonsays1980 780b49d
Merge branch 'master' into offline-data-new-stack
simonsays1980 7337d08
Added new test to BUILD file.
simonsays1980 da83264
Added new test for offline data to BUILD file.
simonsays1980 82ae5bd
Added functionality to map batches directly to episode lists in an it…
simonsays1980 06ed2e5
Fixed bug in test.
simonsays1980 bafdcba
Added @sven1977's review.
simonsays1980 72cd797
Added locality hints for distributed training.
simonsays1980 848a205
Multi-learner initialization.
simonsays1980 d39ef0a
LINTER.
simonsays1980 4952de7
Tryout with callable class in 'map_batches'.
simonsays1980 d4479ff
Merged master.
simonsays1980 372a107
Added resampled JSONL CartPole-v1 dataset from cartpole-small.json wi…
simonsays1980 5ffa94b
Modified episode conversion to work with new data format for offline …
simonsays1980 dcf9524
Added 'batch_size' to the 'map_batches' and modified '_map_to_episode…
simonsays1980 04ce0f0
Added large CartPole-v1 data in new format. Modified BC algorithm, fi…
simonsays1980 72088fa
Merge branch 'master' into offline-data-new-stack
simonsays1980 40e9b35
Started multi-learner setup.
simonsays1980 89b06fe
Added tuned example for BC with new offline API.
simonsays1980 0da5db1
Merge branch 'master' into offline-data-new-stack
simonsays1980 8d5f1bd
Set up multi-learner training and tested it.
simonsays1980 5128bdb
Merged master.
simonsays1980 5d9dd97
Set default to 'parquet' files. Tested for different learner setups a…
simonsays1980 19d99a5
Added parquet files for cartpole and pendulum data. Also reset the de…
simonsays1980 c437210
Merge branch 'master' into offline-data-new-stack
simonsays1980 f1a7663
Added @sven1977's review. Also added 'override_num_blocks' to tuned e…
simonsays1980 dea037b
Disabled hybrid stack. Tested old stack and made some cleanups.
simonsays1980 356222a
Much refactoring and fixing smaller and larger bugs related to transf…
simonsays1980 fb3eea0
Merge branch 'master' into offline-data-new-stack
simonsays1980 8b16489
Readded the test for BC in old stack.
simonsays1980 68fabec
Fixed a small bug, due to the fact that 'Algorithm' objects do not ha…
simonsays1980 980ebcc
Reset concurrency. This was a relic from testing.
simonsays1980 8d45a78
Fixed some minor bugs that caused tests to fail.
simonsays1980 38b43cf
Merge branch 'master' into offline-data-new-stack
simonsays1980 b7b0a34
Another small bug fix due to the hybrid stack.
simonsays1980 9870e5f
Set training step such that old and hybrid stacks are training on MAR…
simonsays1980 56e5de5
Refactored hybrid and new stack training logic into two separate meth…
simonsays1980 34fa38e
Some small nits.
simonsays1980 1a81cd9
More small nits fixed in test file for OfflineData.
simonsays1980 6b4939f
Merged master.
simonsays1980 a3fe5fb
Added changes to BC to enable multi-learner.
simonsays1980 88786b6
Fixed a bug in offline data tests and refactored. In addition changed…
simonsays1980 01a4f54
Fixed a bug in BC with policies being a set not a list.
simonsays1980 e8ade96
Fixed data path in BUILD.
simonsays1980 2b3af33
Fixed a small bug in 'LearnerGroup' due to mistyped arguments.
simonsays1980 b226c2e
Small modification of learning rate in multi-agent SAC test.
simonsays1980 b406295
Merge branch 'master' into offline-data-new-stack
simonsays1980 6c855ad
Merged master.
simonsays1980 7b928d2
Merge branch 'master' into offline-data-new-stack
simonsays1980 6582320
Added a further check to deal in Offline Data setups with 'PolicyServ…
simonsays1980 30c3f8d
Readded '_set_optimizer_state' after erroneously removing it.
simonsays1980 59cf300
Merge branch 'master' into offline-data-new-stack
simonsays1980 a1cf4f1
Merged master and modified 'OfflineData' to integrate the newest chan…
simonsays1980 be43bcd
Saving state.
simonsays1980 30f4170
Fixed bug with synching the weights between learner and local worker …
simonsays1980 7591410
Merge branch 'master' into offline-data-new-stack
simonsays1980
nice! Old stack is no longer the norm :D