feat: Mixed Experience Replay 🤝 #30
Conversation
Hello, I wanted to say that this is great functionality. In my case I am interested in distributed RL with experience sharing among agents, so having a buffer where you can sample/add differently for personal use and for sharing would be great. Is this a feature you are planning to merge soon? And if so, could you extend the notebook to show how you can add to the two buffers?
@eleninisioti Hey, so yeah, ideally we can merge this ASAP. I'm just waiting on one last thing from @callumtilbury, but he's quite busy at the moment. If he's unable to complete it in the coming week, I'll take over and finish it. So hopefully it will be merged sometime this week :)
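In the meantime, here is a minimal sketch of the adding side, assuming the mixer only mixes *sampling* and each underlying buffer keeps its own state, so you add to each buffer independently with the ordinary flat-buffer API. The buffer names and sizes are illustrative, not from this PR:

```python
# Minimal sketch: adding to two buffers independently. The mixer described
# in this PR mixes sampling; each buffer still has its own state and add fn.
# Buffer names and sizes here are illustrative, not from the PR.
import jax.numpy as jnp
import flashbax as fbx

fake_timestep = {"obs": jnp.zeros(8), "reward": jnp.float32(0.0)}

# e.g. a "personal" buffer and a "shared" buffer.
personal = fbx.make_flat_buffer(max_length=1_000, min_length=4, sample_batch_size=4)
shared = fbx.make_flat_buffer(max_length=10_000, min_length=16, sample_batch_size=16)

personal_state = personal.init(fake_timestep)
shared_state = shared.init(fake_timestep)

# Add to whichever buffer the experience belongs in; the states never mix.
personal_state = personal.add(personal_state, fake_timestep)
shared_state = shared.add(shared_state, fake_timestep)
```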
Looks good to me.
A simple utility to mix sampling of multiple buffers. Useful for offline-online stuff, and some off-policy variants that include portions of on-policy data (e.g. "combined experience replay," see here).
Important (& intentional) restrictions: the mixing proportions are fixed as `[x,y,z,...]`, with a joint `sample_batch_size`, when creating the mixer. We are still constrained by the underlying buffer sample functions, though. Suppose we have `buffer_a`, which returns `(4, ...)`, and `buffer_b`, which returns `(16, ...)`. We could create a mixer `[1,1]` with `sample_batch_size=6`. In that case, we get `3` "batches" from `buffer_a` and `3` batches from `buffer_b`. But if we ask for `sample_batch_size=10` with ratio `[1,1]` (i.e. a batch size of `10/2 = 5` from each buffer), we'll only get the `4` batches from `buffer_a`, along with the `5` batches from `buffer_b`: a total `sample_batch_size = 9`. This is the idea of "best effort": we'll try to grab enough batches of data, but only if possible. If not, we return a smaller batch than desired, taking as much as we can from each buffer.

It'd be great to test this out in a real system. Perhaps in Stoix, @EdanToledo? I can also look at stitching vaults together, etc.
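To make the arithmetic above concrete, here is a self-contained sketch of the "best effort" slicing. `mixed_sample` is a hypothetical name written for illustration, not the mixer's actual implementation:

```python
# Illustrative "best effort" mixing logic, not the mixer's actual code.
import jax.numpy as jnp

def mixed_sample(batches, proportions, sample_batch_size):
    """Slice each pre-sampled batch proportionally, taking what's available.

    batches: arrays shaped (batch_i, ...), as returned by each buffer's
        sample function. proportions: relative weights, e.g. [1, 1].
    """
    total = sum(proportions)
    parts = []
    for batch, p in zip(batches, proportions):
        requested = (sample_batch_size * p) // total
        # Best effort: never take more than the buffer actually returned.
        parts.append(batch[: min(requested, batch.shape[0])])
    return jnp.concatenate(parts, axis=0)

# Mirrors the example above: buffer_a returns (4, ...), buffer_b (16, ...).
batch_a, batch_b = jnp.zeros((4, 8)), jnp.ones((16, 8))
print(mixed_sample([batch_a, batch_b], [1, 1], 6).shape)   # (6, 8): 3 + 3
print(mixed_sample([batch_a, batch_b], [1, 1], 10).shape)  # (9, 8): 4 + 5
```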
See the example notebook: https://colab.research.google.com/github/instadeepai/flashbax/blob/feat/mixed_experience_replay/examples/mixer_demonstration.ipynb (obviously it won't run unless you pip install the branch version, or run it locally on the branch).