GitHub - facebookresearch/ReAgent: A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

ReAgent is officially archived and no longer maintained. For an actively supported, production-ready open-source reinforcement learning library, please refer to Pearl (Production-ready Reinforcement Learning AI Agent Library), by the Applied Reinforcement Learning team at Meta.

Overview

ReAgent is an open source end-to-end platform for applied reinforcement learning (RL) developed and used at Facebook. ReAgent is built in Python and uses PyTorch for modeling and training and TorchScript for model serving. The platform contains workflows to train popular deep RL algorithms and includes data preprocessing, feature transformation, distributed training, counterfactual policy evaluation, and optimized serving. For more detailed information about ReAgent see the release post here and white paper here.

The platform was once named "Horizon" but we have adopted the name "ReAgent" recently to emphasize its broader scope in decision making and reasoning.

Algorithms Supported

Classic Off-Policy algorithms:

Discrete-Action DQN
Parametric-Action DQN
Double DQN, Dueling DQN, Dueling Double DQN
Distributional RL: C51 and QR-DQN
Twin Delayed DDPG (TD3)
Soft Actor-Critic (SAC)
Critic Regularized Regression (CRR)
Proximal Policy Optimization Algorithms (PPO)

RL for recommender systems:

Counterfactual Evaluation:

Doubly Robust (for bandits)
Doubly Robust (for sequential decisions)
MAGIC
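
As an illustration of the first estimator in this list, the sketch below computes a doubly robust value estimate for a bandit problem in plain Python. It is not ReAgent's API: the function name `doubly_robust_value`, the tuple layout of the logged data, and the `target_policy` and `reward_model` callables are all assumptions made for this example.

```python
def doubly_robust_value(logged, target_policy, reward_model):
    """Doubly robust estimate of a target policy's average reward.

    logged:        list of (context, action, reward, logging_prob) tuples,
                   where logging_prob is the probability the logging policy
                   assigned to the action it actually took (hypothetical layout).
    target_policy: callable, context -> {action: probability}.
    reward_model:  callable, (context, action) -> predicted reward.
    """
    total = 0.0
    for context, action, reward, logging_prob in logged:
        target_probs = target_policy(context)
        # Direct-method term: model-predicted reward under the target policy.
        direct = sum(p * reward_model(context, a) for a, p in target_probs.items())
        # Importance-weighted correction of the model's error on the logged action.
        weight = target_probs.get(action, 0.0) / logging_prob
        total += direct + weight * (reward - reward_model(context, action))
    return total / len(logged)


# Toy data: a uniform logging policy over two actions; action 1 always pays 1.
logged = [("x", 0, 0.0, 0.5), ("x", 1, 1.0, 0.5)]
always_one = lambda context: {0: 0.0, 1: 1.0}
zero_model = lambda context, action: 0.0
perfect_model = lambda context, action: float(action)

estimate_with_zero_model = doubly_robust_value(logged, always_one, zero_model)
estimate_with_perfect_model = doubly_robust_value(logged, always_one, perfect_model)
```

With a reward model that always predicts zero, the estimate reduces to plain importance sampling; with a perfect reward model, the correction term vanishes. The estimator is called doubly robust because it stays unbiased if either the reward model or the logging probabilities are accurate.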

Multi-Arm and Contextual Bandits:

Others:

Installation

ReAgent can be installed via Docker or manually. Detailed instructions on how to install ReAgent can be found here.

Tutorial

ReAgent is designed for large-scale, distributed recommendation/optimization tasks where we don’t have access to a simulator. In this setting, it is typically better to train offline on batches of logged data and release new policies slowly over time. Because the policy is updated slowly and in batches, we use off-policy algorithms. To test a new policy without deploying it, we rely on counterfactual policy evaluation (CPE), a set of techniques for estimating the performance of one policy from data logged by another policy.
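
To make the idea of CPE concrete, here is a minimal sketch of inverse propensity scoring (IPS), one of the simplest estimators in this family. It is an illustration only, not ReAgent's implementation: the function name `ips_value`, the tuple layout of the logged data, and the `target_prob` callable are assumptions made for this example.

```python
def ips_value(logged, target_prob):
    """Inverse propensity scoring estimate of a target policy's average reward.

    logged:      list of (context, action, reward, logging_prob) tuples,
                 where logging_prob is the probability with which the logging
                 policy chose the recorded action (hypothetical layout).
    target_prob: callable, (context, action) -> probability that the new
                 policy would choose that action in that context.
    """
    total = 0.0
    for context, action, reward, logging_prob in logged:
        # Reweight each logged reward by how much more (or less) likely
        # the new policy is to take the logged action.
        total += target_prob(context, action) / logging_prob * reward
    return total / len(logged)


# Toy log from a stochastic logging policy over actions "a" and "b".
logged = [
    ("u1", "a", 1.0, 0.5),
    ("u1", "b", 0.0, 0.5),
    ("u2", "a", 0.0, 0.25),
    ("u2", "b", 1.0, 0.75),
]

# Candidate policy that always picks "a".
always_a = lambda context, action: 1.0 if action == "a" else 0.0
estimate = ips_value(logged, always_a)
```

The estimate relies on the logging policy having nonzero probability for every action the new policy might take; when logging probabilities are small, the weights become large and the estimate has high variance, which is one motivation for estimators such as Doubly Robust.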

We also have a set of tools to facilitate applying RL in real-world applications:

Domain Analysis Tool, which analyzes state/action feature importance and identifies whether the problem is suitable for applying batch RL
Behavior Cloning, which clones from the logging policy to bootstrap the learning policy safely
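
The idea behind behavior cloning can be sketched in a few lines for a small discrete problem: estimate the logging policy from logged state/action pairs and use it as the starting point for the learning policy. This is a tabular simplification with hypothetical names (`clone_logging_policy`, the `(state, action)` pair layout); it does not reflect how ReAgent's tool is implemented.

```python
from collections import Counter, defaultdict


def clone_logging_policy(logged_pairs):
    """Estimate a logging policy from (state, action) pairs by counting.

    Returns {state: {action: empirical probability}} for every state seen
    in the log (hypothetical data layout, tabular case only).
    """
    counts = defaultdict(Counter)
    for state, action in logged_pairs:
        counts[state][action] += 1

    policy = {}
    for state, action_counts in counts.items():
        total = sum(action_counts.values())
        # Normalize counts into a probability distribution over actions.
        policy[state] = {a: c / total for a, c in action_counts.items()}
    return policy


logged_pairs = [("s0", "left"), ("s0", "left"), ("s0", "right"), ("s1", "right")]
cloned = clone_logging_policy(logged_pairs)
```

Starting from a cloned policy keeps the initial behavior close to what was already running in production, which is the sense in which the bootstrap is safe.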

Detailed instructions on how to use ReAgent can be found here.

License

ReAgent is released under a BSD 3-Clause license. Find out more about it here.

Citing

@article{gauci2018horizon,
  title={Horizon: Facebook's Open Source Applied Reinforcement Learning Platform},
  author={Gauci, Jason and Conti, Edoardo and Liang, Yitao and Virochsiri, Kittipat and Chen, Zhengxing and He, Yuchen and Kaden, Zachary and Narayanan, Vivek and Ye, Xiaohui},
  journal={arXiv preprint arXiv:1811.00260},
  year={2018}
}
