Code from the paper "Reinforcement learning for scaffold free construction of spanning structures" adapted for the Semester Project "Reinforcement Learning from Human Feedback for the Construction of Spanning Structures," taking inspiration from the imitation library.
The top-level file that runs the RLHF algorithm is called rlhf_main.py
. The hyperparameters of the run are all set at the top of this file.