Hello,
I started working with the provided implementation recently, thank you for sharing it. Just wanted to know why we require access to env for offline training . Do we need to make changes in code for it, if we have expert data but no access to env for training ?