v0.11.1: Benchmark update
New features
- Set max steps to 30 in webarena / visualwenarena benchmarks #214
- Benchmark dependency graph utilities #220
- Include nltk.download() in prepare_backend() for webarena / visualwebarena benchmarks #224
Bugfixes
- Rename benchmark after subset_from_split() #221
- ExpArgs.exp_dir sanitization #222
- get_step_info() bugfix #223
Full Changelog: v0.11.0...v0.11.1