v0.11.1: Benchmark update

github-actions released this 30 Oct 19:29
· 16 commits to main since this release

New features

  • Set max steps to 30 in webarena / visualwebarena benchmarks #214
  • Benchmark dependency graph utilities #220
  • Include nltk.download() in prepare_backend() for webarena / visualwebarena benchmarks #224

Bugfixes

  • Rename benchmark after subset_from_split() #221
  • ExpArgs.exp_dir sanitization #222
  • get_step_info() bugfix #223
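
To illustrate the kind of problem the `ExpArgs.exp_dir` sanitization item addresses, here is a minimal sketch of making a string safe to use as a directory name. This is not the code from #222: the helper name `sanitize_dir_name` and its replacement rules are assumptions made for this example only.

```python
import re


def sanitize_dir_name(name: str, replacement: str = "_") -> str:
    """Hypothetical helper (not part of the library): make a string safe
    to use as a single directory name."""
    # Collapse every run of characters other than letters, digits,
    # dot, dash, and underscore into one replacement string.
    cleaned = re.sub(r"[^A-Za-z0-9._-]+", replacement, name)
    # Trim leading/trailing replacement characters and dots so the result
    # is not a hidden directory or a name like "." or "..".
    cleaned = cleaned.strip(replacement + ".")
    # Fall back to a fixed placeholder if nothing usable is left.
    return cleaned or "unnamed"
```

For example, `sanitize_dir_name("webarena/task 1:gpt-4")` returns `"webarena_task_1_gpt-4"`, so a task or model name containing a path separator cannot create unintended nested directories.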

Full Changelog: v0.11.0...v0.11.1