Skip to content

Actions: ServiceNow/BrowserGym

Build and Publish

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
738 workflow runs
738 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

paper-specific action subsets
Build and Publish #638: Commit a164f12 pushed by gasse
October 23, 2024 16:51 37s benchmark_action_sets
October 23, 2024 16:51 37s
renaming
Build and Publish #637: Commit b750132 pushed by gasse
October 23, 2024 15:42 37s workarena-downstream-tests
October 23, 2024 15:42 37s
README update (#200)
Build and Publish #636: Commit 801fc40 pushed by gasse
October 23, 2024 14:52 35s main
October 23, 2024 14:52 35s
README update
Build and Publish #635: Commit 9c532b2 pushed by gasse
October 23, 2024 14:51 34s gasse/patch_52
October 23, 2024 14:51 34s
version bump 0.10.0
Build and Publish #634: Commit e430bb8 pushed by gasse
October 23, 2024 14:48 2m 7s v0.10.0
October 23, 2024 14:48 2m 7s
version bump 0.10.0
Build and Publish #633: Commit e430bb8 pushed by gasse
October 23, 2024 14:48 37s main
October 23, 2024 14:48 37s
Benchmark updates (#199)
Build and Publish #632: Commit fd1792c pushed by gasse
October 23, 2024 14:46 32s main
October 23, 2024 14:46 32s
fixes
Build and Publish #631: Commit 527a45d pushed by gasse
October 23, 2024 14:42 40s workarena_action_set
October 23, 2024 14:42 40s
becnhmark code split refactor
Build and Publish #630: Commit fe219bb pushed by gasse
October 23, 2024 14:38 37s workarena_action_set
October 23, 2024 14:38 37s
Merge branch 'main' into workarena_action_set
Build and Publish #629: Commit 73cdb19 pushed by ThibaultLSDC
October 23, 2024 14:27 34s workarena_action_set
October 23, 2024 14:27 34s
Fixing logging with multiple jobs (#182)
Build and Publish #628: Commit c7f77ba pushed by gasse
October 23, 2024 14:24 39s main
October 23, 2024 14:24 39s
Default browsergym_split metadata for every benchmark (#190)
Build and Publish #627: Commit f25bdcd pushed by gasse
October 23, 2024 14:04 36s main
October 23, 2024 14:04 36s
updating the workarena L2/L3 action set
Build and Publish #626: Commit 7216796 pushed by ThibaultLSDC
October 23, 2024 13:55 36s workarena_action_set
October 23, 2024 13:55 36s
Added splits for workarena-l1
Build and Publish #625: Commit 11fd62e pushed by Megh-Thakkar
October 22, 2024 23:34 34s benchmarks
October 22, 2024 23:34 34s
eval -> valid split
Build and Publish #624: Commit 6d0391a pushed by gasse
October 22, 2024 21:22 39s benchmarks
October 22, 2024 21:22 39s
miniwob splits
Build and Publish #623: Commit 82a5c3a pushed by gasse
October 22, 2024 20:54 37s benchmarks
October 22, 2024 20:54 37s
added train/test splits for wa/vwa (#192)
Build and Publish #622: Commit 6a1644e pushed by gasse
October 22, 2024 13:45 34s benchmarks
October 22, 2024 13:45 34s
New benchmark AssistantBench (#186)
Build and Publish #621: Commit 78222aa pushed by gasse
October 22, 2024 13:44 39s main
October 22, 2024 13:44 39s
added train/test splits for wa/vwa (#192)
Build and Publish #620: Commit 95a0934 pushed by gasse
October 21, 2024 18:59 29s benchmarks
October 21, 2024 18:59 29s
changes to col names
Build and Publish #619: Commit a91d321 pushed by gasse
October 21, 2024 18:59 38s newsplits
October 21, 2024 18:59 38s
Benchmarks update (#197)
Build and Publish #618: Commit 994ce59 pushed by gasse
October 21, 2024 16:31 32s main
October 21, 2024 16:31 32s
Reverting workarena_l1 benchmark to original seed sampling (#198)
Build and Publish #617: Commit 32796ca pushed by gasse
October 21, 2024 16:30 36s main
October 21, 2024 16:30 36s
test fix
Build and Publish #616: Commit 99d0f11 pushed by gasse
October 21, 2024 16:28 32s fix_miniwob
October 21, 2024 16:28 32s
less benchmark variants
Build and Publish #615: Commit 200aeff pushed by gasse
October 21, 2024 16:25 36s fix_miniwob
October 21, 2024 16:25 36s
updating assertions
Build and Publish #614: Commit 4d2e2b7 pushed by ThibaultLSDC
October 21, 2024 15:00 42s workarena_l1_sampling
October 21, 2024 15:00 42s