Conversation
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #8)
You can download the full evaluation outputs here. |
|
@openhands-agent Make the integration-runner workflow work also on schedule, a nightly schedule. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
b37602a to
0c22181
Compare
…nhands-fix-issue-5076
* Fix issue OpenHands#5076: Integration test github action * Update integration-runner.yml * Update integration-runner.yml * update variables * use haiku * use base url * fix report name * Fix pr #8: Integration tests (openhands fix issue 5076) * Revert "Fix pr #8: Integration tests (openhands fix issue 5076)" This reverts commit dcd4681. * Fix pr #8: Integration tests (openhands fix issue 5076) * use haiku explicitly, in results too * remove duplicate * Update .github/workflows/integration-runner.yml * Revert "Update .github/workflows/integration-runner.yml" This reverts commit 7e7200e. * funny space * Fix pr #8: Integration tests (openhands fix issue 5076) * artifact fix * clean up remote runtimes * clean up runtimes more aggressively - a bit unexpected though * Fix pr #8: Integration tests (openhands fix issue 5076) * fix type issue that was preventing checking results * try with waiting time * add eval notes * increase timeouts * try with CI local builds * fix eval output * set debug * fix tests! * fix outputs * keep details in logs, not github comment * tweak schedule * lint-y --------- Co-authored-by: openhands <openhands@all-hands.dev>
End-user friendly description of the problem this fixes or functionality that this introduces
Give a summary of what the PR does, explaining any non-trivial design decisions
Link of any specific issues this addresses