feat: add integration tests and cron job workflow#219
Conversation
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
@OpenHands please fix the failing actions on PR #219 at branch |
|
I'm on it! simonrosenberg can track my progress at all-hands.dev |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #219)
|
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #219)
|
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #219) Integration Tests Report - 6ba6805_sonnet_runSuccess rate: 100.00% (1/1) Total cost: USD 0.00 Test Results
Integration Tests Report (DeepSeek) Integration Tests Report - 6ba6805_deepseek_runSuccess rate: 0.00% (0/1) Total cost: USD 0.00 Test Results
Download testing outputs (includes both Claude Sonnet 4 and DeepSeek results): Download |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Trigger by: Pull Request (integration-test label on PR #219) Integration Tests Report - ce66b45_sonnet_runSuccess rate: 100.00% (1/1) Total cost: USD 0.00 Test Results
Integration Tests Report (DeepSeek) Integration Tests Report - ce66b45_deepseek_runSuccess rate: 0.00% (0/1) Total cost: USD 0.00 Test Results
Download testing outputs (includes both Claude Sonnet 4 and DeepSeek results): Download |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
2 similar comments
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Integration Tests ReportTrigger: Pull Request (integration-test label on PR #219) Test Results Summary
Detailed ResultsClaude Sonnet 4GPT-5 MiniDeepSeek ChatOverall Status: 3 models tested |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
2 similar comments
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
- Removed sys.path.insert() from run_infer.py - Both scripts now use clean global imports without path manipulation - Maintained clean import structure with format_cost from separate module - All imports work correctly with PYTHONPATH environment variable Co-authored-by: openhands <openhands@all-hands.dev>
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
2 similar comments
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Looks like there are a few issues preventing this PR from being merged!
If you'd like me to help, just leave a comment, like Feel free to include any additional details that might help me get this PR into a better state. You can manage your notification settings |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
2 similar comments
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
|
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Integration Tests ReportTrigger: Pull Request (integration-test label on PR #219) Test Results Summary
Detailed ResultsGPT-5 MiniDeepSeek ChatClaude Sonnet 4Overall Status: 3 models tested |
No description provided.