[Core] Update inference/core for vllm-v0.7.2 #503
Triggered via pull request on February 13, 2025 14:21
Status: Failure
Total duration: 16h 29m 49s
Artifacts: –
all-tests.yml
on: pull_request
flagscale-report-clean / clean-report: 0s
megatron-report-clean / clean-report: 26s
Matrix: flagscale-unit-tests
Waiting for pending jobs
Matrix: megatron-unit-tests
Matrix: functional-tests-train
Waiting for pending jobs
Matrix: functional-tests-hetero
Waiting for pending jobs
Matrix: functional-tests-serve
Waiting for pending jobs
flagscale-coverage-test / test-coverage
megatron-coverage-test / test-coverage
all-tests: 0s
Annotations
23 errors and 1 warning
flagscale-report-clean / clean-report
The self-hosted runner: p-phy-dgx-a100-node-prod-038 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.

megatron-data / unit-test
ambiguous argument 'HEAD': unknown revision or path not in the working tree.

megatron-data / unit-test
RPC failed; curl 92 HTTP/2 stream 0 was not closed cleanly: CANCEL (err 8)

megatron-data / unit-test
5 bytes of body are still expected

megatron-data / unit-test
early EOF

megatron-data / unit-test
fetch-pack: invalid index-pack output

megatron-data / unit-test
RPC failed; curl 92 HTTP/2 stream 0 was not closed cleanly: CANCEL (err 8)

megatron-data / unit-test
6333 bytes of body are still expected

megatron-data / unit-test
early EOF

megatron-data / unit-test
fetch-pack: invalid index-pack output

megatron-data / unit-test
RPC failed; curl 92 HTTP/2 stream 0 was not closed cleanly: CANCEL (err 8)

megatron-export / unit-test
FailFast: cancelling since parallel instance has failed

megatron-dist_checkpointing / unit-test
FailFast: cancelling since parallel instance has failed

megatron-distributed / unit-test
FailFast: cancelling since parallel instance has failed

megatron-fusions / unit-test
FailFast: cancelling since parallel instance has failed

megatron-pipeline_parallel / unit-test
FailFast: cancelling since parallel instance has failed

megatron-ssm / unit-test
FailFast: cancelling since parallel instance has failed

megatron-models / unit-test
FailFast: cancelling since parallel instance has failed

megatron-tensor_parallel / unit-test
FailFast: cancelling since parallel instance has failed

megatron-transformer / unit-test
FailFast: cancelling since parallel instance has failed

megatron-transformer/moe / unit-test
FailFast: cancelling since parallel instance has failed

megatron-root / unit-test
FailFast: cancelling since parallel instance has failed

megatron-root / unit-test
The operation was canceled.

megatron-data / unit-test
Unable to clean or reset the repository. The repository will be recreated instead.