Actions: intelligent-machine-learning/dlrover

CI

2,642 workflow runs

Skip memory limitation for gpu type node relaunch operation. (#1341)
CI #4231: Commit d2ea4a7 pushed by BalaBalaYi
November 18, 2024 06:51 12m 28s master
Expose ckpt events
CI #4230: Pull request #1321 synchronize by samplise
November 18, 2024 06:18 12m 4s samplise:expose-ckpt-events
Expose ckpt events
CI #4229: Pull request #1321 synchronize by samplise
November 18, 2024 05:36 12m 30s samplise:expose-ckpt-events
Expose ckpt events
CI #4228: Pull request #1321 synchronize by samplise
November 18, 2024 04:53 12m 29s samplise:expose-ckpt-events
update pip to pip3 (#1336)
CI #4227: Commit 4dc55c5 pushed by samplise
November 18, 2024 04:09 12m 6s master
fix a bug in infer method
CI #4224: Pull request #1340 opened by jlsong01
November 16, 2024 16:14 12m 24s jlsong01:bugfix_for_InferenceChain
Fix the issue that len(indices) and num_samples might not be equal
CI #4223: Pull request #1339 opened by sunjq1
November 15, 2024 12:47 Action required sunjq1:fix_sampler
Update build_proto.sh to use pip3 instead of pip
CI #4222: Pull request #1336 synchronize by jinqinn
November 15, 2024 06:07 12m 24s jinqinn:update-pip-to-pip3
WIP: handle GPU lost in resource monitor
CI #4220: Pull request #1335 synchronize by samplise
November 15, 2024 03:18 12m 41s samplise:report-gpu-lost
WIP: handle GPU lost in resource monitor
CI #4219: Pull request #1335 synchronize by samplise
November 15, 2024 02:58 12m 37s samplise:report-gpu-lost
WIP: handle GPU lost in resource monitor
CI #4218: Pull request #1335 synchronize by samplise
November 15, 2024 01:15 2m 37s samplise:report-gpu-lost
Fix diagnosis agent action consuming (#1334)
CI #4217: Commit ec94ab6 pushed by BalaBalaYi
November 14, 2024 11:23 12m 15s master
fix process leak in ascend npu (#1331)
CI #4216: Commit 24bf1e8 pushed by BalaBalaYi
November 14, 2024 11:03 12m 35s master
fix process leak in ascend npu
CI #4215: Pull request #1331 synchronize by majieyue
November 14, 2024 10:09 12m 6s majieyue:fix-subprocess-leak-in-npu
fix process leak in ascend npu
CI #4214: Pull request #1331 synchronize by majieyue
November 14, 2024 09:12 12m 23s majieyue:fix-subprocess-leak-in-npu
fix process leak in ascend npu
CI #4212: Pull request #1331 synchronize by majieyue
November 14, 2024 08:29 12m 19s majieyue:fix-subprocess-leak-in-npu
fix process leak in ascend npu
CI #4210: Pull request #1331 synchronize by majieyue
November 14, 2024 07:07 12m 14s majieyue:fix-subprocess-leak-in-npu
fix process leak in ascend npu
CI #4209: Pull request #1331 synchronize by majieyue
November 14, 2024 04:22 12m 33s majieyue:fix-subprocess-leak-in-npu
Training hang detection based on XPU Timer metric. (#1288)
CI #4208: Commit 07b18ac pushed by samplise
November 13, 2024 22:22 8m 15s master
Expose ckpt events
CI #4207: Pull request #1321 synchronize by samplise
November 13, 2024 20:20 8m 24s samplise:expose-ckpt-events