Implement judicious skipping of CI jobs #94

vyasr · 2024-08-21T16:24:25Z

Currently any change to any RAPIDS repo triggers a complete run of the entire build and test suite of CI jobs. This is very expensive, and is often unnecessary. RAPIDS libraries are typically structured such that they have a clear, linear dependency chain between different components. Some examples:

Changes to Python code should have no effect whatsoever on C++ testing
For repositories that publish multiple Python packages, there is typically a linear dependency between at least some of these such that changes to downstream packages has no effect on the tests of upstream package (e.g. changes to cugraph would not affect pylibcugraph tests).
Changes to documentation should only require documentation rebuilds, no test runs.

We can reduce the number of unnecessary jobs that we run in CI by more judiciously skipping jobs that are unnecessary. The simplest approach to do this is by simply checking what files have changed. We have previously implemented a form of this in cudf for both cudf.pandas and now for cudf-polars. To do this, I would propose the following steps:

Generalizing the above logic for detecting changes into a shared workflow
Enable filtering jobs using the above shared workflow
Generalize the pr-builder job to allow some jobs to be skipped under appropriate circumstances.

KyleFromNVIDIA · 2024-08-21T22:13:02Z

pytest-incremental may help us with skipping unneeded Python tests:

https://pypi.org/project/pytest-incremental/

It hasn't been updated in a while though, it may need some work.

Only run tests based on things that have actually changed. For example, if only Python files have changed, we don't need to run the C++ tests. Contributes to rapidsai/build-planning#94

Only run tests based on things that have actually changed. For example, if only Python files have changed, we don't need to run the C++ tests. Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - Bradley Dice (https://github.com/bdice) - Vyas Ramasubramani (https://github.com/vyasr) - Robert Maynard (https://github.com/robertmaynard) URL: #16642

Only run tests based on things that have actually changed. For example, if only Python files have changed, we don't need to run the C++ tests. Contributes to rapidsai/build-planning#94

Contributes to rapidsai/build-planning#94 Depends on rapidsai/shared-workflows#239 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) - GALI PREM SAGAR (https://github.com/galipremsagar) URL: #16713

Only run tests based on things that have actually changed. For example, if only Python files have changed, we don't need to run the C++ tests. Contributes to rapidsai/build-planning#94 Depends on rapidsai/shared-workflows#239 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - Robert Maynard (https://github.com/robertmaynard) - Jake Awe (https://github.com/AyodeAwe) URL: #4634

vyasr · 2024-10-01T18:30:18Z

@KyleFromNVIDIA now that we have a common shared workflow and about a month of data on how pruning the jobs works, do we want to roll this feature out to the rest of RAPIDS and then close this issue?

Contributes to rapidsai/build-planning#94

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #6094

Contributes to rapidsai/build-planning#94

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #2466

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #392

Contributes to rapidsai/build-planning#94

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #1695

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #635

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #226

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #296

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #489

jakirkham · 2024-10-10T21:37:05Z

In some cases we want to run all CI. Is there a good way to bypass this selection behavior?

KyleFromNVIDIA · 2024-10-11T13:53:55Z

Please describe a scenario in which you want to run a CI job even though the relevant files haven't changed. We can look at ways to force it to run.

In the meantime, as a workaround, you can add a comment to a relevant file to force CI to run, then remove it again once CI has passed.

KyleFromNVIDIA · 2024-10-11T14:49:53Z

We could create a new label called "Force CI Run". Then the condition could be something like this:

if: fromJSON(needs.changed-files.outputs.changed_file_groups).test_cpp || fromJSON(needs.pr-info.outputs.pr-info).labels.*.name == "Force CI Run"

Or we could even modify the changed-files workflow with a new output to give us the "force CI run" status:

if: fromJSON(needs.changed-files.outputs.changed_file_groups).test_cpp || needs.changed-files.outputs.force_ci_run

If the label is initially not provided, and is added later, we can then trigger a new workflow run with the label applied by:

git commit --allow-empty -m 'Re-run Ci'
git push

jameslamb · 2024-10-11T15:04:00Z

My perspective: if the main goal is just to support PRs like rapidsai/pynvjitlink#107 of the form "I just want to see if a CI issue is unrelated to my code changes", I'd prefer just adding a comment to a central file (like a C++ source file) instead of adding complexity to the changed-files mechanism.

KyleFromNVIDIA · 2024-10-11T15:16:30Z

I've opened rapidsai/shared-workflows#249 and rapidsai/cudf#17064 to test out the approach outlined at #94 (comment).

KyleFromNVIDIA · 2024-10-11T15:23:17Z

The above POC works, but if @jameslamb is correct then it may be better to just modify a C++ file with a comment to force the run. @jakirkham WDYT?

vyasr · 2024-10-11T16:13:18Z

I made the same assumption as James regarding the purpose of the request and come to the same conclusion. I would prefer to discourage using CI as a way to "just run tests", which is one possible use case. If we are specifically testing the behavior of CI (e.g. CI fails but locally tests pass), then adding a trivial change is not a very high barrier. Alternatively, instead of using PR CI you can use either build or test workflows. Those both have workflow triggers available so they can be run directly from the Actions tab, no PR needed. A special label feels like extra machinery that we don't need.

jameslamb · 2024-10-22T18:02:44Z

@KyleFromNVIDIA what is left for this? It's difficult for me to tell which repos we still haven't made these changes for.

KyleFromNVIDIA · 2024-10-22T20:10:32Z

Looks like cuspatial has not yet been done. I'll go work on that now. I'm not sure if there's anything else. The way to tell is to look at .github/workflows/pr.yaml and see if there is a changed-files workflow.

Contributes to rapidsai/build-planning#94

jameslamb · 2024-10-23T14:15:35Z

The way to tell is to look at .github/workflows/pr.yaml and see if there is a changed-files workflow.

Ok, I was hoping you had a list of what remained and it just hadn't made it into this issue yet. I'm not planning to go through all the repos looking for that workflow, I trust your estimation of what's left.

Contributes to rapidsai/build-planning#94 Authors: - Kyle Edwards (https://github.com/KyleFromNVIDIA) Approvers: - James Lamb (https://github.com/jameslamb) URL: #1479

jameslamb · 2024-11-07T13:48:34Z

@KyleFromNVIDIA could you please do one more check like the one you mentioned in #94 (comment), and then close this issue if you think it's done?

vyasr mentioned this issue Aug 21, 2024

[Meta] Reduce CI runtimes #95

Open

KyleFromNVIDIA mentioned this issue Aug 22, 2024

Prune workflows based on changed files rapidsai/cudf#16642

Merged

3 tasks

This was referenced Aug 27, 2024

Prune workflows based on changed files rapidsai/cugraph#4634

Merged

Add changed-files workflow rapidsai/shared-workflows#239

Merged

Use changed-files shared workflow rapidsai/cudf#16713

Merged

jameslamb assigned KyleFromNVIDIA Sep 24, 2024

KyleFromNVIDIA added a commit to KyleFromNVIDIA/cuml that referenced this issue Oct 3, 2024

Prune workflows based on changed files

1203698

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 3, 2024

Prune workflows based on changed files rapidsai/cuml#6094

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/raft that referenced this issue Oct 3, 2024

Prune workflows based on changed files

f98db12

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 3, 2024

Prune workflows based on changed files rapidsai/raft#2466

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/kvikio that referenced this issue Oct 3, 2024

Prune workflows based on changed files

bbb1054

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 3, 2024

Prune workflows based on changed files rapidsai/kvikio#489

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/cuvs that referenced this issue Oct 3, 2024

Prune workflows based on changed files

f4ee66e

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 3, 2024

Prune workflows based on changed files rapidsai/cuvs#392

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/ucxx that referenced this issue Oct 4, 2024

Prune workflows based on changed files

a249ef6

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 4, 2024

Prune workflows based on changed files rapidsai/ucxx#296

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/cuxfilter that referenced this issue Oct 4, 2024

Prune workflows based on changed files

717894a

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 4, 2024

Prune workflows based on changed files rapidsai/cuxfilter#635

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/rmm that referenced this issue Oct 4, 2024

Prune workflows based on changed files

0c1e79a

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 4, 2024

Prune workflows based on changed files rapidsai/rmm#1695

Merged

3 tasks

KyleFromNVIDIA added a commit to KyleFromNVIDIA/wholegraph that referenced this issue Oct 4, 2024

Prune workflows based on changed files

26b302d

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 4, 2024

Prune workflows based on changed files rapidsai/wholegraph#226

Merged

KyleFromNVIDIA added a commit to KyleFromNVIDIA/cuspatial that referenced this issue Oct 22, 2024

Prune workflows based on changed files

f205bab

Contributes to rapidsai/build-planning#94

KyleFromNVIDIA mentioned this issue Oct 22, 2024

Prune workflows based on changed files rapidsai/cuspatial#1479

Merged

3 tasks

jameslamb closed this as completed Nov 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement judicious skipping of CI jobs #94

Implement judicious skipping of CI jobs #94

vyasr commented Aug 21, 2024

KyleFromNVIDIA commented Aug 21, 2024

vyasr commented Oct 1, 2024

jakirkham commented Oct 10, 2024

KyleFromNVIDIA commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

jameslamb commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

vyasr commented Oct 11, 2024

jameslamb commented Oct 22, 2024

KyleFromNVIDIA commented Oct 22, 2024

jameslamb commented Oct 23, 2024 •

edited

Loading

jameslamb commented Nov 7, 2024

Implement judicious skipping of CI jobs #94

Implement judicious skipping of CI jobs #94

Comments

vyasr commented Aug 21, 2024

KyleFromNVIDIA commented Aug 21, 2024

vyasr commented Oct 1, 2024

jakirkham commented Oct 10, 2024

KyleFromNVIDIA commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

jameslamb commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

KyleFromNVIDIA commented Oct 11, 2024

vyasr commented Oct 11, 2024

jameslamb commented Oct 22, 2024

KyleFromNVIDIA commented Oct 22, 2024

jameslamb commented Oct 23, 2024 • edited Loading

jameslamb commented Nov 7, 2024

jameslamb commented Oct 23, 2024 •

edited

Loading