Adds a CI/CD workflow to detect performance regression. #264

cheqianh · 2023-04-26T21:52:09Z

Description

This PR enables GitHub Actions to detect performance regression using ion-python-benchmark-cli.

This PR adds:

* a --results-file option to specify the benchmark results destination,
* a compare command to compare perf results between the previous commit and the current one, and
* a CI/CD workflow to detect performance regression.

Important Notes

The threshold is set to 1 for now and should be tuned later. Refer to ion-java-benchmark-cli#53 for details.
Currently the workflow compares the new commit with itself, because the main branch doesn't yet support features such as compare and output-results. The target branch needs to be changed back to the main branch here after this PR is merged.

Details

The `--results-file` Option

The --results-file option writes the benchmark results to the specified destination in Ion. Without it, the tool prints the result table to stdout. For example, running

python amazon/ionbenchmark/ion_benchmark_cli.py read -o my_output tests/benchmark_sample_data/integers.ion

writes the following result to the file my_output:

[{file_name:"integers.ion",'file_size (MB)':"9.54e-07",command:"read",options:["load_dump","ion_binary","file"],'total_time (s)':"2.17e-04",'memory_usage_peak (MB)':"4.43e-02"}]
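Note that the numeric fields in this record are stored as strings. As a minimal sketch of how a consumer might turn them into numbers, the snippet below assumes the record has already been loaded into a plain Python dict (the loading step itself is not shown, and `numeric_view` and `NUMERIC_FIELDS` are hypothetical names, not part of the CLI):

```python
# Assumption: the Ion record above, already loaded into a plain dict.
record = {
    "file_name": "integers.ion",
    "file_size (MB)": "9.54e-07",
    "command": "read",
    "options": ["load_dump", "ion_binary", "file"],
    "total_time (s)": "2.17e-04",
    "memory_usage_peak (MB)": "4.43e-02",
}

# Fields whose values are numbers encoded as strings in the results file.
NUMERIC_FIELDS = ("file_size (MB)", "total_time (s)", "memory_usage_peak (MB)")


def numeric_view(rec):
    """Return only the numeric fields, converted from strings to floats."""
    return {name: float(rec[name]) for name in NUMERIC_FIELDS}


metrics = numeric_view(record)
print(metrics)
```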

The `compare` command

The compare command compares the benchmark results of two commits and identifies whether a regression has occurred. The comparison is written to the specified destination.

For example, run the command below. Both previous_result and new_result are produced like my_output in the section above, except that the benchmark is also run with an additional --format ion_text.

python amazon/ionbenchmark/ion_benchmark_cli.py compare --benchmark-result-previous previous_result --benchmark-result-new new_result my_compare_result

The result is written to my_compare_result (pretty-printed here for readability):

$ion_1_0 [
    {input:"integers.ion",
    command:"read",
    options:["load_dump","ion_binary","file"],
    relative_difference_score:{'total_time (s)':-0.20599999999999998868e+0,'file_size (MB)':0e0}
    },
    {input:"integers.ion",
    command:"read",
    options:["load_dump","ion_text","file"],
    relative_difference_score:{'total_time (s)':-0.31395949896868e+0,'file_size (MB)':0e0}
    },
]

Both scores (roughly -0.2 and -0.3) are smaller than the threshold of +0.6, so no regression is detected.
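The check described here can be sketched in a few lines of Python. This is an illustration only: the scores are copied (rounded) from the comparison result above, `REGRESSION_THRESHOLD` takes the 0.6 value quoted in this example, and `find_regressions` is a hypothetical helper rather than the function in ion_benchmark_cli.py.

```python
REGRESSION_THRESHOLD = 0.6  # value in effect for this example

# Scores copied (rounded) from the comparison result above.
comparison = [
    {
        "input": "integers.ion",
        "command": "read",
        "options": ["load_dump", "ion_binary", "file"],
        "relative_difference_score": {"total_time (s)": -0.206, "file_size (MB)": 0.0},
    },
    {
        "input": "integers.ion",
        "command": "read",
        "options": ["load_dump", "ion_text", "file"],
        "relative_difference_score": {"total_time (s)": -0.314, "file_size (MB)": 0.0},
    },
]


def find_regressions(results, threshold):
    """Return (options, field, score) for every score above the threshold."""
    flagged = []
    for entry in results:
        for field, score in entry["relative_difference_score"].items():
            if score > threshold:
                flagged.append((tuple(entry["options"]), field, score))
    return flagged


regressions = find_regressions(comparison, REGRESSION_THRESHOLD)
print("regression detected" if regressions else "no regression detected")
```

Negative scores mean the new run was faster, so only positive scores above the threshold are flagged.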

The CI/CD workflow

The workflow generates sample data, benchmarks the write/read performance of both the previous and the new commit, compares the results, and identifies any regressions.

Example Output

A good example workflow including both ✅ and ❌ summaries.

The detailed pipeline log when a regression is detected, after downloading the benchmark results:

$ion_1_0 [{input:"/home/runner/work/ion-python/ion-python/testData/testSexps.10n",command:"write",options:["load_dump","ion_text","file"],relative_difference_score:{'file_size (MB)':0.0e0,'total_time (s)':0.09815950920245405e0}},
{input:"/home/runner/work/ion-python/ion-python/testData/testSexps.10n",command:"write",options:["load_dump","ion_text","buffer"],relative_difference_score:{'file_size (MB)':0.0e0,'total_time (s)':-0.1616766467065868e0}},
{input:"/home/runner/work/ion-python/ion-python/testData/testSexps.10n",command:"write",options:["load_dump","ion_binary","file"],relative_difference_score:{'file_size (MB)':0.0e0,'total_time (s)':0.07633587786259535e0}},
{input:"/home/runner/work/ion-python/ion-python/testData/testSexps.10n",command:"write",options:["load_dump","ion_binary","buffer"],relative_difference_score:{'file_size (MB)':0.0e0,'total_time (s)':0.6116504854368932e0}}
]

We can see that the relative execution time difference for command:"write",options:["load_dump","ion_binary","buffer"] exceeds the threshold (0.6 at the time), so the workflow failed. Note that the current commit is compared against itself, so nothing actually affected the performance. I therefore increased the threshold to 1.
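To illustrate how much run-to-run noise this self-comparison shows, the sketch below takes the four 'total_time (s)' scores from the log above and lists which ones each threshold would flag. The dictionary keys and the `flagged` helper are hypothetical names chosen for this example.

```python
# 'total_time (s)' scores from the log above, keyed by (format, target).
scores = {
    ("ion_text", "file"): 0.09815950920245405,
    ("ion_text", "buffer"): -0.1616766467065868,
    ("ion_binary", "file"): 0.07633587786259535,
    ("ion_binary", "buffer"): 0.6116504854368932,
}


def flagged(all_scores, threshold):
    """Return the keys whose score is strictly above the threshold."""
    return sorted(key for key, value in all_scores.items() if value > threshold)


for threshold in (0.6, 1.0):
    print(f"threshold {threshold}: flagged {flagged(scores, threshold)}")

# Spread between the best and worst score for identical code.
spread = max(scores.values()) - min(scores.values())
print(f"spread across identical commits: {spread:.3f}")
```

With identical code on both sides, any non-zero score is measurement noise, which is why a threshold of 0.6 produced a false failure here.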

Recommended Review Order

I recommend starting with the GitHub Actions workflow to see the big picture, then looking into the benchmark-cli implementation for the technical details.

Test

See the CI/CD results below. I will create a new PR to change the comparison target back to the main branch.

.github/workflows/performance-regression.yml

amazon/ionbenchmark/ion_benchmark_cli.py

tgregg · 2023-05-01T23:38:50Z

amazon/ionbenchmark/ion_benchmark_cli.py

+        for field in relative_difference_score:
+            value_diff = relative_difference_score[field]
+            # TODO simply set the threshold to 1. Need optimization.
+            if value_diff > REGRESSION_THRESHOLD:


Is 0 a safer value to choose in the meantime? To exceed 1, wouldn't the new version have to be twice as slow?
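To make the arithmetic behind this question concrete, here is a small sketch. It assumes the score is computed as (new - previous) / previous, which is consistent with the comment above but is not shown in the diff excerpt; `relative_difference` and the timings are hypothetical.

```python
def relative_difference(previous, new):
    """Assumed definition of the score: (new - previous) / previous."""
    return (new - previous) / previous


previous_time = 1.0  # hypothetical timing in seconds

for factor in (1.05, 1.5, 2.0, 2.5):
    score = relative_difference(previous_time, previous_time * factor)
    print(
        f"{factor}x the previous time: score={score:.2f}, "
        f"exceeds 0: {score > 0}, exceeds 1: {score > 1}"
    )
```

Under that assumption, a threshold of 1 is only exceeded when the new version takes more than twice as long, while a threshold of 0 flags any slowdown at all, including noise.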

tests/test_benchmark_cli.py

.github/workflows/performance-regression.yml

cheqianh · 2023-05-03T23:32:03Z

The above commit addresses the feedback, except for the threshold comment.

tgregg

Approving, acknowledging that there is still work to do to refine the threshold value.

cheqianh · 2023-05-10T19:29:30Z

> Approving, acknowledging that there is still work to do to refine the threshold value.

Okay, I'll open a GH issue for the threshold value, and discuss it with you later.

cheqianh · 2023-05-10T19:40:03Z

Opened an issue here. I'll merge this PR first.

Adds a CI/CD workflow to detect performance regression.

eb48f40

* Adds a `--results-file` option.
* Adds a `compare` command.

cheqianh requested review from tgregg and linlin-s April 26, 2023 22:15

cheqianh marked this pull request as ready for review April 26, 2023 22:15

Fixes some typos

3492635

tgregg reviewed May 1, 2023

linlin-s reviewed May 2, 2023

.github/workflows/performance-regression.yml (resolved)

linlin-s reviewed May 3, 2023

.github/workflows/performance-regression.yml (resolved)

worked on comments.

7534384

cheqianh added 9 commits May 4, 2023 13:27

test content - fail

a62bc88

test content - pass

43877b9

test content - fail

9349626

Use the content check action from ion-java.

e8f2ef6

test content - pass

44d9328

test content - pass

c6420af

test content - pass

14ba959

test content - fail

d8c8a7d

test content - pass

2ba5f74

tgregg approved these changes May 10, 2023

cheqianh merged commit 8cc4e16 into amazon-ion:master May 10, 2023
