Fix Hodges-Lehmann distribution ratio calculation #111

mgrzaslewicz · 2024-07-22T13:32:45Z

I hoped to see classification change for added distribution. As it should be detected as an improvement, while in production we have no impact:

However previous classification was fine for default tolerance. We're just using a different production tolerance in RegressionCheckIT and improvement is below threshold.
So at least we have a properly calculated Hodges-Lehmann coming with this change.

src/test/kotlin/com/atlassian/performance/tools/report/ShiftedDistributionRegressionTestTest.kt

...ain/kotlin/com/atlassian/performance/tools/report/api/distribution/DistributionComparator.kt

dagguh · 2024-07-22T14:45:03Z

...in/com/atlassian/performance/tools/report/api/judge/RelativeNonparametricPerformanceJudge.kt

@@ -2,7 +2,7 @@ package com.atlassian.performance.tools.report.api.judge

 import com.atlassian.performance.tools.jiraactions.api.ActionType
 import com.atlassian.performance.tools.report.ActionMetricsReader
-import com.atlassian.performance.tools.report.api.ShiftedDistributionRegressionTest
+import com.atlassian.performance.tools.report.api.distribution.DistributionComparator


No difference detected by RelativeNonparametricPerformanceJudgeTest?

No diff here

So we don't see why the new one is better.

How do you define better? Classification has not changed for unit tested cases and as stated in first PR comment, it's expected

src/main/kotlin/com/atlassian/performance/tools/report/api/ShiftedDistributionRegressionTest.kt

dagguh · 2024-07-22T14:47:36Z

...ain/kotlin/com/atlassian/performance/tools/report/api/distribution/DistributionComparator.kt

+                values[k++] = func(baseline[i], experiment[j])
+            }
+        }
+        return Median().withNaNStrategy(NaNStrategy.MINIMAL).evaluate(values)


When used on a set of latency measurements, why would we inject fake negative infinities?

It's needed for cases when part of the distribution is the same and other part is different. Without it small difference will be calculated as whole distribution difference and instead of NaN you will get -0.45623836126629425 in attached example

Yes, but the workaround injects fake extreme values. Both "before" and "after" seem wrong.
PS. this case is untested, right?

The only value of this PR is to have a relativeShift correctly calculated, it does not improve classification as I hoped. I will prepare a small PR to fix ShiftedDistributionRegressionTest and decline this one

...in/com/atlassian/performance/tools/report/api/judge/RelativeNonparametricPerformanceJudge.kt

...ain/kotlin/com/atlassian/performance/tools/report/api/distribution/DistributionComparator.kt

mgrzaslewicz requested a review from a team as a code owner July 22, 2024 13:32

mgrzaslewicz force-pushed the fix-hodges-lehmann branch from ceafd55 to fee23ec Compare July 22, 2024 14:32

Fix Hodges-Lehmann distribution ratio calculation

1d2eb85

mgrzaslewicz force-pushed the fix-hodges-lehmann branch from fee23ec to 1d2eb85 Compare July 22, 2024 14:41

dagguh suggested changes Jul 22, 2024

View reviewed changes

Address PR comments

89b2a3e

mgrzaslewicz force-pushed the fix-hodges-lehmann branch from 92d2c11 to 89b2a3e Compare July 22, 2024 15:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Hodges-Lehmann distribution ratio calculation #111

Fix Hodges-Lehmann distribution ratio calculation #111

mgrzaslewicz commented Jul 22, 2024 •

edited

Loading

dagguh Jul 22, 2024

mgrzaslewicz Jul 22, 2024

dagguh Jul 22, 2024

mgrzaslewicz Jul 22, 2024

dagguh Jul 22, 2024

mgrzaslewicz Jul 22, 2024 •

edited

Loading

dagguh Jul 22, 2024 •

edited

Loading

mgrzaslewicz Jul 22, 2024

Fix Hodges-Lehmann distribution ratio calculation #111

Are you sure you want to change the base?

Fix Hodges-Lehmann distribution ratio calculation #111

Conversation

mgrzaslewicz commented Jul 22, 2024 • edited Loading

dagguh Jul 22, 2024

Choose a reason for hiding this comment

mgrzaslewicz Jul 22, 2024

Choose a reason for hiding this comment

dagguh Jul 22, 2024

Choose a reason for hiding this comment

mgrzaslewicz Jul 22, 2024

Choose a reason for hiding this comment

dagguh Jul 22, 2024

Choose a reason for hiding this comment

mgrzaslewicz Jul 22, 2024 • edited Loading

Choose a reason for hiding this comment

dagguh Jul 22, 2024 • edited Loading

Choose a reason for hiding this comment

mgrzaslewicz Jul 22, 2024

Choose a reason for hiding this comment

mgrzaslewicz commented Jul 22, 2024 •

edited

Loading

mgrzaslewicz Jul 22, 2024 •

edited

Loading

dagguh Jul 22, 2024 •

edited

Loading