add threshold for RCA #5218

suxi-ms · 2020-06-09T09:57:22Z

Update Root Cause Analysis interface.

For root cause analysis interface, we added a new parameter called anomalyDeltaThreshold for users to choose.
It won't break existing interface if some users are using it.
Fixes Add a threshold parameter for root cause analysis #5188 .
Related PR add root cause localization transformer #4925

lisahua · 2020-06-09T16:35:10Z

src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs

        /// <example>
        /// <format type="text/markdown">
        /// <![CDATA[
        /// [!code-csharp[LocalizeRootCause](~/../docs/samples/docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/TimeSeries/LocalizeRootCause.cs)]
        /// ]]>
        /// </format>
        /// </example>
-        public static RootCause LocalizeRootCause(this AnomalyDetectionCatalog catalog, RootCauseLocalizationInput src, double beta = 0.5)
+        public static RootCause LocalizeRootCause(this AnomalyDetectionCatalog catalog, RootCauseLocalizationInput src, double beta = 0.5, double rootCauseThreshold = 0.95)


nit: another option as suggested by Harish (for AD) is to create an Options class to avoid multiple overload in future. #Resolved

nit: another option as suggested by Harish (for AD) is to create an Options class to avoid multiple overload in future.

I found that the option is always served for transformer implementation. Does it makes sense to use it in the extentions catalog? #Resolved

I think ML.Net team treats it at a pattern of handling multiple overloaded methods thus not confined to which type of class it can be used. yet it's just a minor comments so feel free to ignore #Resolved

Four parameters to a function is not too bad. If you are expecting to enhance this API further in the future by adding more parameters, I would suggest adding an Options parameter now. If there are no plans to add new parameters now, please add an Options parameter the next time there is a new parameter added.

E.g. We have now migrated rootCauseThreshold from being a private constant in RootCauseAnaylyzer to being a parameter here. But I also see _anomalyRatioThreshold and _anomalyPreDeltaThreshold as private constants there?

Does it make sense to convert them to parameters? If so, please create an Options class and add all those parameters there.

In reply to: 438493061 [](ancestors = 438493061)

Four parameters to a function is not too bad. If you are expecting to enhance this API further in the future by adding more parameters, I would suggest adding an Options parameter now. If there are no plans to add new parameters now, please add an Options parameter the next time there is a new parameter added.

E.g. We have now migrated rootCauseThreshold from being a private constant in RootCauseAnaylyzer to being a parameter here. But I also see _anomalyRatioThreshold and _anomalyPreDeltaThreshold as private constants there?

Does it make sense to convert them to parameters? If so, please create an Options class and add all those parameters there.

In reply to: 438493061 [](ancestors = 438493061)

Thanks for the suggestion, have updated #Resolved

Sorry, I don't think I communicated well.

The pattern in ML.NET is to have two overloaded functions. A convenience function and a full function. The convenience function takes in a minimal set of parameters that will meet the common use case and use default values for all other parameters. We always have this function. The second function is if a particular feature requires a function with lots of parameters. In this case we add a second overloaded function that takes an Options parameter where the user can set each parameter individually.

So at the very least you should have one function without an Options parameter. You add a second function with an Options parameter only if you are going to need lots of parameters.

Like I mentioned above, if all you need are four parameters, a single function is okay. Only if you were going to migrate the other constants from RootCauseAnalyzer, please add a second function with an Options parameter.

In reply to: 442571173 [](ancestors = 442571173)

lisahua · 2020-06-09T16:51:28Z

src/Microsoft.ML.TimeSeries/RootCauseAnalyzer.cs

@@ -421,7 +422,7 @@ private TimeSeriesPoint GetPointByDimension(Dictionary<string, TimeSeriesPoint>

        private static string GetDicCode(Dictionary<string, Object> dic)
        {


is this an ideal way to create a key? or probably more efficient (and accurate) to maintain a tuple dictionary for <key, value> pair and invoke object.equal methods? #Resolved

Also for this piece, does it assume that the value is additive? what if the values are not additive? Sample1 and Sample2, when we set the AggregateType.Unknown, is this piece still applicable?

p.Value = +leave.Value; p.ExpectedValue = +leave.ExpectedValue; p.Delta = +leave.Delta; ``` #Resolved

Also for this piece, does it assume that the value is additive? what if the values are not additive? Sample1 and Sample2, when we set the AggregateType.Unknown, is this piece still applicable?

p.Value = +leave.Value; p.ExpectedValue = +leave.ExpectedValue; p.Delta = +leave.Delta;

Currently, for the unknown type, as the calculation might be complex, we use a default method like above to treat it.
#Resolved

is this an ideal way to create a key? or probably more efficient (and accurate) to maintain a tuple dictionary for <key, value> pair and invoke object.equal methods?

It is hard to implement it as the object is very complex. #Resolved

hmmm... will user just override the equal/Hashcode methods and ML.Net just needs to do object.EqualTo(..)? #Resolved

hmmm... will user just override the equal/Hashcode methods and ML.Net just needs to do object.EqualTo(..)?

Thanks for the reminder, have changed #Resolved

src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs

frank-dong-ms-zz

You don't need to change libmf, right? Please take change from submodules(when git clone or git pull, add --recurse-submodules option as well)

harishsk · 2020-06-10T18:01:22Z

To clarify @frank-dong-ms's comment above, you are seeing test failures above because your rebase didn't include an update to the submodule. Recently there was an update to the libmf submodule that fixed test failures. Please pull and rebase with --recurse-submodules and update your branch and these test failures should go away. #Resolved

Co-authored-by: Mustafa Bal <5262061+mstfbl@users.noreply.github.com>

codecov · 2020-06-12T10:21:08Z

Codecov Report

Merging #5218 into master will decrease coverage by 3.92%.
The diff coverage is 38.35%.

@@            Coverage Diff             @@
##           master    #5218      +/-   ##
==========================================
- Coverage   73.31%   69.38%   -3.93%     
==========================================
  Files        1007      770     -237     
  Lines      187969   144849   -43120     
  Branches    20241    18435    -1806     
==========================================
- Hits       137812   100507   -37305     
+ Misses      44630    39195    -5435     
+ Partials     5527     5147     -380

Flag	Coverage Δ
#Debug	`69.38% <38.35%> (-3.93%)`	⬇️
#production	`69.38% <38.35%> (+0.31%)`	⬆️
#test	`?`

Impacted Files	Coverage Δ
...crosoft.ML.TimeSeries/RootCauseLocalizationType.cs	`49.42% <0.00%> (-1.77%)`	⬇️
src/Microsoft.ML.TimeSeries/RootCauseAnalyzer.cs	`54.20% <40.29%> (-2.25%)`	⬇️
src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs	`92.68% <50.00%> (-2.32%)`	⬇️
...c/Microsoft.ML.FastTree/Utils/ThreadTaskManager.cs	`79.48% <0.00%> (-20.52%)`	⬇️
src/Microsoft.ML.Featurizers/Common.cs	`57.14% <0.00%> (-7.62%)`	⬇️
src/Microsoft.ML.AutoML/Utils/BestResultUtil.cs	`53.84% <0.00%> (-3.69%)`	⬇️
...rosoft.ML.AutoML/ColumnInference/TextFileSample.cs	`59.60% <0.00%> (-2.65%)`	⬇️
...L.AutoML/TrainerExtensions/TrainerExtensionUtil.cs	`84.71% <0.00%> (-2.19%)`	⬇️
...or/CodeGenerator/CSharp/TrainerGeneratorFactory.cs	`81.25% <0.00%> (-2.09%)`	⬇️
...rc/Microsoft.ML.Featurizers/DateTimeTransformer.cs	`87.15% <0.00%> (-1.81%)`	⬇️
... and 334 more

suxi-ms · 2020-06-12T14:30:19Z

You don't need to change libmf, right? Please take change from submodules(when git clone or git pull, add --recurse-submodules option as well)

Thanks for the reminder, updated #Resolved

suxi-ms · 2020-06-12T14:31:14Z

To clarify @frank-dong-ms's comment above, you are seeing test failures above because your rebase didn't include an update to the submodule. Recently there was an update to the libmf submodule that fixed test failures. Please pull and rebase with --recurse-submodules and update your branch and these test failures should go away.

Thanks for the reminder, updated #Resolved

harishsk · 2020-06-17T17:49:37Z

src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs

@@ -187,14 +187,15 @@ public static class TimeSeriesCatalog
        /// It is used when score is calculated for each root cause item. The range of beta should be in [0,1].
        /// For a larger beta, root cause items which have a large difference between value and expected value will get a high score.
        /// For a small beta, root cause items which have a high relative change will get a low score.</param>
+        /// <param name="rootCauseThreshold">A threshold to determine whether the point should be root cause. If the point's delta is equal to or larger than rootCauseThreshold multiplied by anomaly dimension point's delta, this point is treated as a root cause. Different threshold will turn out different results. Users can choose the delta according to their data and requirments. </param>
        /// <example>


What is the range of valid values for this parameter? #Resolved

What is the range of valid values for this parameter?

Have updated the range

harishsk

suxi-ms added 29 commits June 3, 2020 15:02

add root cause localization transformer

2c7a62b

merge with remote repo

f9af073

rebase

0551ecd

merge

3138c39

temp save for internal review

946ac43

merge

d39e657

merge

eadada4

merge

6cf9b59

merge

36d1625

merge

0f81982

merge

f3fad18

merge

612be4d

merge

23261f2

merge

fa10bff

update

919ed6b

refine internal logic

0407282

merge

0efee95

update

bde4a53

merge

a1ab905

merge

711dfca

merge

e202a04

fix type

1a2d569

update model signature

b059b53

update rca interface by removing transformer

fa834fd

update

2b29426

update

0860e01

merge

6cf15c8

update

a3eee1a

some update

66261e6

suxi-ms requested a review from a team as a code owner June 9, 2020 09:57

update

887e992

lisahua reviewed Jun 9, 2020

View reviewed changes

mstfbl reviewed Jun 10, 2020

View reviewed changes

src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs Outdated Show resolved Hide resolved

frank-dong-ms-zz reviewed Jun 10, 2020

View reviewed changes

suxi-ms and others added 4 commits June 11, 2020 09:19

Update src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs

421157d

Co-authored-by: Mustafa Bal <5262061+mstfbl@users.noreply.github.com>

fix typo in extension catalgo comments

54727ea

merge

0e60d3c

update libmf

9d4a45e

update point map

cd19889

harishsk reviewed Jun 17, 2020

View reviewed changes

suxi-ms added 2 commits June 18, 2020 20:32

add root cause options

28bf0a5

revert option changes

8e5784e

harishsk approved these changes Jun 19, 2020

View reviewed changes

harishsk merged commit 1c2469f into dotnet:master Jun 19, 2020

lisahua mentioned this pull request Jul 2, 2020

[TimeSeries Explanation] Score is NaN #5274

Closed

ghost locked as resolved and limited conversation to collaborators Mar 18, 2022

		@@ -421,7 +422,7 @@ private TimeSeriesPoint GetPointByDimension(Dictionary<string, TimeSeriesPoint>

		private static string GetDicCode(Dictionary<string, Object> dic)
		{

add threshold for RCA #5218

add threshold for RCA #5218

Uh oh!

Conversation

suxi-ms commented Jun 9, 2020 • edited by mstfbl Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lisahua Jun 9, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 10, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lisahua Jun 11, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harishsk Jun 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 19, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harishsk Jun 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lisahua Jun 9, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lisahua Jun 9, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 10, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 10, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lisahua Jun 11, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 12, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

frank-dong-ms-zz left a comment

Choose a reason for hiding this comment

Uh oh!

harishsk commented Jun 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jun 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

suxi-ms commented Jun 12, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

suxi-ms commented Jun 12, 2020 • edited by harishsk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

harishsk Jun 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

suxi-ms Jun 19, 2020

Choose a reason for hiding this comment

Uh oh!

harishsk left a comment

suxi-ms commented Jun 9, 2020 •

edited by mstfbl

Loading

lisahua Jun 9, 2020 •

edited by harishsk

Loading

suxi-ms Jun 10, 2020 •

edited by harishsk

Loading

lisahua Jun 11, 2020 •

edited by harishsk

Loading

harishsk Jun 17, 2020 •

edited

Loading

suxi-ms Jun 19, 2020 •

edited by harishsk

Loading

harishsk Jun 19, 2020 •

edited

Loading

lisahua Jun 9, 2020 •

edited by harishsk

Loading

lisahua Jun 9, 2020 •

edited by harishsk

Loading

suxi-ms Jun 10, 2020 •

edited by harishsk

Loading

suxi-ms Jun 10, 2020 •

edited by harishsk

Loading

lisahua Jun 11, 2020 •

edited by harishsk

Loading

suxi-ms Jun 12, 2020 •

edited by harishsk

Loading

harishsk commented Jun 10, 2020 •

edited

Loading

codecov bot commented Jun 12, 2020 •

edited

Loading

suxi-ms commented Jun 12, 2020 •

edited by harishsk

Loading

suxi-ms commented Jun 12, 2020 •

edited by harishsk

Loading

harishsk Jun 17, 2020 •

edited

Loading