[WIP-DO NOT MERGE] Add InjectFault and InjectLatency features #508

mebjas · 2018-10-01T20:45:20Z

The issue or feature being addressed

Support for failure / chaos injection policies to test the resilience of the system. Discussed in detail in #499.

Details on the issue fix or feature implementation

Introduced a base Monkey Policy, along with support for Policy.InjectFault and Policy.InjectLatency as first-class citizens in the library.

Confirm the following

I have merged the latest changes from the dev vX.Y branch
I have successfully run a local build
I have included unit tests for the issue/feature
I have targeted the PR against the latest dev vX.Y branch

dnfclas · 2018-10-01T20:45:31Z

All CLA requirements met.

mebjas · 2018-10-01T20:47:07Z

Created this PR to get reviews on the top level functions. Will start working on the tests.

reisenberger

@mebjas This is looking great. Thank you very much for all the work on it!

(Review here may look like lots of comments, but mainly it is just a simple few things repeating because of the repetitive Polly syntax model; the concept coded is 👍 .)

src/Polly.Shared/Monkey/InjectFaultSyntax.cs

src/Polly.Shared/Monkey/MonkeyEngine.cs

reisenberger · 2018-10-02T20:28:52Z

src/Polly.Shared/Monkey/MonkeyEngineAsync.cs

+            Func<Context, CancellationToken, Task<TResult>> action, 
+            Context context,
+            CancellationToken cancellationToken,
+            Func<Context, Task> fault,


Func<Context, CancellationToken, Task>

reisenberger · 2018-10-02T20:29:50Z

src/Polly.Shared/Monkey/MonkeyEngineAsync.cs

+            Func<Context, CancellationToken, Task<TResult>> action,
+            Context context,
+            CancellationToken cancellationToken,
+            Func<Context, Task<Exception>> fault,


Func<Context, CancellationToken, Task<DelegateResult<TResult>>>

Ah this is a todo, I'll add.

reisenberger · 2018-10-02T21:31:30Z

@mebjas Apart from minor review above we can move to adding tests 👍 (as you say).

For writing deterministic unit tests for the code involving randomisation, an approach could be similar to the way existing Polly tests involving time abstract SystemClock:

Abstract out the randomiser in the same way Polly already abstracts out SystemClock.
Tests can then manipulate the abstracted dependency. In this case, tests could replace the randomiser with a deterministic implementation returning fixed values, for specific tests.
We attribute such test classes to ensure tests which manipulate these ambient contexts do not run in parallel and pollute each other. (Although loss of parallelism of test-running may seem a cost, the time-saving of eliminating real-time delays in wait and timeout tests far compensates.)

(Time fits the ambient-context pattern for the dependency rather than constructor injection or property injection because there is a sensible local default and there are not known good use cases for varying the implementation in production. Similar arguments can apply to randomisation.)

mebjas · 2018-10-08T06:59:25Z

@reisenberger I have dealt with most of the comments. Had some comments in the Engine implementation for supporting TResult.
About the random function, are you proposing to write the method to return deterministic values during the test with some preprocessor directives?

reisenberger · 2018-10-08T10:12:40Z

@mebjas Thank you again for your contributions! Re:

About the random function

If we adopt the same strategy as SystemClock, then in RandomGenerator we could have:

public static Func<Double> GetRandomNumber = () => rand.NextDouble();

Tests can then replace the Func<Double> to provide deterministic values eg:

RandomGenerator.GetRandomNumber = () => 0.1;  // If this appears in a test, all calls to RandomGenerator.GetRandomNumber() in that test will return 0.1

It's important then to add this attribute to classes containing tests which manipulate RandomGenerator.GetRandomNumber in this way, to prevent these tests cross-polluting each other.

And: the test class should be IDisposable with the Dispose() method using a RandomGenerator.Reset() method to reset the GetRandomNumber method.

public static void Reset()
{
    GetRandomNumber = () => rand.NextDouble();
}

(xUnit uses the Dispose() method to clean up after each individual test.)

Overall, this is the same pattern you can see in use with tests manipulating SystemClock.
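
For illustration, the pieces described above could fit together roughly as follows. This is a minimal sketch under stated assumptions, not the code from the PR: the class name DeterministicRandomSpecs and its method are hypothetical, and the test class is written without xUnit so that the snippet stands alone. In the real specs the class would presumably also carry the collection attribute mentioned above, so that tests touching the static delegate do not run in parallel.

```csharp
using System;

// Ambient-context randomiser, assembled from the snippets quoted above.
public static class RandomGenerator
{
    private static readonly Random rand = new Random();

    // Tests may replace this delegate to obtain deterministic values.
    public static Func<double> GetRandomNumber = () => rand.NextDouble();

    // Restores the default (genuinely random) behaviour.
    public static void Reset()
    {
        GetRandomNumber = () => rand.NextDouble();
    }
}

// Hypothetical test class. Under xUnit, Dispose() would run after each test,
// so every test starts again from the default randomiser.
public sealed class DeterministicRandomSpecs : IDisposable
{
    public bool Replaced_delegate_returns_fixed_value()
    {
        RandomGenerator.GetRandomNumber = () => 0.1;
        return RandomGenerator.GetRandomNumber() == 0.1;
    }

    public void Dispose() => RandomGenerator.Reset();
}
```

Because GetRandomNumber is a shared static, any two tests that replace it can interfere with each other if they run concurrently; the Reset() call only protects tests that run one after another.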

I will aim to check your other questions and the overloads in the coming days.

mebjas · 2018-10-08T12:55:22Z

@reisenberger pretty neat, thanks for that.

The other question is around usage of DelegateResult<TResult>. Meanwhile, I'll start adding tests.

mebjas · 2018-10-14T14:45:08Z

@reisenberger I have added a decent number of test cases to cover all syntaxes. Please have a look.

src/Polly.Shared/Monkey/MonkeyEngine.cs

martincostello · 2018-10-14T14:57:41Z

I was just having a look at this PR out of curiosity, and I wondered: is there any argument for the delegates that check whether faults are enabled in the async cases to also have an overload accepting a CancellationToken?

The use case I'm thinking of is where someone plumbs in something that calls out to an external service to determine whether things are enabled. Of course that might be an edge case and/or a bad idea, but I thought I'd just throw the idea out there.
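
To make the suggestion concrete, an engine method whose "enabled" check accepts a CancellationToken might look something like the sketch below. Everything here is an assumption for illustration: the class and method names are hypothetical, Polly's Context parameter is left out so the snippet compiles on its own, and the actual engine in the PR may be shaped differently.

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

public static class EnabledCheckSketch
{
    // Hypothetical helper: consults an async 'enabled' delegate (which may call
    // an external service and therefore honours cancellation) before deciding
    // whether to throw the configured fault instead of running the action.
    public static async Task<TResult> ExecuteAsync<TResult>(
        Func<CancellationToken, Task<TResult>> action,
        Func<CancellationToken, Task<bool>> enabled,
        Func<CancellationToken, Task<Exception>> fault,
        CancellationToken cancellationToken)
    {
        cancellationToken.ThrowIfCancellationRequested();

        if (await enabled(cancellationToken).ConfigureAwait(false))
        {
            Exception exception = await fault(cancellationToken).ConfigureAwait(false);
            if (exception != null) throw exception; // a null fault means nothing is injected
        }

        return await action(cancellationToken).ConfigureAwait(false);
    }
}
```

Passing the token to the delegate lets a slow external lookup be abandoned when the caller cancels, rather than holding up the executed action.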

mebjas · 2018-10-14T15:03:53Z

@martincostello

I was just having a look at this PR out of curiosity, and I wondered: is there any argument for the delegates that check whether faults are enabled in the async cases to also have an overload accepting a CancellationToken?

The use case I'm thinking of is where someone plumbs in something that calls out to an external service to determine whether things are enabled. Of course that might be an edge case and/or a bad idea, but I thought I'd just throw the idea out there.

Like this - https://github.com/mebjas/Polly/blob/dev-minhazv/src/Polly.Shared/Monkey/MonkeyEngineAsync.cs#L14 ?

martincostello · 2018-10-14T15:05:00Z

Yeah, like that.

mebjas · 2018-10-14T15:06:10Z

Yeah, like that.

Yes, a similar concern was raised in one of the earlier review comments and was implemented after that.
Let me know if you are doing a further review; I'll take care of all your comments / suggestions in a single push.

@martincostello - Thanks!

mebjas · 2018-10-15T07:24:20Z

@reisenberger AFAIK the fluent assertion tests would be running in parallel.

Even if I set

RandomGenerator.GetRandomNumber = () => 0.1;

and reset it in the Dispose() method, state from different tests seems to interfere through this common static object. How should I deal with this here?

reisenberger · 2018-10-15T17:58:39Z

Hi @mebjas . To force certain tests to not run in parallel, so that they don't cross-pollute:

add this attribute to classes containing tests which manipulate RandomGenerator.GetRandomNumber .

(We should probably rename the attribute to something like: AmbientContextDependentTestCollection.)

mebjas · 2018-10-15T18:22:03Z

Hi @mebjas . To force certain tests to not run in parallel, so that they don't cross-pollute:

add this attribute to classes containing tests which manipulate RandomGenerator.GetRandomNumber .

(We should probably rename the attribute to something like: AmbientContextDependentTestCollection.)

Added the attribute. The attribute is being used in a lot of places. Should it be renamed in this PR?

reisenberger · 2018-10-15T20:36:28Z

Added the attribute. The attribute is being used in a lot of places. Should it be renamed in this PR?

Great, thanks! I am happy if you want to leave the renaming to me as a clean-up pass after we have finished all the changes for v6.2.0.

bartelink · 2018-12-02T11:07:57Z

Sounds great

separate libs FTW (and making sure the path for wrapper libs is smooth and continues to be is great for the ecosystem in general (plug: https://github.com/jet/CallPolly)). Muchos kudos for taking on the stacks of extra work in the name of doing the right thing @reisenberger
Simmy works best for me naming-wise (I guess Simmy is a monkey-pirate with a parrot on its shoulders in the logo?)
I agree with doing a merge with immediate demerge even if the guys are too humble
New levels of hierarchy such as MonkeyPolicy and CustomPolicy really need to earn their keep as they start with -100 points for consumers and people walking the code; I'd favor having a common interface (if necessary) but not a common type until it really hurts (I could be missing something and don't have time to dig deep as to whether this even makes sense in practice.)

vany0114 · 2018-12-02T21:17:13Z

@reisenberger sounds great!

I totally agree with the creation of a new package.
I like Simmy, but I definitely love Molly.
Seeing my name among Polly's contributors would make me happy and proud 😃
I could help with the creation of the new repo, etc. from next week, as I'll be on vacation leave 🎉

Thanks @reisenberger for all your extra work on this!

mebjas · 2018-12-03T02:17:25Z

@reisenberger

Makes sense to me too. +1 on this;
Simmy name sounds cool :) @bartelink Nice logo idea; another derivation from this could be a parrot with eye patch. Ah, nvm maybe too early for this.
Totally agree with the suggestion to follow a similar interface. Would this library have a dependency on Polly? I guess yes!
Thanks for merge, de-merge.
I'd like to contribute both idea wise and code wise to this new project, kindly let me know where all the discussions happen.
Also, I am in if you need any help with the new project.

Thanks @reisenberger @vany0114 @bartelink

reisenberger · 2018-12-13T06:58:18Z

Thank you @mebjas @vany0114 @bartelink . Courtesy update: I am next likely to get a chunk of time to progress this in late December.

reisenberger · 2018-12-29T22:39:41Z

@mebjas @vany0114 I completed an extensive refactor of Polly (#552) to allow us to host policies outside Polly in a new manner. I need to set up the new Simmy/Molly repo, then we can pull the code over there, and we can all pitch in on any final code clean-up and documentation!

mebjas · 2018-12-30T01:39:41Z

Thanks for this!

Is there any documentation on what hosting policies outside Polly looks like?

reisenberger · 2018-12-30T14:43:22Z

Is there any documentation on what hosting policies outside Polly looks like?

I have rebased the PR onto v7.0.0 branch, and am aiming to push a commit to show how the policies would join to the new v7.0.0 approach.

Full documentation (for the wider user community) will also follow.

reisenberger · 2018-12-30T20:23:11Z

@mebjas @vany0114 I was unable to push further changes to this PR, being today's modifications to bring everything into line with v7.0.0. (Maybe we have rebased the PR too many times ... not digging further now.)

I have opened a separate PR #553 as a mechanism only to show you latest modifications:

Tie MonkeyPolicies into new way of integrating custom policies, Polly v7.0.0
Classes InjectBehaviourPolicy, InjectOutcomePolicy and InjectLatencyPolicy, to fit this new pattern
Move syntax to MonkeyPolicy.InjectFault(...)
Move syntax to MonkeyPolicy.InjectLatency(...)
Move syntax to MonkeyPolicy.InjectBehaviour(...)
Adjust specs for above
Tests for functionality change: make it valid for InjectFault policies to dynamically inject null (no fault)
Tests for new guard conditions: fault-injection thresholds being out of range
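
As a rough illustration of the last two items, a threshold guard and a "null means no fault" rule could be sketched as below. This is not the code from #553: the names are hypothetical, the valid range of [0, 1] is an assumption, and so is the rule that a fault is injected when the random draw falls below the threshold.

```csharp
using System;

public static class InjectionDecisionSketch
{
    // Hypothetical guard: the threshold is assumed to be a probability,
    // so values outside [0, 1] (or NaN) are rejected.
    public static void EnsureValidThreshold(double injectionThreshold)
    {
        if (double.IsNaN(injectionThreshold) || injectionThreshold < 0 || injectionThreshold > 1)
        {
            throw new ArgumentOutOfRangeException(
                nameof(injectionThreshold),
                "Injection threshold must be between 0 and 1.");
        }
    }

    // Assumed rule: inject when the random draw falls below the threshold.
    public static bool ShouldInject(double injectionThreshold, Func<double> getRandomNumber)
    {
        EnsureValidThreshold(injectionThreshold);
        return getRandomNumber() < injectionThreshold;
    }

    // Throws the fault supplied by the delegate, unless injection is not
    // triggered or the delegate returns null (meaning "no fault this time").
    public static void InjectFaultIfDue(
        double injectionThreshold,
        Func<double> getRandomNumber,
        Func<Exception> fault)
    {
        if (!ShouldInject(injectionThreshold, getRandomNumber)) return;

        Exception exception = fault();
        if (exception != null) throw exception;
    }
}
```

Taking the random source as a Func<double> parameter keeps the sketch deterministic under test, in the same spirit as the replaceable RandomGenerator.GetRandomNumber delegate discussed earlier in the thread.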

Thanks for the huge amount of work you did @mebjas @vany0114 to get PR 508 to functionality-ready. My last piece here bridges us to a way forward with Simmy as a separate package to Polly v7, and implements the syntax preferences described in the consultation document.

Any comment welcome.

Next steps (probably not before next weekend) : Set up the new Simmy repo and build; and move the code over to it.

Given the syntax preferences are done, I think the code as #553 is ready to go (in the new repo), bar trivial re-namespacing (namespace Simmy). Will be good to get all your hard work released to the Polly public! 👍

reisenberger · 2018-12-30T20:25:42Z

@mebjas @vany0114 Note: The state that #508 has got into, due to repeated rebasing/upward merges, means we may not be able (or it may not be wise) to do the merge/de-merge to master on Polly (I'm not that happy about risking the state of the Polly master branch from this PR ...), but we will ensure that you get credit for this work all over the doco. Thanks!

vany0114 · 2018-12-31T02:30:35Z

Thanks for all your work on this PR, looking forward to seeing Polly v7 and Simmy!

mebjas · 2018-12-31T13:09:48Z

@reisenberger I can see that a lot of effort was put into the redesign, and it looks much more polished. Thanks for the contribution :)

Look forward to the new repo and adaptation.

reisenberger · 2019-01-06T09:57:50Z

@vany0114 @mebjas , Courtesy update: I am working on the Polly v7.0 user documentation (custom policy documentation) which we need for launching v7 (which we need launched, in turn, for Simmy to reference Polly from the external repo).

mebjas · 2019-01-06T13:44:40Z

@vany0114 @mebjas , Courtesy update: I am working on the Polly v7.0 user documentation (custom policy documentation) which we need for launching v7 (which we need launched, in turn, for Simmy to reference Polly from the external repo).

Thanks @reisenberger
Where is the documentation maintained? Is there a data for v7?

vany0114 · 2019-01-06T17:33:20Z

@vany0114 @mebjas , Courtesy update: I am working on the Polly v7.0 user documentation (custom policy documentation) which we need for launching v7 (which we need launched, in turn, for Simmy to reference Polly from the external repo).

Thanks @reisenberger! If you need some help, just let me know.

reisenberger · 2019-01-08T22:11:56Z

@mebjas @vany0114 / anyone interested: I created the new repo for Simmy! Please head over! If you are interested to take up any of the items marked help-wanted, that will speed us all along! (Post on the issue to say anything you are picking up ...). Thanks!

I am half-way thru the custom policy documentation, and will check in with something for all to read (and comment) soon as I can. Thanks!

vany0114 · 2019-01-08T22:20:47Z

Guys, I picked this one up for now

reisenberger · 2019-01-12T12:16:13Z

Hi @mebjas Re:

Where is the documentation maintained?

The Readme and Wiki of this repo. We also sometimes blog.

Is there a data for v7?

The readme directs people to sources of info on changes

@vany0114

Bring Simmy codebase across from App-vNext/Polly#553 (and originally App-vNext/Polly#508). Represents original contributions by @vany0114 , @mebjas and @reisenberger, which @vany0114 has brought to the Simmy repo from Polly. Noting that all contributors @vany0114 , @mebjas and @reisenberger had signed the .NET Foundation Contributors License Agreement against the Polly repo.

reisenberger · 2019-01-20T18:10:46Z

Locking this thread so that conversation can relate to the latest issues and version of the codebase on the Simmy repo; interested users should head over there!

If anyone has concerns/questions about anything mentioned higher up this thread, please do mention it on Simmy or the design thread here on Polly.

reisenberger · 2019-03-02T07:26:30Z

Closing as this codebase now exists definitively in the Simmy repo, the readme now links out to Simmy and #499 still tracks on the issues page.

minhaz added 2 commits October 2, 2018 01:18

Add all the files executing basic functionalities

fec5e6c

Deleted unnecessary files

1b819bb

mebjas mentioned this pull request Oct 2, 2018

Project SIMMY: Support for failure / chaos injection policies to test resilience of the system #499

Closed

reisenberger reviewed Oct 2, 2018

minhaz added 2 commits October 6, 2018 06:51

Changes corresponding to PR Comments

6ebcdc3

Remaninng changes and removed incomplete test for now

12aa05e

minhaz added 6 commits October 13, 2018 19:37

Code changes based on PR comments

b7aeea3

- added proper engine for TReult based faults and implementation in InjectFault layer - RandomGenerator class corrected

Corrected Action syntax for MonkeyEngine and added some tests

ffa69a4

Added all tests for MonkeySyntax and an extra overload for Action bas…

898a852

…ed faults

Added more test cases to cover Monkey Syntaxes

54cac34

- Added Tests for MonkeySyntax, MonkeyTResultSyntax, MonkeySyntaxAsync and MonkeyTResultSyntaxAsync

Fix for build warnings leading to failures

8174b08

Added Unit Tests for InjectFault Policies

3ccfb6d

Covered all overloads

martincostello reviewed Oct 14, 2018

Minor PR comment fix

fce0b1d

Added attribute to run tests one by one

b4da7a7

reisenberger mentioned this pull request Dec 13, 2018

DecorrelatedJitter is a first-class citizen #536

Closed

reisenberger mentioned this pull request Dec 29, 2018

Extensibility: Enable custom policies - support Polly.Contrib, chaos engineering, fakes ... and anything ... #551

Closed

reisenberger changed the base branch from onv612dev to v700 December 30, 2018 12:37

Merge branch 'v700' into dev-minhazv

f2bec42

reisenberger mentioned this pull request Dec 30, 2018

[WIP-DO NOT MERGE] Chaos and Fault Injection polices #553

Closed

reisenberger changed the title ~~Add InjectFault and InjectLatency features~~ [WIP-DO NOT MERGE] Add InjectFault and InjectLatency features Dec 30, 2018

reisenberger mentioned this pull request Jan 19, 2019

Add logo Polly-Contrib/Simmy#6

Closed

reisenberger mentioned this pull request Jan 20, 2019

Add overloads taking CancellationToken, for async configuration-providing delegates Polly-Contrib/Simmy#19

Closed

App-vNext locked and limited conversation to collaborators Jan 20, 2019

reisenberger closed this Mar 2, 2019
