Support automatic 'swarm testing' for example selection #1637

Zac-HD · 2018-10-11T05:37:14Z

To paraphrase Swarm Testing (Groce et al, 2012),

Swarm testing is way to improve the diversity of generated test cases. Instead of potentially including all features in every test case, a large “swarm” of randomly generated configurations is used, each of which omits some features. ... First, some features actively prevent the system from executing interesting behaviors; e.g., pop calls may prevent an overflow bug from executing. Second, test features compete for space in each test, limiting the depth to which logic driven by features can be explored. Experimental results show that swarm testing increases coverage and can improve fault detection dramatically.

This is at minimum worth evaluating for Hypothesis - stateful testing would be a close match to the paper, while the benefits of swarm testing for data generation in general are less well explored but potentially substantial for e.g. Unicode text. The latter would require exposing data structure to the Conjecture engine as proposed in e.g. #1621.

Quoting @DRMacIver from #1401, which prompted this issue:

I've thought a bit about how to do shrinker friendly swarm testing under the current model. It's actually not too bad, though I haven't actually tried it so there's probably some annoying bits that I'm missing. The main points are:

You need swarm flags to "shrink open" so that once the shrinker has run to completion, all flags are enabled. e.g. you could do this by generating a set of banned flags.

You need to use rejection sampling rather than anything more clever.

Basically if you can then do characters().filter(character_class_is_enabled), then what happens during shrinking is follows:

We delete all of the initial iterations of the loop. Now, as if by magic, we just happen to have picked only values that are enabled.

The flags now shrink open, so we've left swarm mode and everything is now enabled.

We can now shrink the values as normal characters.

[To ensure that choosing swarm flags works well with the shrinker:] ensure that every time we check whether a flag is enabled, if it's already been set we call data.write to record the flag in the data stream, so that if we delete the first use the subsequent uses turn into an initialisation.

The text was updated successfully, but these errors were encountered:

Zac-HD added the new-feature entirely novel capabilities or strategies label Oct 11, 2018

rsokl mentioned this issue Oct 17, 2018

Broad question about Hypothesis #1641

Closed

This was referenced Oct 19, 2018

Distribution of floats() has regressed its ability to find bugs since 1.11.0 #469

Closed

NumPy RandomState #1646

Closed

Zac-HD mentioned this issue Jan 10, 2019

WIP: PoC of meta strategy, which draws others by a given weight distribution #1734

Closed

Zac-HD mentioned this issue Feb 28, 2019

Ensure boundary cases are drawn by strategies #1847

Closed

Zac-HD mentioned this issue Sep 14, 2019

lists() generates fewer very long long lists than expected #1984

Closed

Zac-HD mentioned this issue Oct 24, 2019

Clean up core test case generation logic #2137

Merged

DRMacIver mentioned this issue Nov 27, 2019

Implement swarm testing and use it for rule based stateful tests #2238

Merged

DRMacIver closed this as completed in #2238 Nov 28, 2019

Zac-HD mentioned this issue May 4, 2020

Property-based testing for the parser we-like-parsers/cpython#91

Closed

Zac-HD mentioned this issue Oct 17, 2020

Expand our use of swarm testing #2643

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support automatic 'swarm testing' for example selection #1637

Support automatic 'swarm testing' for example selection #1637

Zac-HD commented Oct 11, 2018

Support automatic 'swarm testing' for example selection #1637

Support automatic 'swarm testing' for example selection #1637

Comments

Zac-HD commented Oct 11, 2018