Cut calculation time by half with population #862

Closed
sandcha opened this issue Apr 4, 2019 · 6 comments
Labels: kind:discovery Issue requires discovery: value, ux and tech

sandcha (Collaborator) commented Apr 4, 2019

LexImpact has to run two calculations on bulk data (50 000 households) in less than 15 seconds.

Definition of done:
Explore (2 days max) whether calculation time can be reduced by:

  • extracting vectors in the calculation (shortening vectors)
  • neutralising variables
  • optimising byte code production
bonjourmauko added the kind:discovery Issue requires discovery: value, ux and tech label Apr 25, 2019
fpagnoux (Member) commented May 3, 2019

Some experimentation on this issue, going deeper into an idea initially introduced by @Morendil.

TL;DR: there seems to be a way to speed up the calculations by 33% to 50% in common use cases.

Introduction

OpenFisca (OF) runs calculations vectorially. Concretely, when given a large population, OF never loops over the individuals, but runs each operation on the whole population.
This approach is much faster than looping over the population when the latter is large enough. However, it leads to performing a lot of useless calculations, as there is no conditional branching.
For instance, even if a benefit only applies to a small portion of the population, it will still be computed for everyone, then zeroed for the ineligible individuals.
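
To make this concrete, here is a minimal NumPy sketch of that behaviour (the variable names and the formula are illustrative, not OpenFisca's actual code):

```python
import numpy as np

salary = np.array([1200.0, 3500.0, 800.0, 2100.0])
eligible = np.array([True, False, False, True])

# The formula runs on every individual, eligible or not...
benefit_for_everyone = np.maximum(0, 1000 - 0.3 * salary)

# ...and is only zeroed afterwards for the ineligible ones.
benefit = np.where(eligible, benefit_for_everyone, 0)
```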

The experiment presented here tries to improve performance by reducing these useless calculations: it introduces a form of conditional branching while respecting the vectorial paradigm.

The idea is, when facing a condition, to "split and combine" (SC), as sketched below:

  • Split the population between the individuals who satisfy the condition and the ones who don't
  • Run different calculations on these 2 sub-populations
  • Combine the results to get the value for the whole population
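
A minimal sketch of the SC approach with NumPy boolean masks (again, the formulas and variable names are illustrative assumptions, not the code used in the experiments below):

```python
import numpy as np

salary = np.array([1200.0, 3500.0, 800.0, 2100.0])
handicapped = np.array([True, False, False, True])

# 1. Split: extract the two sub-populations (the vectors get shorter).
salary_handicapped = salary[handicapped]
salary_other = salary[~handicapped]

# 2. Run a different formula on each sub-population.
benefit_handicapped = np.maximum(0, 1500 - 0.2 * salary_handicapped)
benefit_other = np.maximum(0, 1000 - 0.3 * salary_other)

# 3. Combine: scatter the partial results back into a full-size vector.
benefit = np.empty_like(salary)
benefit[handicapped] = benefit_handicapped
benefit[~handicapped] = benefit_other
```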

Experiment 1 (symmetric case disjunction)

The code is available on this gist.

  • We introduce a benefit that is calculated differently depending on whether individuals are handicapped or not.
  • We calculate it both with OF's current vectorial approach and with the SC approach, and measure the ratio of execution times between the two.
  • We run this 10 times and take the average ratio (see the timing sketch below).
    • A ratio of 2, for instance, means that the SC approach is on average twice as fast as the current approach.
  • We study the impact of 3 factors:
    • The size N of the population
    • The frequency F of handicap in the population
    • The complexity C of the calculations in each case, measured as the number of basic vectorial operations in a formula. 3 different versions of the formulas are introduced to that end.
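
The measurement can be pictured as a small timing harness like the one below (a hypothetical sketch; the actual code is in the gist, and `compute_current` / `compute_sc` stand for the two implementations being compared):

```python
import timeit

import numpy as np

def average_ratio(compute_current, compute_sc, population, runs=10):
    """Mean of (current-approach time / SC time) over `runs` executions."""
    ratios = []
    for _ in range(runs):
        t_current = timeit.timeit(lambda: compute_current(population), number=1)
        t_sc = timeit.timeit(lambda: compute_sc(population), number=1)
        ratios.append(t_current / t_sc)
    return float(np.mean(ratios))
```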

Result

Average ratio of execution times (current approach / SC):

| C  | F   | N=1 | N=1M |
|----|-----|-----|------|
| 13 | 0.1 | 2   | 1.5  |
| 13 | 0.5 | 2   | 1.1  |
| 6  | 0.1 | 2   | 1.35 |
| 6  | 0.5 | 2   | 0.9  |
| 2  | 0.1 | 1.5 | 0.6  |
| 2  | 0.5 | 1.5 | 0.4  |

fpagnoux (Member) commented May 3, 2019

Experiment 2 (eligibility)

Similar to the first experiment, except that this time only handicapped individuals can get the benefit. This creates an asymmetry: one case is much less complex than the other.
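
With SC, the complex formula then only runs on the eligible subset, and everyone else directly gets 0 (illustrative sketch, same assumptions as above):

```python
import numpy as np

salary = np.array([1200.0, 3500.0, 800.0, 2100.0])
handicapped = np.array([True, False, False, True])

# Only the eligible subset goes through the expensive formula.
benefit = np.zeros_like(salary)
benefit[handicapped] = np.maximum(0, 1500 - 0.2 * salary[handicapped])
```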

Result

Average ratio of execution times (current approach / SC):

| C  | F   | N=1 | N=1M |
|----|-----|-----|------|
| 13 | 0.1 | 6   | 2    |
| 13 | 0.5 | 4   | 0.8  |
| 6  | 0.1 | 4.5 | 1.5  |
| 6  | 0.5 | 3   | 0.67 |
| 2  | 0.1 | 1.9 | 0.4  |
| 2  | 0.5 | 1.6 | 0.3  |

fpagnoux (Member) commented May 3, 2019

Interpretation

  • SC is always more efficient for a single-individual population. This was expected, as in this case we can "short-circuit" the calculation and only use the relevant formula.
  • For a large population, SC is more efficient:
    • when complexity is high. This makes sense, as splitting and combining has a fixed overhead: if the formulas' operations are trivial, this overhead is bigger than the small cost of the operations it saves.
    • when frequency is low, especially in the "eligibility" cases where complex calculations only need to be run for a fraction of the population.

Conclusion

I think the approach is really promising: it doesn't take a lot of operations to make SC more efficient than our current implementation, especially when calculations are relevant to only a small fraction of the population.

fpagnoux (Member) commented May 3, 2019

Implementation challenges

I think the main challenge would be adapting the cache to the fact that we would now only "partially" calculate some variables.

There will also be some plumbing to do, but the way formulas are written actually helps a lot: because they take a "persons" array as their first argument, this persons argument can be an object representing a subset of the simulation's persons, without too much adaptation.
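
A very rough sketch of what such a sub-population object could look like (hypothetical code, not an existing OpenFisca class; the name `SubPopulation` and its interface are assumptions):

```python
class SubPopulation:
    """Wraps the full population and a boolean mask selecting a subset."""

    def __init__(self, persons, mask):
        self.persons = persons  # the simulation's full population
        self.mask = mask        # boolean mask selecting the sub-population

    def __call__(self, variable_name, period):
        # A formula asking for a variable only receives the values of the subset.
        return self.persons(variable_name, period)[self.mask]
```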

Morendil (Contributor) commented May 4, 2019

The "depth" factor would strike me as an important one to study too, understood as how many vector computations are chained that depend on one another.

The benefits of SC will multiply in the common cases but that is a lower bound; given small enough frequencies, or (and that is likely common) mutually exclusive conditions at various depths, SC will eliminate altogether some branches of the computation tree.

bonjourmauko (Member) commented

The initial need of <15s for LexImpact has since been met.

Closing for now, do not hesitate to reopen in the future.

Thanks for all the work! ✨
