compute_summary_statistics could have option to use standard-error #1555

Open
tbhallett opened this issue Dec 12, 2024 · 7 comments · May be fixed by #1558
@tbhallett
Collaborator

Following discussion with @andrew-phillips-1 and @joehcollins, it seems that some analysts are summarising uncertainty in model results using the 'standard error'.
We could implement this in the utility method `compute_summary_statistics` to provide ease of access. An implementation could be as follows (building on #1457):

    if not use_standard_error:  # <--- `use_standard_error` is a new boolean argument, defaulting to False
        lower_quantile = (1. - width_of_range) / 2.
        stats["lower"] = grouped_results.quantile(lower_quantile)
        stats["upper"] = grouped_results.quantile(1 - lower_quantile)
    else:
        # Use the standard-error concept, whereby the interval expresses a 95% CI on the
        # value of the mean. The width of the interval narrows as the number of runs grows.
        std_deviation = grouped_results.std()
        std_error = std_deviation / np.sqrt(len(grouped_results))
        z_value = st.norm.ppf(1 - (1. - width_of_range) / 2.)  # requires `import scipy.stats as st`
        stats["lower"] = stats['central'] - z_value * std_error
        stats["upper"] = stats['central'] + z_value * std_error

A question for @andrew-phillips-1 -- I presume it's only appropriate to use this concept in certain circumstances -- what would these be? And also, I presume we should raise an error if someone tries to use a summary measure other than the mean?
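To make the proposal above concrete, here is a minimal, self-contained sketch of how the whole function might look. The function name `summarise`, the assumption that `grouped_results` is a `pandas.DataFrame` with one column per run, and the signature are all hypothetical -- the real implementation would live in `compute_summary_statistics` and follow whatever conventions #1457 settles on:

```python
# Hypothetical sketch only: names and layout (columns = runs for one draw) are
# assumptions, not the actual TLO method signature.
import numpy as np
import pandas as pd
import scipy.stats as st


def summarise(grouped_results: pd.DataFrame,
              width_of_range: float = 0.95,
              use_standard_error: bool = False) -> pd.DataFrame:
    """Summarise run-level results (columns = runs) for a single draw."""
    stats = pd.DataFrame()
    stats["central"] = grouped_results.mean(axis=1)

    if not use_standard_error:
        # Empirical quantiles across runs: width does not shrink with more runs.
        lower_quantile = (1.0 - width_of_range) / 2.0
        stats["lower"] = grouped_results.quantile(lower_quantile, axis=1)
        stats["upper"] = grouped_results.quantile(1 - lower_quantile, axis=1)
    else:
        # CI on the mean: half-width is z * s / sqrt(n), so it narrows with more runs.
        n_runs = grouped_results.shape[1]
        std_error = grouped_results.std(axis=1) / np.sqrt(n_runs)
        z_value = st.norm.ppf(1 - (1.0 - width_of_range) / 2.0)
        stats["lower"] = stats["central"] - z_value * std_error
        stats["upper"] = stats["central"] + z_value * std_error

    return stats
```

For example, five runs with values 1..5 give a mean of 3.0 and, with `use_standard_error=True`, a 95% CI of roughly (1.61, 4.39).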

@andrew-phillips-1
Collaborator

Yes, we would want the summary measure to be the mean, or else raise an error. Since this is based on a mean over multiple runs with the same parameter values, I can't think of exceptions where the mean and 95% CI would not be of value, but the analyst should obviously also think for themselves about whether what they are presenting makes sense.

@tbhallett
Collaborator Author

Ok, when #1457 is merged, I'll add this in.

@BinglingICL
Collaborator

Thanks very much all. This is very clear and helpful.

May I ask
(1) for each run of the same draw (i.e., the same parameter values), is the population independently sampled from the whole population? Or are the symptoms/conditions independently assigned to the same population sample?
(2) for each run, we have a result measure, such as the DALYs (which is scaled up for the whole population). Are we treating it as a single estimate of the whole population's health burden, so that the mean of results from multiple runs represents the estimated mean health burden of the whole population, and the standard error measures the variability of the estimated mean around the true mean of the population?

@tbhallett
Collaborator Author

tbhallett commented Dec 13, 2024

> May I ask (1) for each run of the same draw (i.e., the same parameter values), the population is independently sampled from the whole population? or, the symptoms/conditions are independently assigned to the same population sample?

Yes, different random-number-generator seeds for each run, so an "independent" draw of the same 'model'. This includes the properties of the population at the start of the simulation.

> (2) for each run, we have a result measure, such as the DALYs (which is scaled up for the whole population), are we treating it a single estimate of the whole population's health burden so that the mean of results from multiple runs represent the estimated mean health burden of the whole population and the standard error measure the variability of the estimated mean to the true mean of the population?

Yes, I believe so. (check with @andrew-phillips-1)

@andrew-phillips-1
Collaborator

Yes, this seems right. If you can imagine doing a million runs with a given set of parameter values, and doing this, say, 3 times, the mean for the output should be essentially identical over those three times. Call this the true mean. You can think of the 95% confidence interval for the mean based on a limited number of runs as the interval in which there is a 95% chance that the true mean lies.
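This intuition is easy to check numerically. The snippet below is a hypothetical illustration (the distribution, seed, and stand-in numbers are made up, not model output): the half-width of the 95% CI on the mean shrinks in proportion to 1/sqrt(n) as the number of runs grows, while an empirical quantile range would not:

```python
# Hypothetical illustration: per-run outputs drawn from a normal distribution
# stand in for per-run model results (e.g. total DALYs for one draw).
import numpy as np
import scipy.stats as st

rng = np.random.default_rng(seed=0)
z = st.norm.ppf(0.975)  # two-sided 95% CI

for n_runs in (10, 100, 1000):
    runs = rng.normal(loc=100.0, scale=20.0, size=n_runs)
    half_width = z * runs.std(ddof=1) / np.sqrt(n_runs)
    print(f"{n_runs:>5} runs: mean ~ {runs.mean():.1f}, CI half-width ~ {half_width:.2f}")
```

With 100x more runs the CI on the mean is roughly 10x narrower, which is exactly why this summary is only meaningful when the quantity of interest is the mean itself.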

@BinglingICL
Collaborator

> Yes this seems right. if you can imagine doing a million runs with a given set of parameter values and doing this say 3 times, the mean for the output should be essentially identical over those three times. Call this the true mean. You can think of the 95% confidence interval for the mean based on a limited number of runs as the interval in which there is a 95% chance that the true mean lies.

Thanks very much Andrew. This is super clear and helpful!! And thanks Tim, too.

@joehcollins
Collaborator

Thanks everyone - think this will be really helpful!
