
Refine the observation generation through internal testing #82

Open · hellais opened this issue Aug 23, 2024 · 3 comments

@hellais (Member) commented Aug 23, 2024

The goal of this issue is to collect initial feedback internally on how the observation generation and analysis is currently working.

There is a test instance of this running here:

For example, if you want to see the observations and analysis generated for the recent Nepal report measurements, you can do:

This issue is a place to collect all relevant bugs that are spotted and to identify any problems we run into. The primary focus is on observation generation, but it's also useful to start cataloging any specific cases where the analysis is grossly off.

One important caveat (which mostly applies to analysis) is that the processing is done directly in real-time when you access the page. This means that if a ground truth database does not exist, the analysis might be a bit off compared to how it will look in a real production scenario.

@sitinurliza95

Checking through some measurements where I found issues so far:

Seems OK so far, but I will need to go through more HK measurements, as they have the most issues.

@sitinurliza95

An OK measurement that is marked as blocking - blocked 0.6 (not sure why it has been showing a lot of OK on Explorer; I've just added the SG fingerprints in ooni/blocking-fingerprints#17):
https://data.ooni.org/analysis/m/20240401153454.011394_SG_webconnectivity_74d14611a801555f

@hellais (Member, Author) commented Oct 4, 2024

Thanks for documenting these cases! The ones marked as blocked 0.2 are instances in which the fingerprint is inconsistent with the country; hence we mark it as 0.2 blocked and use the down indicator to flag it as "down", in the sense that the probe is badly located or misconfigured.
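
To make that heuristic concrete, here is a minimal sketch, assuming a record of per-category likelihoods; the names `Outcome` and `score_fingerprint` are hypothetical and not the actual pipeline API, and the 0.9/0.7/0.1 values are made up (only the 0.2 comes from the description above):

```python
# Hypothetical sketch of the heuristic described above; names are illustrative,
# not the real ooni/data pipeline API.
from dataclasses import dataclass


@dataclass
class Outcome:
    ok: float
    down: float
    blocked: float


def score_fingerprint(fingerprint_cc: str, probe_cc: str) -> Outcome:
    """Score a blocking-fingerprint match against the probe's country code.

    A fingerprint belonging to the probe's own country is strong evidence of
    blocking. A country-inconsistent fingerprint suggests the probe is badly
    located or misconfigured, so `blocked` stays low (0.2) and most of the
    weight goes to the `down` indicator.
    """
    if fingerprint_cc.upper() == probe_cc.upper():
        return Outcome(ok=0.1, down=0.0, blocked=0.9)
    # Country-inconsistent fingerprint: probably a mislocated/misconfigured probe.
    return Outcome(ok=0.1, down=0.7, blocked=0.2)
```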

For these cases it might make sense to have an extra parameter that is used specifically to flag bad measurements, together with some likelihood distribution, similar to what we do at the moment with the failed flag.
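
A minimal sketch of what that could look like, assuming the same kind of likelihood record; the `bad` field and `flag_bad_probe` helper are invented names, mirroring how the existing `failed` likelihood sits alongside the other outcomes:

```python
# Hypothetical sketch only; `bad` is an invented field for the proposed flag.
from dataclasses import dataclass


@dataclass
class Outcome:
    ok: float = 0.0
    down: float = 0.0
    blocked: float = 0.0
    failed: float = 0.0  # existing: likelihood that the measurement itself failed
    bad: float = 0.0     # proposed: likelihood that the probe is mislocated or misconfigured


def flag_bad_probe(outcome: Outcome, weight: float = 0.7) -> Outcome:
    """Move likelihood mass from `down` into the proposed `bad` flag, so that
    "the probe is unreliable" is no longer conflated with "the target is down".
    """
    moved = min(outcome.down, weight)
    return Outcome(
        ok=outcome.ok,
        down=outcome.down - moved,
        blocked=outcome.blocked,
        failed=outcome.failed,
        bad=outcome.bad + moved,
    )
```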

For the NXDOMAIN one, I will have to look at that more carefully.

As for the one marked as blocked 0.6, given that we don't have the fingerprint in our DB, the value seems reasonable, though maybe it ought to be weighted a bit differently.

All in all, thanks for highlighting these cases; they are super helpful!
