-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Challenges in data curation #127
Comments
On 2022-07-26, G.h observed discrepancy in U.S. confirmed case data between different webpages within CDC's website: U.S. Map & Case Count page at 3,487 https://www.cdc.gov/poxvirus/monkeypox/response/2022/us-map.html |
Thank you for providing the dataset! Sorry for jumping in, but I tried to create a I assumed the followings. Is my understanding correct?
Is it possible to provide recovered/fatal data as well as confirmed? |
@lisphilar You're welcome! Please do not apologize for jumping in; we made our work open source because we want your input! Would you kindly open a new issue in this repository for the problem you described? This allows us to keep all "epics", features, and bugfixes discrete. I would also refer you to our data dictionary, which might help answer some of your questions. |
@jim-sheldon Thank you for your quick response! I just have created four issues #177 #178 #179 #180 and I'm looking forward to having discussion with you and your team there. |
Line list is discontinued as of 2022-09-22 |
Comments from discussion 2022-07-13
Errors.
Observed increase in reporting errors. Examples: ECDC report (Argentina, Australia), Spain (computer reporting issue), Belgium (cases don't sum to total https://epidemio.wiv-isp.be/ID/Documents/Monkeypox/MPX_Update_12072022_FR.pdf), U.S. CDC (Illinois reporting errors), etc. Sometimes errors are acknowledged, other times data is changed without notice.
Pattern of inconsistencies in reporting among global/regional reports (e.g. WHO, PAHO, ECDC) and in comparison to country level MOH reporting. Currently, curators identify a change in cases from these global/regional report updates and then look for secondary .gov (national/local) sources of information as verification. But, if we cannot find secondary sources, then we default to the global/regional report numbers. Example, Mexico (PAHO reporting 27 cases, could not verify through MOH site, defaulted to PAHO report #), Malta (WHO reporting 9 cases, could not verify through MOH site, default to WHO #), etc. Reminder to curators that it's important to look for secondary sources of information.
Changes in reporting formats.
Change in cumulative case calculations. Some countries now include probable counts in totals. confirmed + probable = total. Examples: Belgium, Australia.
Standard reporting format no longer supports tracking of confirmed and/or suspected cases. Example, changes to Brazil’s heat map that displays suspected case counts changed to aggregate numbers – so, now we only track confirmed cases.
"Active" versus "recovered/inactive" case status (no longer have the clinical symptoms of monkeypox, they have recovered from acute illness). Example, Italy, Andalusia cases have been reported as active case totals, but we are tracking cumulative totals. Reminder to curators to check cumulative counts (active + inactive). Due to limited metadata, we are not currently able to update individual case status to "recovered/inactive." https://www.rtvsol.es/noticias/andalucia/salud-y-familias-informa-de-que-actualmente-en-andalucia-hay-193-casos-activos-de-viruela-del-mono
The text was updated successfully, but these errors were encountered: