In this repository you can find a dataset with details about medalists at the 2020 Summer Olympics extracted from Wikipedia and Wikidata. When place of birth for a medalist was available, additional data have been included about the NUTS 2 or NUTS 3 region where they were born (relevant for European countries that are covered by NUTS).
The following files should be of most interest:
- medalists_all.csv: the main file, including all available data
- medalists_nuts_only.csv: only medalists with a place of birth available in Wikidata, and located within a NUTS region
- medalists_missing_place_of_birth.csv: table with all medalists with no recorded data of birth in Wikidata (you are welcome to contribute to reduce the size of this table by adding the relevant information to Wikidata if it publicly available).
- medals_per_million_residents_in_nuts2.csv: a possible way to look at the data
An interactive map with all medalists by place of birth is available following this link.
You can find the script used to generate this dataset in the file
index.Rmd
, or you can look at the rendered version with
comments.
For some early results based on this dataset, see:
- Total medalists: 2 401
- Total medalists with place of birth recorded in Wikidata: 2 334
- Total medalists with place of birth recorded in a NUTS region: 829
Last updated: 2022-01-27 17:14:17
This dataset has been generated by Giorgio Comai (OBCT/CCI) within the scope of EDJNet, the European Data Journalisn Network. It is released under a CC-BY license (Giorgio Comai/OBCT/EDJNet)