This is a scraper for The World Factbook.
The data is exported to a json
and to a csv
.
The exported data is in the public domain.
The Factbook is in the public domain. Accordingly, it may be copied freely without permission of the Central Intelligence Agency (CIA). The official seal of the CIA, however, may NOT be copied without permission as required by the CIA Act of 1949 (50 U.S.C. section 403m). Misuse of the official seal of the CIA could result in civil and criminal penalties. About The World Factbook - Copyright and Contributors
The file country_data_codes.csv
is used as the list of countries to scrape.
It is in the public domain, as it also provided by the CIA (source).
The resulting dataset is available on kaggle.
- Remove text from numerical values and add text to column name.
- Now:
column name = "Coastline"
,value = "7,600 km"
- After:
column name = "Coastline (km)"
,value = 7600
- Now:
- Scrape additional content:
- when the page was last updated,
- photos,
- audio samples,
- oceans,
- references,
- the CIA world factbook from previous years.