This repo aims to analyze EUBUCCO v0.1 to inform cleaning and downstream applications.
One main feature is to create automated factsheets for every country. These enable us to have an overview of what is available in EUBUCCO, how it compares to Microsoft ML Buildings, how reliable seems to be our data, where there seems to be issues, etc. The factsheets are currently built using EUBUCCO's city-level overviews and are located here.
A second main feature of this collaborative repo is that any user of EUBUCCO is invited to contribute the insights they have gathered through their interaction with the data.
The repo also aims to transparently share with all users the existing limitations of the dataset that are important to be aware of for downstream usage.
To contribute, please fork the repo, commit some changes to the Markdown files / add images (images of issues go here with the name <country>_<issue>.<format>
), and make a PR.
Please add any issue you came across while using the EUBUCCO data in the Known Issues section of a country factsheet (example), following the template.
What cleaning steps could we undertake to fix the data? Leave your ideas in the Recommandations section.
What other metrics would be interesting to carry out? Shoot me an email at milojevic@mcc-berlin.net.