-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop geographic sampling strategy based on Worldcover #28
Comments
As much as possible, please try to tweak this to add other sampling sources. If we only sample to have all and roughly equal classes of land cover, we are teaching the model that that is the important variance to pay attention. This is not wrong, but I worry we will underindex the semantics that many applications need to learn about nature (besides land cover) and people. I.e. if possible sample also from: For nature: For people and human assets:
|
* Add landcover based sampling scripts Closes #28 * Drop duplicates, fix typo, uncomment compute_stats function. * Fix comment that was out of sync with code
To have balanced data we probably need to oversample certain land cover types such as urban areas and wetlands. We will use a simplified version of the Worldcover dataset for this sampling effort.
Deliverables:
The text was updated successfully, but these errors were encountered: