-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ geography request ] - add all GADM data #5654
Comments
If you need help finding the source Wikipedia, there are several countries we need and could prepare that data. |
@sharpphyl I'm not sure how much lookup help I'll need (some I'm sure, but I've been working on my scripts so hopefully it won't be entirely manual), I could definitely use help prioritizing. |
I don't know that I can help here - this seems like something we should have planned to do at the time we made the switch to GADM. Any idea how much time this will take? This could help determine if we should just keep adding things as needed or take a day? week? of @dustymc time to get it done. Also, how often will this need re-doing? Do we know when GADM updates occur and how are we going to keep up with them? |
Does this issue mean that ultimately we won't need to request the addition of subdivisions that are listed in GADM or should we go ahead and fill out the Geography Request form when needed? We need provinces in Myanmar and Sri Lanka for catalog records we've already uploaded with incomplete higher geography - so there's no rush but ultimately we want to include the first level administrative unit once it's available. Myanmar should be fun. Per Wikipedia "Myanmar is divided into twenty-one administrative subdivisions, which include 7 regions, 7 states, 1 union territory, 1 self-administered division, and 5 self-administered zones." Online, GADM lists 15 first-level divisions. Arctos has 11 - a mixture of states, districts, etc. We need Kayah State. Sri Lanka is also an issue as GADM (online) shows 25 districts as first-level subdivisions and Wikipedia says "Sri Lanka is divided into 9 provinces, which are further subdivided into 25 districts." Do we want to go with Wikpedia's 9 provinces or stick with GADM's 25 districits? We have 11 of the districts in Arctos. We need Saharagamuwa Province although could use one of the two districts in this province if that's what's in Arctos. I can work up a list of the missing administrative units and file it as an issue under Geography Request or wait until the rest of the administrative units are added per this issue. But I agree that this would be much better done by a Geography committee that can recommend how to deal with countries that don't have straight-forward administrative divisions as other collections may have different priorities and need a different approach from what our collection needs. |
Yes. As usual I'm swamped (but I'm sorta in between waves and can breath for the next 9 seconds or something!) and don't know how to prioritize (but I think #5331 is the biggest squeak at the moment), so I suppose country requests are still the way to go for now. I have got that decently refined (I hope!) so just "Myanmar" is plenty to get started.
This is where our "do what GADM does" policy really shines: we don't have to make any decisions whatsoever, we've already made it. And whatever GADM does, most everybody who does anything spatial will use that anyway so we can still talk to them (if only to grumble about how weird GADM is!). |
PS. I am working on processing GADM 4.1 I will look into those
countries's adm_1 divisions
so more soon...
…On Tue, Mar 14, 2023 at 2:59 PM dustymc ***@***.***> wrote:
ultimately we won't need to request the addition of subdivisions
Yes.
As usual I'm swamped (but I'm sorta in between waves and can breath for
the next 9 seconds or something!) and don't know how to prioritize (but I
think #5331 <#5331> is the
biggest squeak at the moment), so I suppose country requests are still the
way to go for now. I have got that decently refined (I hope!) so just
"Myanmar" is plenty to get started.
don't have straight-forward administrative divisions
This is where our "do what GADM does" policy really shines: we don't have
to make any decisions whatsoever, we've already made it. And whatever GADM
does, most everybody who does anything spatial will use that anyway so we
can still talk to them (if only to grumble about how weird GADM is!).
—
Reply to this email directly, view it on GitHub
<#5654 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AATH7UKQ32B32IG74EOFSMTW4DS5PANCNFSM6AAAAAAU36TQ44>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
??? |
Oh: processing= download 5 gb geodb file; sort through adm_0,1,2, simplify and send |
Ah, thanks. No, I've got scripts set up to pull straight from GADM. |
@dustymc can you reload Myanmar from service? I dont know why Kayah State is missing for Phyllis because it's there in the shapefile (and the other missing states) Same with reloading Sri Lanka-- if we stick with adm_1 then we should have 25 districts (all included in GADM v4.1) |
Yep, but priorities. (I think mine is currently ironing out what I can regarding identifiers then #5331 - but #5193 doesn't seem addressed but I can't do it by myself and IDK if folks have checked and like it (doesn't seem possible) or ????????? - pleasepleaseplease redirect me if I'm lost!!) The biggest of those is #5383 (but that got weird and I think maybe now I'm also expected to clean up the nonsense that's been abandoned in the bulkloader and #5594 isn't getting a response and .... help?)
I have scripts, you are VERY welcome to run them, but you need to arrange access to the VMs and a scary postgres password and some understanding of some very specialized 'me-tools.'
I'm tempted to propose an issue per country (which could be immediately closed if there are no problems) but ???????? My scripts can't ENTIRELY automate that, but it would not be a difficult thing to produce either. |
This doesn't seem like it would be super difficult - just remove "county" or "parish" from any higher geography that includes United States? |
@mkoo - can we concat engtype_1 back in, which will take us back to having Bla County (and such). redo myanmar and ping issues committee |
@mkoo bringing engtype_1 back in isn't going to work, or at the very least will be inconsistent or complicated. I pulled a fresh copy of the US, and I get...
Nice.
I think consistency would require "Connecticut State" - somehow I didn't see that coming! "District of Columbia Federal District" is nice through.... Here's what we were hoping for:
I'm not sure where to go for that. I can't see how this is going to work without us being consistent - enough so that some external user (GBIF or whatever) could follow. I also noticed the Also, there are a bunch of interesting "counties" in this, and if we're to follow out "do what gadm does" rule (and I think we must if this is going to make sense as scale) then I think we have to include them?? And if we're pulling "County" then we're stuck with "Water body."
I'll pull some data so we can all see what I see. Can we schedule another task force locality meeting to talk about this? I'll go make #6051 in some painful and probably inconsistent way.... |
https://docs.google.com/spreadsheets/d/1crtMJWnEzHLjyT_GZqrfYpqck9GvhW6T41VptZGjS2Y/edit?usp=sharing is everything GADM knows about USA, minus the geometry, as it makes it into Arctos, in three tabs. The important nodes of that pathway:
(And I think I figured out some of the internal mystery: I changed tools when I got a newer VM, it's bringing in more data - and letting me direct it - than the previous tool was. tl;dr: I can see stuff I previously could not!) |
@ArctosDB/arctos-working-group-officers @ArctosDB/geo-group help! |
I'm not sure how to help? do we need a quick call? |
IDK? I was thinking scheduling a group meeting, but it'd be pretty nifty if I'm just missing something obvious and someone can tell me how to fix this here so at the very least I can add a missing county without giving myself a bunch of chances to muck it up. Maybe this is as simple as "ranking" only GADM2 (which will make this only relevant to US and UK at the moment)?? IDK if that's predictable enough to eg pass seamlessly between Arctos and BerkeleyMapper and GBIF, but I could probably keep it straight, and that's something. ?????? |
I thought about that - but I feel certain that the "ranks" for some GADM1 stuff will get requested eventually? If I had no input and needed to do this NOW, then "ranking" only GADM2 seems like a viable option. I can't answer the timing question though! |
That was my initial thought-- only needed for GADM2 (so only Eng_type2) I dont want to imagine if we started with California State everywhere! That should get us far for now |
Sounds like a proclamation from above to me, going back active..... |
My own proclamation: Ignore case. GADM insists https://en.wikipedia.org/wiki/DeSoto_County,_Florida is Desoto, I don't think it's right. |
Our first GADM2 entity that does not more or less mean "county": https://arctos.database.museum/place.cfm?action=detail&geog_auth_rec_id=10020505 |
Virginia because its weird: virginia.csv.zip |
United States should now be synced with GADM, except a few cases with their own Issues. |
United Kingdom is synced with GADM or has Issues. |
This can be cool - but it would also be nice to see United States, Lake Michigan Water body |
There are two ways in which that could happen.
|
GADM has 112 subdivisions under England. We have 46. We could use a few more for databasing. Do I need to file an issue or are they in the works? |
|
Got it. |
Happy to help also |
Done? There are some issues, #6059, and I'm sure a few things that just got ignored, but Arctos should now very nearly have full global spatial coverage through GAMD and IHO. |
Explain what geography needs created.
All GADM not already in Arctos.
@mkoo will help find a way to address https://github.com/ArctosDB/internal/issues/222, let's just fill in the gaps.
(Please fix whatever I got wrong.)
The text was updated successfully, but these errors were encountered: