Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE]: Population per state/year on IBGE #191

Closed
fccoelho opened this issue Mar 30, 2024 · 6 comments
Closed

[FEATURE]: Population per state/year on IBGE #191

fccoelho opened this issue Mar 30, 2024 · 6 comments
Assignees

Comments

@fccoelho
Copy link
Collaborator

The dataSUS FTP has this info in the directory /dissemin/publicos/IBGE/pop

This should me made accessible via the IBGE module

@fccoelho fccoelho self-assigned this Mar 30, 2024
@maxbiostat
Copy link

Gotta be careful with how these projections are done, however.

@turicas
Copy link

turicas commented Apr 1, 2024

I've implemented the populacao_estimada.py script which finds, downloads and converts/normalizes data from IBGE. It currently downloads and normalizes estimates only but I'm willing to also add support for census data. Maybe we can reuse some of that code here.
The result CSVs are hosted on repository, so you can check if it has the information you need (it can download more than one estimate for year, if available).

@fccoelho
Copy link
Collaborator Author

fccoelho commented Apr 1, 2024

Interesting because it pulls from IBGE FTP server directly. What we are doing so far is to use the data collected made available by DATASUS.

I think it is worth to integrate your script as well, and make available both sets of estimates, because sometimes people have reasons to stick with one or the other source. Also, the estimating methodologies for non-census years may be different. @luabida can you take a Look at @turicas code?

@fccoelho
Copy link
Collaborator Author

fccoelho commented Apr 1, 2024

It would be nice to adapt the code to use the same strategy to scan the FTP server direcly as we do in PySUS for the DATASUS FTP server, instead of relying on a list of hardcoded urls for the xls files.

@turicas
Copy link

turicas commented Apr 1, 2024

It would be nice to adapt the code to use the same strategy to scan the FTP server direcly as we do in PySUS for the DATASUS FTP server, instead of relying on a list of hardcoded urls for the xls files.

The list of XLS files is hard-coded just because I didn't want to scrape everytime I run the script (since it does not change very often), but I implemented the code to extract the URLs automatically (so you don't need to rely on that hard-coded dict).

@luabida
Copy link
Collaborator

luabida commented May 14, 2024

Finished on #193, the code can be seen here

@luabida luabida closed this as completed May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants