-
Notifications
You must be signed in to change notification settings - Fork 661
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Captcha for the CNPJ #42
Comments
@jaysonsantos I guess not. Well, at least it is not a big deal, since many Brazilian companies sell it as service/product. Do you have annotated data for that? |
@montenegrodr drop me an email and i can share what I have with you and maybe you can use it. |
@jaysonsantos montenegrodr at GMAIL |
According to the 8.777 / 2016 decree the Ministry of Finance must open the CNPJ data up until november 11th http://www.planalto.gov.br/ccivil_03/_Ato2015-2018/2016/Decreto/D8777.htm |
Hi all, i've published the code here https://github.com/jaysonsantos/captcha-breaker |
That is a high value dataset that unfortunately isn't open yet in Brazil. At least there are some other sources for company data. Incomplete, but still could be useful. OpenLEIs has 1047 Brazilian companies. Companies that trade assets publicly abroad (such as the NYSE) are legally required to register for a LEI, which is an international legal entity identifier. This dataset is already open. The voluntary federal fund transfer system (Siconv) has a daily updated open data API with some 43962 legal entities. These are mostly city council public bodies and non-profits civil society organizations. However, it might be useful to include those in the database to detect possible cases where a politician might use an NGO's legal entity to be reimbursed for a fictional expense. The federal government's procurement system also has an open data API. If a company has done business with the federal government or has registered itself to be able to participate in procurement at some point in its life, it is on the supplier registry (SICAF). Unfortunately, the API's data has not been updated since 2015, however, its registry of 333977 legal entities ought to be useful. |
@caduvieira do we have any update on the status of the Decreto 8.777, will the Receita release such information? |
Not yet. I open a LAI request but I don't have a response yet. |
Based on the conversations I had at CGU this week, Federal Revenue is probably far from publicly share their databases (regardless the existence of this decree). We can and should open a official request, but start discussing alternatives is already welcome. @caduvieira Can you post the request number so we can follow the progress with you? |
The request was already denied. I asked for a review and I am waiting for a response. They claimed that they don't have the technology to open the data. |
Meanwhile we could use the https://www.receitaws.com.br/ for check data |
That's what we're doing ; ) |
Closing this: ReceitaWS is already in use and recently started to offer data that was missing (partners list). |
Add tests for loadsuppliers static methods
Is it illegal to break the captcha on Receita's website? If it is not maybe I can share a project that I did using scikit-learn.
The text was updated successfully, but these errors were encountered: