Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Captcha for the CNPJ #42

Closed
jaysonsantos opened this issue Sep 7, 2016 · 13 comments
Closed

Captcha for the CNPJ #42

jaysonsantos opened this issue Sep 7, 2016 · 13 comments

Comments

@jaysonsantos
Copy link

Is it illegal to break the captcha on Receita's website? If it is not maybe I can share a project that I did using scikit-learn.

@montenegrodr
Copy link

@jaysonsantos I guess not. Well, at least it is not a big deal, since many Brazilian companies sell it as service/product. Do you have annotated data for that?

@jaysonsantos
Copy link
Author

@montenegrodr drop me an email and i can share what I have with you and maybe you can use it.

@montenegrodr
Copy link

@jaysonsantos montenegrodr at GMAIL

@caduvieira
Copy link
Contributor

According to the 8.777 / 2016 decree the Ministry of Finance must open the CNPJ data up until november 11th http://www.planalto.gov.br/ccivil_03/_Ato2015-2018/2016/Decreto/D8777.htm

@jaysonsantos
Copy link
Author

Hi all, i've published the code here https://github.com/jaysonsantos/captcha-breaker

@augusto-herrmann
Copy link

That is a high value dataset that unfortunately isn't open yet in Brazil.

At least there are some other sources for company data. Incomplete, but still could be useful.

OpenLEIs has 1047 Brazilian companies. Companies that trade assets publicly abroad (such as the NYSE) are legally required to register for a LEI, which is an international legal entity identifier. This dataset is already open.

The voluntary federal fund transfer system (Siconv) has a daily updated open data API with some 43962 legal entities. These are mostly city council public bodies and non-profits civil society organizations. However, it might be useful to include those in the database to detect possible cases where a politician might use an NGO's legal entity to be reimbursed for a fictional expense.

The federal government's procurement system also has an open data API. If a company has done business with the federal government or has registered itself to be able to participate in procurement at some point in its life, it is on the supplier registry (SICAF). Unfortunately, the API's data has not been updated since 2015, however, its registry of 333977 legal entities ought to be useful.

@tomascamargo
Copy link

@caduvieira do we have any update on the status of the Decreto 8.777, will the Receita release such information?

@caduvieira
Copy link
Contributor

Not yet. I open a LAI request but I don't have a response yet.

@Irio
Copy link
Collaborator

Irio commented Nov 11, 2016

Based on the conversations I had at CGU this week, Federal Revenue is probably far from publicly share their databases (regardless the existence of this decree). We can and should open a official request, but start discussing alternatives is already welcome.

@caduvieira Can you post the request number so we can follow the progress with you?

@caduvieira
Copy link
Contributor

The request was already denied. I asked for a review and I am waiting for a response. They claimed that they don't have the technology to open the data.
https://esic.cgu.gov.br/sistema/Pedido/DetalhePedido.aspx?id=UX6bMBnVD30=

ESIC Report

@caduvieira
Copy link
Contributor

Meanwhile we could use the https://www.receitaws.com.br/ for check data

@cuducos
Copy link
Collaborator

cuducos commented Mar 14, 2017

Meanwhile we could use the https://www.receitaws.com.br/ for check data

That's what we're doing ; )

@cuducos
Copy link
Collaborator

cuducos commented Mar 24, 2017

Closing this: ReceitaWS is already in use and recently started to offer data that was missing (partners list).

@cuducos cuducos closed this as completed Mar 24, 2017
Irio pushed a commit that referenced this issue Feb 27, 2018
Add tests for loadsuppliers static methods
cuducos added a commit that referenced this issue Feb 28, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants