Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What's expected as outcome of processing? #13

Open
abolotnov opened this issue Dec 17, 2018 · 3 comments
Open

What's expected as outcome of processing? #13

abolotnov opened this issue Dec 17, 2018 · 3 comments

Comments

@abolotnov
Copy link

Hi,

I've gotten it installed per the instruction in local mode and downloaded filing index for 2018 and process_all_filing_index(year=2018, form_type_list=["10-Q"])

celery picked up and after some time ended up with a lot of txt (looks like mixture of txt and HTML) content in edgar/data folder and records in _companyinfo and _filing, _filingdata records. But no actual content broken down into sections/individual pieces. Is this expected outcome? Do I need to do additional processing to extract the actual content?

Also, the django app - is this just a skeleton and not supposed to do anything other than user registration and login/logout?

thanks!

@johllmichael
Copy link

Last I checked, this script was written for 10-Ks. Try, "10-K" instead. I know C++, but I am learning Python so I can figure out how to get this work. Do you use Visual Studios for coding?

@abolotnov
Copy link
Author

I gave up on this one, it does work, but I don't understand how to validate it completed everything properly because I don't understand what it's outcome should ultimately be. Tika keeps dying with out of memory, regardless of different configurations I tried, including large instances. Besides, looks like developers abandoned the project or something.

@jcrben
Copy link

jcrben commented Dec 20, 2019

@abolotnov did you find anything better?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

3 participants