-
Notifications
You must be signed in to change notification settings - Fork 661
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Backup pictures from all receipts #33
Comments
I can write a But let's find out ; ) If you don't mind, assign that Issue to me and I tackle it soon. (OCR'd be awesome in a near future!) |
As I understand, the initial idea is just to have some kind of routine to download everything to a given folder for local storage. I'm ok with that. It's within the scope of this issue to create a secondary, online source, for this image bank? Something like S3? If this is desirable, making this backup routine somewhat incremental and recurrent is necessary. What do you both think about it? Maybe split this task into a different issue? |
@andrewhr Exactly : ) As we say in the
The |
Complementing my last comment: I don't mean that everything we got is working perfectly, I just wanted to say that we have a basis to what you're proposing, @andrewhr — we are in tune! That said any kind of improvement in this pipeline is welcomed. Feel free to start a new Issue to discuss and implement enhancements on that topic ; ) |
Script to fetch receipt images Fix #33
It is vital for the project have a way of accessing all receipts, from any reimbursement since the first available and not depend from Chamber of Deputies.
Besides having proofs for legal reports, its useful for offline analyses. #32 is one I think about; doing OCR for generating new structured data is another.
Here's a function that, based on a record from quota datasets, returns the picture URL from the Chamber of Deputies' website:
The text was updated successfully, but these errors were encountered: