Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Campos dos Goytacazes-RJ spider refactor
The way the spider was implemented assumed that there could only be a single file_url per day per is_extra_edition value, which was not always true. This refactoring gathers all the various files per day and is_extra_edition. We addressed the text format for Saturday gazettes to be considered is_extra_edition. We also included the start_date and end_date handling, and edition_number when applicable. resolve okfn-brasil#637
- Loading branch information