Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Monster PR] Upgrade to PDFBox 2.0 (#150)
* Starting with upgrade to PDFBox 2.0 (#52) * 2.0 * little progress in upgrading to pdfbox 2 * upgrade to pdfbox 2 starting to show signs of life * Fix TextElement creation * fix tabs * Use the code from LegacyPDFStreamEngine to create the TextElements * Fix removeText function using the example: org.apache.pdfbox.examples.util.RemoveAllText * close the document * close removed text document * fix array serialization * add spanning cells test with CSV format * - Remove capheight calculation - Temporally set height * Test writer two tables checking the json result object instead of the string Add a test writer two tables for CSV output * Fix pageTransform when there is a rotation Add more csv tests * fix path iterator * update json tests * update json outputs * upgrade pdfbox version * back to the old implementation and catch the IndexOutOfBoundsException * Remove hardcoded code * Remove more hardcoded code * test all the elements of the detected table * Change the expected table top value * Increase the threshold factor to support a greater headings * Fix rectangle comparator. * fix wrong expected column size, 5 instead of 6. add more tests * update expected table, more spaces are expected to respect the alingment. * when the text value has length > 1, clean the spaces. * clean code * remove stackstrace * add log error * upgrade all dependencies * code formatting * setting pom to snapshot version
- Loading branch information