-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: Addition of optional visitor-functions in extract_text() #1252
Commits on Aug 18, 2022
-
ENH: Added visitor-callbacks in PageObject.extract_text(...).
You may use this callbacks to visit all operators and its arguments and to get the positions of the text-objects. You may use this to extract the rectangles of a table and the texts in its cells in some PDF files.
Configuration menu - View commit details
-
Copy full SHA for 76801d7 - Browse repository at this point
Copy the full SHA 76801d7View commit details -
TST: Test of visitor-callbacks in extract_text().
It extracts labels of rectangles in Figure 2 of GeoBase_NHNC1_Data_Model_UML_EN.
Configuration menu - View commit details
-
Copy full SHA for 39a9f08 - Browse repository at this point
Copy the full SHA 39a9f08View commit details
Commits on Aug 19, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 92c0cf8 - Browse repository at this point
Copy the full SHA 92c0cf8View commit details -
Configuration menu - View commit details
-
Copy full SHA for c320ea8 - Browse repository at this point
Copy the full SHA c320ea8View commit details
Commits on Aug 20, 2022
-
TST: Added function extractTable(...) to read text in cells of a table.
The function extractTable(listTexts, listRects) uses the function extractTextAndRectangles(page, rectFilter) which uses the function extract_text with visitors to extract text in cells of a table.
Configuration menu - View commit details
-
Copy full SHA for 177fea2 - Browse repository at this point
Copy the full SHA 177fea2View commit details
Commits on Aug 22, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 4389590 - Browse repository at this point
Copy the full SHA 4389590View commit details -
ENH: Added visitor-callbacks in PageObject.extract_text(...).
You may use this callbacks to visit all operators and its arguments and to get the positions of the text-objects. You may use this to extract the rectangles of a table and the texts in its cells in some PDF files.
Configuration menu - View commit details
-
Copy full SHA for eccc779 - Browse repository at this point
Copy the full SHA eccc779View commit details -
TST: Test of visitor-callbacks in extract_text().
It extracts labels of rectangles in Figure 2 of GeoBase_NHNC1_Data_Model_UML_EN.
Configuration menu - View commit details
-
Copy full SHA for 8297b13 - Browse repository at this point
Copy the full SHA 8297b13View commit details -
Configuration menu - View commit details
-
Copy full SHA for 165b686 - Browse repository at this point
Copy the full SHA 165b686View commit details -
TST: Added function extractTable(...) to read text in cells of a table.
The function extractTable(listTexts, listRects) uses the function extractTextAndRectangles(page, rectFilter) which uses the function extract_text with visitors to extract text in cells of a table.
Configuration menu - View commit details
-
Copy full SHA for ed784e9 - Browse repository at this point
Copy the full SHA ed784e9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9922f1c - Browse repository at this point
Copy the full SHA 9922f1cView commit details -
ENH: visitor_text additionally gets font-dictionary and font-size.
When executing extract_text(...) the optional visitor-function visitor_text gets the font-dictionary and the font-size. The font-dictionary contains the font-name and other font properties.
Configuration menu - View commit details
-
Copy full SHA for ae7c993 - Browse repository at this point
Copy the full SHA ae7c993View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4afa052 - Browse repository at this point
Copy the full SHA 4afa052View commit details
Commits on Aug 23, 2022
-
Configuration menu - View commit details
-
Copy full SHA for f83ae31 - Browse repository at this point
Copy the full SHA f83ae31View commit details
Commits on Sep 14, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 18d2f4a - Browse repository at this point
Copy the full SHA 18d2f4aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 19003b3 - Browse repository at this point
Copy the full SHA 19003b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for a5b8b44 - Browse repository at this point
Copy the full SHA a5b8b44View commit details
Commits on Sep 17, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 17f2d61 - Browse repository at this point
Copy the full SHA 17f2d61View commit details
Commits on Sep 18, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 72e51be - Browse repository at this point
Copy the full SHA 72e51beView commit details
Commits on Sep 24, 2022
-
Configuration menu - View commit details
-
Copy full SHA for fe11b54 - Browse repository at this point
Copy the full SHA fe11b54View commit details -
Configuration menu - View commit details
-
Copy full SHA for ab5d118 - Browse repository at this point
Copy the full SHA ab5d118View commit details -
Configuration menu - View commit details
-
Copy full SHA for c5733f5 - Browse repository at this point
Copy the full SHA c5733f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9aad439 - Browse repository at this point
Copy the full SHA 9aad439View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3809522 - Browse repository at this point
Copy the full SHA 3809522View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5b87ecc - Browse repository at this point
Copy the full SHA 5b87eccView commit details -
Configuration menu - View commit details
-
Copy full SHA for e47e16c - Browse repository at this point
Copy the full SHA e47e16cView commit details -
Configuration menu - View commit details
-
Copy full SHA for fb7807c - Browse repository at this point
Copy the full SHA fb7807cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1969c9f - Browse repository at this point
Copy the full SHA 1969c9fView commit details