You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to customize the underlying pdfminer text extract process for better tune my text extraction pipeline by setting some LAParams values falling into an error:
Traceback (most recent call last):
File "/usr/local/anaconda3/envs/biome/lib/python3.7/unittest/case.py", line 59, in testPartExecutor
yield
File "/usr/local/anaconda3/envs/biome/lib/python3.7/unittest/case.py", line 615, in run
testMethod()
File "/Users/frascuchon/recognai/pdfplumber/tests/test_laparams_customization.py", line 20, in test_load_with_custom_laparams
print(first_page.chars)
File "/Users/frascuchon/recognai/pdfplumber/pdfplumber/container.py", line 35, in chars
return self.objects.get("char", [])
File "/Users/frascuchon/recognai/pdfplumber/pdfplumber/page.py", line 66, in objects
self._objects = self.parse_objects()
File "/Users/frascuchon/recognai/pdfplumber/pdfplumber/page.py", line 167, in parse_objects
process_object(obj)
File "/Users/frascuchon/recognai/pdfplumber/pdfplumber/page.py", line 140, in process_object
fork, vinobj.__dict__.items()
File "/Users/frascuchon/recognai/pdfplumber/pdfplumber/page.py", line 141, in<genexpr>if k not in IGNORE)
KeyError: 'index'
I'm trying to customize the underlying pdfminer text extract process for better tune my text extraction pipeline by setting some
LAParams
values falling into an error:You can check on your own with this code snippet:
The text was updated successfully, but these errors were encountered: