You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A question, is there a way to tweak the settings or add a pipeline step to detect highlight boxes? Many PDF documents use these and lay them out in a manner that splits sentences, in my postprocessing I try to detect this based on node type and text content but it is unreliable (second half of split sentence beginning with a caps abbreviation resulting in "runaway boxes" etc).
It would be of great help if it would be possible to group box content, for instance in a GroupItem? Is that possible somehow? For instance by tweaking the table detection to detect these as single cell tables or other solution?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
First off, great job devs!
A question, is there a way to tweak the settings or add a pipeline step to detect highlight boxes? Many PDF documents use these and lay them out in a manner that splits sentences, in my postprocessing I try to detect this based on node type and text content but it is unreliable (second half of split sentence beginning with a caps abbreviation resulting in "runaway boxes" etc).
It would be of great help if it would be possible to group box content, for instance in a GroupItem? Is that possible somehow? For instance by tweaking the table detection to detect these as single cell tables or other solution?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions