You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi everyone, I have some difficulties extracting text from my pdf files using pdfjs-dist 4.6.82
I'm using react.js
I'll briefly explain the purpose of using the library.
Purpose:
By selecting a portion of a pdf file I have to extract from it all the texts found within the selection.
As long as I use pdf, let's say in A4 format, the program can extract the text correctly, but if I use A0 files in landscape when I extract the texts through the page.getTextContent() the corresponding values of x and y are not correct
I'll give a practical example with two images
getTextContent() result
coordinate display when I position the mouse on the word "PLANIMETRIA"
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi everyone, I have some difficulties extracting text from my pdf files using pdfjs-dist 4.6.82
I'm using react.js
I'll briefly explain the purpose of using the library.
Purpose:
By selecting a portion of a pdf file I have to extract from it all the texts found within the selection.
As long as I use pdf, let's say in A4 format, the program can extract the text correctly, but if I use A0 files in landscape when I extract the texts through the page.getTextContent() the corresponding values of x and y are not correct
I'll give a practical example with two images
I hope I was clear
thank you for your help
Paoul
Beta Was this translation helpful? Give feedback.
All reactions