-
Notifications
You must be signed in to change notification settings - Fork 548
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
getDataTm is not returning all the text #761
Comments
// Extract PDF (Text Based Content) This works for me |
Yup. That is because the pdfparser/src/Smalot/PdfParser/Document.php Line 439 in 0ddcc54
But sadly, I need the position of the text too. So I can't just use the Maybe a solution could be to move that check to |
I created a PR to fix this: #762. Can someone take a look at it? |
Description:
PDF input
pdf-with-text.pdf
Expected output & actual output
The PDF contains the word
{{signer1}}
, but thegetDataTm
does not return this text for the second page.Note that: The
getTextArray
is returning{{signer1}}
. The problem has to do something withgetTextArray
returning an empty string for the second page. (Probably because of the image?)Code
The text was updated successfully, but these errors were encountered: