Replies: 1 comment 5 replies
-
Without seeing the actual PDF file, this is just guessing, but in general, pypdf just extracts the characters present there. You might be able to filter this by text position, but it sounds like this is not what you are looking for. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am trying to extract text using pdfreader. Symbols for email, linkedin , phone are converted to text. Is there any way, I can restrict them to not to convert to text?
Beta Was this translation helpful? Give feedback.
All reactions