Last post Jan 10, 2018 09:43 PM by PaulTheSmith
Jan 09, 2018 06:04 PM|MikeT89|LINK
When I open and read the pdf file everything looks fine, but whenever I try to read and parse that same pdf file all of a sudden there are a bunch of extra characters. And so whenever my code is looking for a specific string, it's not finding it.
When I open the pdf file I see this
Membership ID: 1111111
But when I open and parse each line I get this
MembershipMembership ID:ID: <<MemberId>>1111111
Can someone explain to me why those extra characters are there? And how can I get rid of them or account for them in my code when I'm reading and parsing pdf files.
Jan 09, 2018 07:08 PM|ryanbesko|LINK
What library are you using to parse the pdf? If not using any library you are most likely seeing the objects the text is actually stored in.
Jan 10, 2018 05:27 PM|MikeT89|LINK
I'm using aspose library
Jan 10, 2018 09:43 PM|PaulTheSmith|LINK
Maybe the Aspose support forums would give better help on Aspose products?