Marked as answer by bbcompent1 on Nov 26, 2012 11:00 AM
upamanyuroyc...
0 Points
10 Posts
Re: Convert PDF to Word
Nov 16, 2012 12:20 PM|LINK
Many thanks for your input! I know how to search Google and find a third-party solution such as iTextSharp, but the client's requirement is different: they do not want to rely on any third-party component.
If PHP and other open source communities can provide open source solutions and ready-to-plug-in code, why should I have to write this from scratch? That will only increase time and cost instead of letting me focus on the business logic.
I know the conversion process is complex; that is why I am looking for suggestions from .NET experts.
bbcompent1
All-Star
32978 Points
8502 Posts
Moderator
Re: Convert PDF to Word
Nov 16, 2012 12:26 PM|LINK
Well, does this Word document need to be editable? If not, you could render the PDF to an image and then drop the image into a .doc file. The biggest hurdle with PDF is that it is not in a standardized form like Word, Excel, etc. Actually turning a PDF into a real Word document might require an OCR component to recognize the characters; the caveat is that the formatting of fonts and images will be lost. Either way, it is one of the greater challenges in .NET. I will keep looking around to see if I can find anything that could help you do this, since it would probably be very useful to all of us. Have you looked into iTextSharp? I think they provide the source for it, so it might be useful in some way.
upamanyuroyc...
0 Points
10 Posts
Re: Convert PDF to Word
Nov 16, 2012 12:31 PM|LINK
I just need it converted to Word; it does not have to be editable.
But converting it to an image will definitely reduce the quality of the output.
iTextSharp and the others only provide an exe or dll; they do not provide any source code.
upamanyuroyc...
0 Points
10 Posts
Re: Convert PDF to Word
Nov 16, 2012 12:33 PM|LINK
I will just show it in a viewer control in ActiveReports.
bbcompent1
All-Star
32978 Points
8502 Posts
Moderator
Re: Convert PDF to Word
Nov 16, 2012 12:33 PM|LINK
If you convert it to SVG instead, it won't lose quality, since SVG is a vector format.
upamanyuroyc...
0 Points
10 Posts
Re: Convert PDF to Word
Nov 16, 2012 12:36 PM|LINK
OK, then I will try it this way:
Input (PDF) --> process --> output (SVG image) --> append to Word doc --> display in viewer
Let me try it once; I will keep you posted.
Many thanks for your time.
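
For anyone following the pipeline above, here is a rough sketch of the "append to Word doc" step with no third-party component at all. A .docx file is a zip package of XML parts, so a standard library is enough to wrap an image in one. This is a sketch under assumptions, not a tested production routine: it is written in Python only to keep it short, the helper names `tiny_png` and `build_docx` are made up for illustration, and it embeds a generated PNG as a stand-in for the rendered page (embedding SVG directly is not covered here).

```python
import io
import struct
import zipfile
import zlib

EMU_PER_INCH = 914400  # DrawingML measures sizes in English Metric Units
XML_DECL = '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>\n'
REL_NS = "http://schemas.openxmlformats.org/package/2006/relationships"
OFFICE_REL = "http://schemas.openxmlformats.org/officeDocument/2006/relationships"
DML = "http://schemas.openxmlformats.org/drawingml/2006"


def tiny_png(width=2, height=2, rgb=(200, 30, 30)):
    """Hypothetical helper: a solid-colour PNG standing in for a rendered page."""
    def chunk(tag, data):
        body = tag + data
        return struct.pack(">I", len(data)) + body + struct.pack(">I", zlib.crc32(body))

    header = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)  # 8-bit RGB
    row = b"\x00" + bytes(rgb) * width  # filter type 0, then the pixels
    pixels = zlib.compress(row * height)
    return (b"\x89PNG\r\n\x1a\n" + chunk(b"IHDR", header)
            + chunk(b"IDAT", pixels) + chunk(b"IEND", b""))


def build_docx(png_bytes, width_in=6.0, height_in=8.0):
    """Hypothetical helper: return the bytes of a .docx holding one inline image."""
    cx, cy = int(width_in * EMU_PER_INCH), int(height_in * EMU_PER_INCH)
    content_types = (
        '<Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types">'
        '<Default Extension="rels" ContentType='
        '"application/vnd.openxmlformats-package.relationships+xml"/>'
        '<Default Extension="xml" ContentType="application/xml"/>'
        '<Default Extension="png" ContentType="image/png"/>'
        '<Override PartName="/word/document.xml" ContentType='
        '"application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml"/>'
        '</Types>'
    )
    package_rels = (
        f'<Relationships xmlns="{REL_NS}">'
        f'<Relationship Id="rId1" Type="{OFFICE_REL}/officeDocument" Target="word/document.xml"/>'
        '</Relationships>'
    )
    document_rels = (
        f'<Relationships xmlns="{REL_NS}">'
        f'<Relationship Id="rId1" Type="{OFFICE_REL}/image" Target="media/image1.png"/>'
        '</Relationships>'
    )
    document = (
        '<w:document xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main"'
        f' xmlns:r="{OFFICE_REL}" xmlns:wp="{DML}/wordprocessingDrawing"'
        f' xmlns:a="{DML}/main" xmlns:pic="{DML}/picture">'
        '<w:body><w:p><w:r><w:drawing>'
        '<wp:inline distT="0" distB="0" distL="0" distR="0">'
        f'<wp:extent cx="{cx}" cy="{cy}"/><wp:docPr id="1" name="Page 1"/>'
        f'<a:graphic><a:graphicData uri="{DML}/picture"><pic:pic>'
        '<pic:nvPicPr><pic:cNvPr id="1" name="image1.png"/><pic:cNvPicPr/></pic:nvPicPr>'
        '<pic:blipFill><a:blip r:embed="rId1"/><a:stretch><a:fillRect/></a:stretch></pic:blipFill>'
        f'<pic:spPr><a:xfrm><a:off x="0" y="0"/><a:ext cx="{cx}" cy="{cy}"/></a:xfrm>'
        '<a:prstGeom prst="rect"><a:avLst/></a:prstGeom></pic:spPr>'
        '</pic:pic></a:graphicData></a:graphic></wp:inline>'
        '</w:drawing></w:r></w:p></w:body></w:document>'
    )
    parts = {
        "[Content_Types].xml": XML_DECL + content_types,
        "_rels/.rels": XML_DECL + package_rels,
        "word/document.xml": XML_DECL + document,
        "word/_rels/document.xml.rels": XML_DECL + document_rels,
    }
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as package:
        for name, xml in parts.items():
            package.writestr(name, xml)
        package.writestr("word/media/image1.png", png_bytes)
    return buffer.getvalue()
```

Writing the result of `build_docx(tiny_png())` to a file such as `page.docx` should give a one-image document; in a real run the PNG bytes would come from whatever renders the PDF page. The same package layout can be produced from .NET with its built-in zip or packaging classes, which keeps the "no third-party component" requirement intact.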