Hirendra Sisodiya
posted
489 posts
since
Sep 01, 2009
from
|
|
Re: c# code to extract data with table from pdf file
|
|
|
|
|
|
|
|
|
|
|
Hello Dennis
you can extract text from pdf file in these ways:
1. if you can change this pdf file in to .Tiff or .mdi format than you can use OCR technique through microsoft office Document Imaging(MODI.dll).
2.You can use any third party tool like 'LeadTool', some time ago i develop a Document management application and i use lead tool for image editing and extrating text from the images. Lead tool provides good experience with image management.
if you have any other wat for doing this please tell me also
thanks
Please mark as answer if it helps
|
|
|
|
|
hirendra
BCA, MCA, M.Phil., MVP(Mindcraker)
|
|
|
|
|
|
Dennis Jose
posted
2 posts
since
Mar 18, 2009
from
|
|
Re: c# code to extract data with table from pdf file
|
|
|
|
|
|
|
|
|
|
|
Thanks Hirendra.
Actually my aim is to convert a PDF document into EXCEL file.Using ITextSharp i can convert all text informations into excel.But i did n't get the right alignment as PDF.Is there any way to fetch PDF structure (rows and colums) from a PDF document so that i can transfer that alignment into EXCEL(this is my logic.I don't know is it correct or not ) ? .Or is there any way to transfer PDF documnet into Excel with the same structure?. Thanks in advance.
|
|
|
|
|
|