Pankaj Nagarsekar

Pankaj Nagarsekar

  • NA
  • 3
  • 15.5k

read data from pdf files using c#

Oct 5 2012 2:04 PM
I have this question. I have 5 PDFs having around 38,000 objective questions. So i want to make an application which imports this questions and save it into database and then give interface to the user for choosing question with four objective. I used itextsharp to read from PDFs as a chunk and also line by line. The content after reading is scattered and i cannot figure out a sequence by which i can split or differentiate between the question and the four objectives. Is there any better way by which I can import data from PDFs?? The content in PDFs is in tabular format.
Please check Pdf file (s8.postimage.org/owm0hsej9/Qbank.jpg)
resulting string in Window(s13.postimage.org/4toy70lqf/Resulting_String.jpg)

Answers (3)