Print Page | Close Window

Extracting and Searching Text

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: Sample Code
Forum Description: Share Debenu Quick PDF Library sample code with other forum members
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=2166
Printed Date: 22 Nov 24 at 7:11PM
Software Version: Web Wiz Forums 11.01 - http://www.webwizforums.com


Topic: Extracting and Searching Text
Posted By: dgdwilliams
Subject: Extracting and Searching Text
Date Posted: 21 Feb 12 at 2:13PM
I am using vb.net and are having some issue.  Is it possible to search through a pdf for a string and return the values in the next row at specific character locations?  is it possible to extract a PDF to text without it inserting blank lines where they would normally not be?  i am looking for some examples if anyone could help it would be appreciated.
 
Thanks



Replies:
Posted By: edvoigt
Date Posted: 21 Feb 12 at 2:41PM
Hi,

your question might be answered in a few posts at here. You should try the different options for ExtractPageFileText and play with them using a smaller PDF, to get some feeling. After this, it is easier to understand the problems around searching and adding text.

Please keep in mind, that a PDF is no document like word works with. It is for example a lot of work to insert a forgotten letter in a word, because all things, which may surround this word will not automatically change their own position. It is more like a picture (because a PDF describes a layout) which want to paint on later.

Your question about blank lines shows in a wrong direction. You want to hold a place free. Ok, but for this is it will be enough, to put the next line more down.

Try a look into the PDF-specification from Adobe. Read about operators and text. Then you get an idea what all QuickPDF does for you and where are limits.


Good luck,
Werner



Print Page | Close Window

Forum Software by Web Wiz Forums® version 11.01 - http://www.webwizforums.com
Copyright ©2001-2014 Web Wiz Ltd. - http://www.webwiz.co.uk