Print Page | Close Window

Searching a PDF

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: General Discussion
Forum Description: Discussion board for Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=1224
Printed Date: 23 Nov 24 at 7:40AM
Software Version: Web Wiz Forums 11.01 - http://www.webwizforums.com


Topic: Searching a PDF
Posted By: chandy21
Subject: Searching a PDF
Date Posted: 23 Sep 09 at 1:17PM
Hi, I was wondering if there is a mechanism to search and find text within QuickPDF? Or do I delegate that task to a search engine like Lucene?

Once I find Text, I want to be able to locate it in the document for highlight or whatever.

For example, something like the Ctrl-F in Acrobat., but I know QuickPDF is not a Visual Tool, but I would like to programatically mark-up the document.

Any suggestions?

Thanks in advance.



Replies:
Posted By: DELBEKE
Date Posted: 23 Sep 09 at 3:05PM
Yes, there is a way
Look at "GetPageText" function, the option 4 let you get each word with it's position inside the page, the font used and it's size
 


Posted By: Ingo
Date Posted: 23 Sep 09 at 3:10PM
Hi!

Similar to Jean-Luc's answer:
First do the textextraction and then search with Pos-syntax (or similar) to strings.
The textextraction works page by page... so if you only want the pagenumber where a search-string is located you can use textextraction with option 0,1 or 2 as well. If you need the special position on a page then you should use option 4 (word by word).

Cheers, Ingo



Posted By: stakon
Date Posted: 09 Oct 09 at 12:52PM
Good day,
I hope this thread is appropriate for my questions.

I have the following questions about the "searchability" and manipulation level of a PDF document by using QuickPDF.

1. Is there a way i can retrieve and manipulate objects such as TextBoxes , images, etc. created by QuickPDF or the only "searchable" thing is text? (I regard this is impossible but i just thought i'd ask)

2. If i search and find something in a PDF document can i actually delete it/change its appearance/position etc. ?

3. If i search and find the position of some text and append some more text to it and as a result i go out of the page bounds, is there an automatic way to prevent/correct this?

Thanx in advance,

Stakon.


Posted By: Ingo
Date Posted: 12 Oct 09 at 6:33AM
Hi Stakon!

To point 1...
The really searchable stuff is in textformat. This can be the real textcontent or the text from meta data like subject, author, keywords, and so on. Additionally you can solve questionsabout the layout, if there are images, what are the settings of a document, and so on.
To point 2...
You can change all things having to do with the layout.
You can add, delete and clone pages.
You can't change special parts of a page.
You can overwrite special parts but then they are only covered.
To point 3...
With QP you can change, delete special text parts instead you can add complete new pages or you can delete a page and insert a new one at the same position.

Cheers, Ingo




Print Page | Close Window

Forum Software by Web Wiz Forums® version 11.01 - http://www.webwizforums.com
Copyright ©2001-2014 Web Wiz Ltd. - http://www.webwiz.co.uk