Print Page | Close Window

searching Reference Guide

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: General Discussion
Forum Description: Discussion board for Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=728
Printed Date: 23 Nov 24 at 4:23AM
Software Version: Web Wiz Forums 11.01 - http://www.webwizforums.com


Topic: searching Reference Guide
Posted By: mgbalaji
Subject: searching Reference Guide
Date Posted: 16 May 07 at 8:59AM
Hi,

    I installed latest version ie., iSED.dll v5.22 in my system.Previously i worked on iSED.dll v4.41. While working in new version some functions is not working that is used in the previous version.
    Here i put one Example.

GetPageText(4) - This function get the words in a line from pdf in v4.41.

   but the same function is not working in the newer version(5.22).ie., It will extract lines from the pdf.

   Please tell the function how to extract words from pdf using v5.22 or provide the reference guide for v5.22.

Thanks.....



Replies:
Posted By: Ingo
Date Posted: 16 May 07 at 9:32AM
Hi!

Ref. 5.11 is the last official one we have ... I'll post it here if i've time enough. Here's the syntax you're asking for:

GetPageText

Description

This function provides two different methods for extracting text from the selected page, and presents the
results in a variety of formats.

Definition

function GetPageText(ExtractOptions As Long) As String

Parameters

ExtractOptions Using the standard text extraction algorithm:
0 = Extract text in human readable format
1 = Deprecated
2 = Return a CSV string including font, color, size and position of each piece of text on
the page
Using the more accurate text extraction algorithm:
3 = Return a CSV string for each piece of text on the page with the following format:
Font Name, Text Color, Text Size, X1, Y1, X2, Y2, X3, Y3, X4, Y4, Text
The co-ordinates are the four points bounding the text, measured using the units set
with the SetMeasurementUnits function and the origin set with the SetOrigin function.
4 = Similar to option 3, but individual words are returned, making searching for words
easier.

Returns

The text of the selected page, or an empty string if a problem occurred. Lines are separated with CR-LF
characters.


Best regards,
Ingo



Posted By: mgbalaji
Date Posted: 17 May 07 at 1:20AM
Hi Ingo,

   I have v5.11 reference guide. I already mentioned, this GetPageText function is not working properly.

This is my piece of code:

   Dim data As iSED.QuickPDF
   txt = data.GetPageText(4)

I used this code as per the reference. We put ExtractOptions is 4 means it will extract individual words, but it is not extracting words. It will extract line by line from pdf.I want word by word extraction.

Please tell if any function to extract word by word in v5.22.
I am strugling because of this.

Thanks...


Posted By: Ingo
Date Posted: 17 May 07 at 9:35AM
Hi!

If you've tried 5.22 and option 4 doesn't work then it's like it is ... i can't do anything against it. Try 5.21 ... perhaps it's better. You can call Marian, too... perhaps he can do something for you.

Best regards,
Ingo


Posted By: marian_pascalau
Date Posted: 17 May 07 at 8:29PM
Hi there, if you believe that something is not right inside QuickPDF 5.22 you should send us a bug report to support(at)quickpdf.org. Send please pdf document which gives you this problem and I will take a look into it.
 



Print Page | Close Window

Forum Software by Web Wiz Forums® version 11.01 - http://www.webwizforums.com
Copyright ©2001-2014 Web Wiz Ltd. - http://www.webwiz.co.uk