Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!
How to optimize text extraction? |
Post Reply |
Author | |
Dmitry
Team Player Joined: 21 Sep 06 Status: Offline Points: 47 |
Post Options
Thanks(0)
Posted: 11 Mar 07 at 3:40AM |
Hi to all!
I have a question. How to optimize time of executing the function GetPageText? Average time of execution is about one second per page. It's too long for me :-) How to reduce this time? if I understand correctly during the text extraction qPDF library extract also all images from page. May be it will be more faster not to extract and save to harddrive images ??? |
|
marian_pascalau
Debenu Quick PDF Library Expert Joined: 28 Mar 06 Location: Germany Status: Offline Points: 278 |
Post Options
Thanks(0)
|
Dmitry,
there is only one way to influence the Text extraction: the Option parameter.
As you may know there are 5 parameters:
0: contents scan
1: internally same as 0
2: contents scan, CVS output
3: CVS text collection with rendering (may read image dictionary)
4: CVS text collection with rendering and word separation.
As information for you using the 0-2 Option may bring some improvements.
|
|
Dmitry
Team Player Joined: 21 Sep 06 Status: Offline Points: 47 |
Post Options
Thanks(0)
|
marian_pascalau, yes I know. But I need exactly parameter 5.
|
|
Ingo
Moderator Group Joined: 29 Oct 05 Status: Offline Points: 3524 |
Post Options
Thanks(0)
|
". . .
qPDF library extract also all images from page . . ." Hi! The actual library version doesn't extract the images anymore. Best regards, Ingo |
|
marian_pascalau
Debenu Quick PDF Library Expert Joined: 28 Mar 06 Location: Germany Status: Offline Points: 278 |
Post Options
Thanks(0)
|
Hi Dmitry, Hi Ingo,
I cannot follow both of you:
Dmitry, what do you mean with parameter 5?
Ingo, is it now working as expected or this is an error?
Marian
|
|
Ingo
Moderator Group Joined: 29 Oct 05 Status: Offline Points: 3524 |
Post Options
Thanks(0)
|
Hi Marian!
It's working like accepted... I think months ago this was fixed... Here's a thread pointing in the same direction: http://www.quickpdf.org/forum/search_results_posts.asp?SearchID=20070312070924&KW=asachoi Best regards, Ingo |
|
Dmitry
Team Player Joined: 21 Sep 06 Status: Offline Points: 47 |
Post Options
Thanks(0)
|
marian_pascalau, sorry, I meant parameter 4
Ingo, please give me just direct link to the thread. |
|
marian_pascalau
Debenu Quick PDF Library Expert Joined: 28 Mar 06 Location: Germany Status: Offline Points: 278 |
Post Options
Thanks(0)
|
Dmitry, if you consider a sponsorship and I will try to optimize the text extraction (Option=4) for you. Otherwise you should to use the option 2 and split text with your own program.
|
|
Dmitry
Team Player Joined: 21 Sep 06 Status: Offline Points: 47 |
Post Options
Thanks(0)
|
marian_pascalau
No, thanks |
|
Post Reply | |
Tweet
|
Forum Jump | Forum Permissions You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. About — Contact — Blog — Support — Online Store