Problem with GetPageText
lopezik (Beginner, joined 15 Feb 11), posted 15 Feb 11 at 11:20PM:
Hello to all forum users!

First of all, sorry for my bad English. I have a problem with the library's text extraction functions. GetPageText does not work as it should: for one of my files it does not return all the characters. How can that be? I also tried ExtractFilePageText, without success.

I'm using:
Quick PDF Library 7.23
Delphi XE
Windows XP 32-bit

I uploaded a sample PDF file for testing here: http://uploading.com/files/fe47f7cf/pdf_test.pdf/

Please help. Thanks for any tips.
Ingo (Moderator Group, joined 29 Oct 05):
Hi Lopezik!

This behavior depends on the fonts and/or character set used. Search the forum threads for strings like "unicode" or "utf8"; we have Unicode samples here, too. With those you should get all characters. Without Unicode I got only "Ró y Wiatrów 7" from your PDF.

Cheers and welcome here,
Ingo

Edited by Ingo - 16 Feb 11 at 1:49PM
Dimitry (Team Player, joined 18 Feb 10):
The problem has been reproduced and fixed.
Please check the next Quick PDF Library releases.

Edited by Dimitry - 16 Feb 11 at 2:53PM

Regards,
Dmitry
lopezik (Beginner, joined 15 Feb 11):
Thank you for your reply, but unfortunately the problem persists.
I used the following code, but it does not help.

    procedure TForm1.Button1Click(Sender: TObject);
    var
      QP: TQuickPDF;
      S: AnsiString;
      FS: TFileStream;
      UTF8BOM: AnsiString;
    begin
      QP := TQuickPDF.Create;
      try
        QP.UnlockKey('');
        QP.LoadFromFile('c:\pdf_test.pdf');
        S := QP.GetPageText(0);
        FS := TFileStream.Create('c:\pdf_test.txt', fmCreate);
        UTF8BOM := #$EF#$BB#$BF;
        FS.Write(UTF8BOM[1], Length(UTF8BOM));
        if Length(S) > 0 then
          FS.Write(S[1], Length(S));
        FS.Free;
      finally
        QP.Free;
      end;
    end;

A newer version (Quick PDF Library 7.24 Beta 2) does not help either. One character is still missing: 'z' with the dot above (ż).
Do you have any ideas?

Edited by lopezik - 16 Feb 11 at 10:39PM
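A side note on the snippet in the post above, separate from the library bug discussed in this thread: it writes a UTF-8 BOM and then the raw bytes of an AnsiString, and those bytes may not actually be UTF-8. The following is only a minimal sketch of writing text as real UTF-8 in Delphi XE. It assumes the extracted text is available as a UnicodeString, which this thread does not confirm for GetPageText in 7.23. SaveTextAsUtf8 is a hypothetical helper name, and the sample string is modelled on the text quoted earlier in the thread.

```delphi
program SaveUtf8Sketch;

{$APPTYPE CONSOLE}

uses
  SysUtils, Classes;

// Hypothetical helper (not part of Quick PDF Library):
// writes Text to FileName as UTF-8, preceded by the UTF-8 BOM.
procedure SaveTextAsUtf8(const FileName: string; const Text: UnicodeString);
var
  FS: TFileStream;
  Preamble, Bytes: TBytes;
begin
  Preamble := TEncoding.UTF8.GetPreamble;  // the BOM bytes EF BB BF
  Bytes := TEncoding.UTF8.GetBytes(Text);  // encode UTF-16 text as UTF-8
  FS := TFileStream.Create(FileName, fmCreate);
  try
    FS.WriteBuffer(Preamble[0], Length(Preamble));
    if Length(Bytes) > 0 then
      FS.WriteBuffer(Bytes[0], Length(Bytes));
  finally
    FS.Free;  // released even if a write raises an exception
  end;
end;

begin
  // Assumed sample text; non-ASCII characters are given as code points
  // (U+00F3 and U+017C) so the source file encoding does not matter.
  SaveTextAsUtf8('pdf_test.txt', 'R'#$00F3#$017C'y Wiatr'#$00F3'w 7');
end.
```

If the character is already missing from the string returned by the library, as reported here, encoding the output correctly will not bring it back. The sketch only rules out the file-writing step as a second source of lost characters.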
Ingo (Moderator Group, joined 29 Oct 05):
Hi!

Yes, the problem still exists in the current version. Dimitry wrote that the bug is fixed and that future releases won't have this problem, so you will have to wait for the next release, 7.25.

Cheers,
Ingo
lopezik (Beginner, joined 15 Feb 11):
Hi,

We are waiting impatiently; this is a very big problem for us. When can we expect the first beta of version 7.25?