Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!
![]() |
DAExtractPageText losing characters |
Post Reply ![]() |
Author | |
Mike4ql ![]() Beginner ![]() Joined: 26 Jul 10 Status: Offline Points: 5 |
![]() ![]() ![]() ![]() ![]() Posted: 08 Oct 10 at 11:39AM |
I am trying to extract the text from a PDF and most of it works fine but occasionally letters are missed in the extract. This appears to be because the PDF is using octal codes for the characters. This is the text which should be produced and is rendered correctly by DARenderPageToString: Here is the command extract for this same section BT The DAExtractPageText (option 3) returns 2 lines with an empty string and a space (or perhaps 2) for the Top Line and misses out the "fl" from the begining of the Next Line. Is there any way I can correct this? |
|
![]() |
|
Mike4ql ![]() Beginner ![]() Joined: 26 Jul 10 Status: Offline Points: 5 |
![]() ![]() ![]() ![]() ![]() |
Has nobody else seen this?
It seems to be a fundamental flaw preventing anyone from using PDF Quick to extract text from a PDF.
I would be grateful for any suggestions.
Mike
|
|
![]() |
Post Reply ![]() |
|
Tweet
|
Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. About — Contact — Blog — Support — Online Store