Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!
ExtractFilePageText problem |
Post Reply |
Author | |
oyo@nois.no
Beginner Joined: 06 Jun 12 Status: Offline Points: 9 |
Post Options
Thanks(0)
Posted: 30 Aug 16 at 9:44AM |
HiI'm having problem extracting the text from the linked pdf. http://download.nois.no/isycad/outgoing/debenu/EKOZ-AK-M-85892-001-01.zip
The result looks something like this (strange characters): Do anyone know what the problem might be? Regards Øyvind Knappskog Olsen Norconsult Informasjonssystemer AS |
|
mLipok
Senior Member Joined: 23 Apr 14 Location: Poland, Zabrze Status: Offline Points: 453 |
Post Options
Thanks(0)
|
Post code snippet showing how you trying to do that.
|
|
Here you can find description how to test my examples:
http://www.quickpdf.org/forum/forum_posts.asp?TID=2932&PID=12600&title=drawcapturedpagematrix-matrix-howto#12600 |
|
oyo@nois.no
Beginner Joined: 06 Jun 12 Status: Offline Points: 9 |
Post Options
Thanks(0)
|
Hi
Here is my code. It Works fine for all other pdf files. Maybe there is something wrong with the pdf or maybe it is read protected... Adobe Version is 1.5. dllFile = result + "DebenuPDFLibraryDLL1115.dll"; if (File.Exists(dllFile)) { PDFLibrary qp = new PDFLibrary(dllFile); // A new blank document is created at this point in memory int docID = qp.NewDocument(); // Unlock the library int res = qp.UnlockKey(licenseKey); string li = qp.LicenseInfo(); int lec = qp.LastErrorCode(); // Check to see if the library has been successfully unlocked if (qp.Unlocked() == 1) { // Load the document that you want to extract text from into memory qp.LoadFromFile(pdfFile, "");
int iNumPages = qp.PageCount(); // Traverse all pages string documentText = ""; for (int nPage = 1; nPage <= iNumPages; nPage++) { string pageText = qp.ExtractFilePageText(pdfFile, ""
, nPage, 3); } } } Regards Øyvind |
|
Ingo
Moderator Group Joined: 29 Oct 05 Status: Offline Points: 3524 |
Post Options
Thanks(0)
|
Adobe version is 1.4.
There are no security settings - it's all allowed. No passwords... nothing. Only web optimized - that's all. My extractions have a similar result than the one from Oyvind. Should have something to do with fonts, used character codes, codepages ...? Somebody with a more detailed analysis here? Come on! ;-) Cheers and welcome here, Ingo |
|
Cheers,
Ingo |
|
mLipok
Senior Member Joined: 23 Apr 14 Location: Poland, Zabrze Status: Offline Points: 453 |
Post Options
Thanks(0)
|
Hello, Ingo.
I'm little busy, as I'm working on several projects as AutoIt MVP. I'll try to look at this in few next days. |
|
Here you can find description how to test my examples:
http://www.quickpdf.org/forum/forum_posts.asp?TID=2932&PID=12600&title=drawcapturedpagematrix-matrix-howto#12600 |
|
oyo@nois.no
Beginner Joined: 06 Jun 12 Status: Offline Points: 9 |
Post Options
Thanks(0)
|
Hi Thanks for your interest in my problem. Have you had a chance to look at it? Regards Øyvind
|
|
mLipok
Senior Member Joined: 23 Apr 14 Location: Poland, Zabrze Status: Offline Points: 453 |
Post Options
Thanks(0)
|
Try to use
$oQP.SelectPage($iPage_idx) $sDocumentText &= $oQP.GetPageText(8) & @CRLF Btw. I test it with DebenuPDFLibraryAX1311.dll, and I see the same problem. Regards, mLipok
|
|
Here you can find description how to test my examples:
http://www.quickpdf.org/forum/forum_posts.asp?TID=2932&PID=12600&title=drawcapturedpagematrix-matrix-howto#12600 |
|
oyo@nois.no
Beginner Joined: 06 Jun 12 Status: Offline Points: 9 |
Post Options
Thanks(0)
|
Hi I tried what you suggested but it didn't work. Ingo says something about the pdf being web optimized. Could that be a problem? Could there be any other problems With the file? Regards Øyvind
|
|
mLipok
Senior Member Joined: 23 Apr 14 Location: Poland, Zabrze Status: Offline Points: 453 |
Post Options
Thanks(0)
|
I know that this is not working (I said that I see this problem). This was only not related remark about using SelectPage... GetPageText.
Sorry for my English.., I can not say if this is related to the case mentioned by Ingo - I just do not know as I'm normal user as you are, and I'm not PDF technology expert. Sorry.... Try to post to this email: Debenu Support <support@debenu.com> |
|
Here you can find description how to test my examples:
http://www.quickpdf.org/forum/forum_posts.asp?TID=2932&PID=12600&title=drawcapturedpagematrix-matrix-howto#12600 |
|
oyo@nois.no
Beginner Joined: 06 Jun 12 Status: Offline Points: 9 |
Post Options
Thanks(0)
|
OK. Thank you.
|
|
Post Reply | |
Tweet
|
Forum Jump | Forum Permissions You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. About — Contact — Blog — Support — Online Store