Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!

Debenu Quick PDF Library - PDF SDK Community Forum Homepage
Forum Home Forum Home > For Users of the Library > I need help - I can help
  New Posts New Posts RSS Feed - Creating PDF from Image + hOCR data
  FAQ FAQ  Forum Search   Register Register  Login Login

Creating PDF from Image + hOCR data

 Post Reply Post Reply
Author
Message
Shuaib View Drop Down
Beginner
Beginner
Avatar

Joined: 12 Mar 13
Location: Pakistan
Status: Offline
Points: 2
Post Options Post Options   Thanks (0) Thanks(0)   Quote Shuaib Quote  Post ReplyReply Direct Link To This Post Topic: Creating PDF from Image + hOCR data
    Posted: 12 Mar 13 at 9:32AM
Hi,

I am using tesseract to ocr images, now I would like to create a pdf out of the original OCRed image, plus the hOCR output I get from ocr engine. Can anyone please guide me in the right direction on how I can use Quick PDF Library to achieve this? Google didn't turn up anything.

Thanks.
Back to Top
Ingo View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 29 Oct 05
Status: Offline
Points: 3529
Post Options Post Options   Thanks (0) Thanks(0)   Quote Ingo Quote  Post ReplyReply Direct Link To This Post Posted: 12 Mar 13 at 10:13PM
Hi!

With the draw-functionalities of QP you can insert the real text
without having an eye on the layout.
On a new layer you can use DrawImage to insert the text with
layout over all.
So you can work with textextraction as well as having the nice
original layout.
Here's the online reference:
http://www.debenu.com/docs/pdf_library_reference/FunctionGroups.php

Cheers and welcome here,
Ingo

Back to Top
AndrewC View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 08 Dec 10
Location: Geelong, Aust
Status: Offline
Points: 841
Post Options Post Options   Thanks (0) Thanks(0)   Quote AndrewC Quote  Post ReplyReply Direct Link To This Post Posted: 14 Mar 13 at 10:22AM
Hello,

QP.SetTextMode(3);  will allow you to draw invisible text.  The text will still be searchable.

You will also need to use AddImageFromFile and DrawImage to create the visible part of the page.  This link shows how import and draw an image correctly - http://www.quickpdf.org/forum/creating-a-multi-page-pdf-from-a-multipage-tiff_topic2125.html

You will need to use Google to better understand the hOCR data correctly.

Andrew.


Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 11.01
Copyright ©2001-2014 Web Wiz Ltd.

Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. AboutContactBlogSupportOnline Store