<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : DAExtractPageText losing characters</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of; Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : DAExtractPageText losing characters]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Sat, 04 Apr 2026 20:03:02 +0000</pubDate>
  <lastBuildDate>Tue, 12 Oct 2010 19:15:17 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=1596</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[DAExtractPageText losing characters : Has nobody else seen this?   It...]]></title>
   <link>http://www.quickpdf.org/forum/daextractpagetext-losing-characters_topic1596_post7050.html#7050</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1355">Mike4ql</a><br /><strong>Subject:</strong> 1596<br /><strong>Posted:</strong> 12 Oct 10 at 7:15PM<br /><br />Has nobody else seen this?&nbsp;&nbsp; <DIV>&nbsp;</DIV><DIV>It seems to be a fundamental flaw preventing anyone from using PDF Quick to extract text from a PDF.</DIV><DIV>&nbsp;</DIV><DIV>I would be grateful for any suggestions.</DIV><DIV>&nbsp;</DIV><DIV>Mike</DIV>]]>
   </description>
   <pubDate>Tue, 12 Oct 2010 19:15:17 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/daextractpagetext-losing-characters_topic1596_post7050.html#7050</guid>
  </item> 
  <item>
   <title><![CDATA[DAExtractPageText losing characters : I am trying to extract the text...]]></title>
   <link>http://www.quickpdf.org/forum/daextractpagetext-losing-characters_topic1596_post7037.html#7037</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1355">Mike4ql</a><br /><strong>Subject:</strong> 1596<br /><strong>Posted:</strong> 08 Oct 10 at 11:39AM<br /><br /><P>I am trying to extract the text from a PDF and most of it works fine but occasionally letters are missed in the extract.&nbsp;&nbsp;&nbsp; This appears to be because the PDF is using octal codes for the characters.</P><P>This is the text which should be produced and is rendered correctly by DARenderPageToString:<BR>Top Line -&gt;&nbsp; 6 fyodor dostoyevsky<BR>Space -&gt; <BR>Next Line -&gt; flowers in a stuffy city apartment, but because everybody is</P><P>Here is the command extract for this same section</P><P>BT<BR>0 0 0 1 k<BR>/GS0 gs<BR>/T1_0 1 Tf<BR>8.25 0 0 8.25 262.7389 564.0571 Tm<BR>&#091;(\036)-100(\035)-55(\034)-100(\033)-100(\034)-100(\032)-100( )-100(\033)-100(\034)-100(\031)-100(\030)-82(\034)-45(\035)-100(\027)-100(\026)-100(\031)-100(\025)-100(\035)&#093;TJ<BR>8.5 0 0 8.5 83.3622 564.0571 Tm<BR>(\f)Tj<BR>10.5104 0 0 10.25 83.3622 543.058 Tm<BR>&#091;(\023)10(o)10(w)10(e)10(r)10(s)10( )-125(i)10(n)10( )-126(a)10( )-125(s)10(t)10(u)10(f)10(f)10(y)10( )-125(c)10(i)10(t)10(y)10( )-126(a)10(p)10(a)10(r)10(t)10(m)10(e)10(n)10(t)10(,)47( )-126(b)10(u)10(t)10( )-125(b)10(e)10(c)10(a)10(u)10(s)10(e)10( )-125(e)10(v)10(e)10(r)10(y)10(b)10(o)10(d)10(y)10( )-125(i)10(s )&#093;TJ</P><P>The DAExtractPageText (option 3) returns 2 lines with an empty string and a space (or perhaps 2) for the Top Line and misses out the "fl" from the begining of the Next Line.</P><P>Is there any way I can correct this?</P><DIV></DIV>]]>
   </description>
   <pubDate>Fri, 08 Oct 2010 11:39:48 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/daextractpagetext-losing-characters_topic1596_post7037.html#7037</guid>
  </item> 
 </channel>
</rss>