DASetTextExtractionWordGap

I'm using library version 8.13 and I'm extracting text from PDFs using DAExtractPageText.  I've run into a few documents that aren't extracting quite the way I'd like when using an "Options" setting of 3 (3 = Return a CSV string for each piece of text on the page with the following format:  Font Name, Text Color, Text Size, X1, Y1, X2, Y2, X3, Y3, X4, Y4, Text).  It's putting some "words" together that should be separate.  To address this, I figured I'd use the fairly new DASetTextExtractionWordGap function to try to clean things up a bit.

Unfortunately, so far I haven't been able to get this function to have any impact at all on what's being extracted.  Has anybody had any success using this command, and if so, what sort of word gap values were you using?  I've been using 0.7, which I _think_ is the default, but I'm not 100% certain of that anymore.  Adjusting that value either higher or lower seems to make no difference.  What's the trick to getting this to work?
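In case it helps anyone reproduce this, below is a rough Python sketch of one way to inspect the Options = 3 output and measure the horizontal gap between consecutive text pieces. It does not call the library at all; it only parses a string already returned by DAExtractPageText. Several things here are assumptions rather than documented behavior: that each piece sits on its own line, that the field order matches the format quoted above, that the color is fine to keep as a plain string, and that the measured gap (in page units) is even comparable to the value DASetTextExtractionWordGap expects. The names TextPiece, parse_pieces and horizontal_gaps are made up for illustration.

```python
import csv
from typing import List, NamedTuple, Tuple


class TextPiece(NamedTuple):
    """One extracted piece of text (hypothetical container, not a library type)."""
    font: str
    color: str
    size: float
    corners: Tuple[float, ...]  # (x1, y1, x2, y2, x3, y3, x4, y4)
    text: str


def parse_pieces(csv_text: str) -> List[TextPiece]:
    """Parse CSV lines assumed to follow:
    Font Name, Text Color, Text Size, X1, Y1, X2, Y2, X3, Y3, X4, Y4, Text
    """
    pieces = []
    for row in csv.reader(csv_text.splitlines()):
        if len(row) < 12:
            continue  # skip blank or malformed lines
        font = row[0].strip()
        color = row[1].strip()
        size = float(row[2])
        corners = tuple(float(v) for v in row[3:11])
        # If the text was unquoted and contained commas, glue it back together.
        text = ",".join(row[11:])
        pieces.append(TextPiece(font, color, size, corners, text))
    return pieces


def horizontal_gaps(pieces: List[TextPiece]) -> List[Tuple[str, str, float]]:
    """For each consecutive pair, return (left text, right text, gap in page units).

    The gap is the smallest x of the right piece minus the largest x of the
    left piece, so it only makes sense for pieces on the same unrotated line.
    """
    gaps = []
    for left, right in zip(pieces, pieces[1:]):
        left_end = max(left.corners[0::2])      # x1, x2, x3, x4 of the left piece
        right_start = min(right.corners[0::2])  # x1, x2, x3, x4 of the right piece
        gaps.append((left.text, right.text, right_start - left_end))
    return gaps
```

Running horizontal_gaps(parse_pieces(extracted)) on a problem page, and comparing the gaps against the text size of each piece, should at least show whether the pieces being merged are separated by a measurable distance or are arriving from the PDF as a single run.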

Thanks!