Bug when extracting text


   Hi,
I use QuickPDF 7.15 with Option #3 to extract text from PDF files and ran into an annoying bug.
To reproduce it, create a simple PDF file that contains the text "QuickPDF Library" and give the character "P" a different color from the rest.
QuickPDF then extracts the following content from "QuickPDF Library":

"BAAAAA+TimesNewRomanPSMT",#000000,12.00,56.8000,776.6920,147.3920,776.6920,147.3920,784.7920,56.8000,784.7920,"Quick DF Library"
"BAAAAA+TimesNewRomanPSMT",#FF0000,12.00,86.2000,776.6920,92.8720,776.6920,92.8720,784.7920,86.2000,784.7920,"P"

As you can see, "P" is extracted after "Quick DF Library", which has a gap where the "P" belongs. The output should instead be three blocks in reading order:

...,"Quick"
...,"P"
...,"DF Library"

However, when more than one character has a different color, it works correctly. If you use another color for "PD", the text from "QuickPDF Library" is extracted in the correct order:

"BAAAAA+TimesNewRomanPSMT",#000000,12.00,56.8000,776.6920,86.1040,776.6920,86.1040,784.7920,56.8000,784.7920,"Quick"
"BAAAAA+TimesNewRomanPSMT",#FF0000,12.00,86.2000,776.6920,101.5600,776.6920,101.5600,784.7920,86.2000,784.7920,"PD"
"BAAAAA+TimesNewRomanPSMT",#000000,12.00,101.5000,776.6920,147.1960,776.6920,147.1960,784.7920,101.5000,784.7920,"F Library"

So it seems that this happens only when a single character has a different color. Is there any chance of getting this fixed in the next version?
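
In case it helps with reproducing or detecting the problem, here is a rough Python sketch that flags the affected blocks in the extracted output. It is not part of QuickPDF. It assumes each line has the layout seen above (font, color, size, four x/y corner pairs, text), and the names `parse_blocks` and `find_nested`, as well as the 0.01 tolerance, are made up for this example. It reports any block whose horizontal extent lies entirely inside another block on the same baseline, which is what happens with the single "P".

```python
import csv
import io

# The two lines QuickPDF returned for the single-character case.
SAMPLE = '''"BAAAAA+TimesNewRomanPSMT",#000000,12.00,56.8000,776.6920,147.3920,776.6920,147.3920,784.7920,56.8000,784.7920,"Quick DF Library"
"BAAAAA+TimesNewRomanPSMT",#FF0000,12.00,86.2000,776.6920,92.8720,776.6920,92.8720,784.7920,86.2000,784.7920,"P"
'''


def parse_blocks(text):
    """Parse extracted lines into dicts (assumed layout: 12 CSV fields)."""
    blocks = []
    for row in csv.reader(io.StringIO(text)):
        if len(row) != 12:
            continue  # skip anything that does not match the assumed layout
        xs = [float(v) for v in row[3:11:2]]  # x of the four corners
        ys = [float(v) for v in row[4:11:2]]  # y of the four corners
        blocks.append({
            "font": row[0],
            "color": row[1],
            "left": min(xs),
            "right": max(xs),
            "bottom": min(ys),
            "text": row[11],
        })
    return blocks


def find_nested(blocks, tol=0.01):
    """Return (outer, inner) pairs where inner sits inside outer on one baseline."""
    pairs = []
    for outer in blocks:
        for inner in blocks:
            if inner is outer:
                continue
            same_line = abs(inner["bottom"] - outer["bottom"]) <= tol
            inside = (inner["left"] >= outer["left"] - tol
                      and inner["right"] <= outer["right"] + tol)
            if same_line and inside:
                pairs.append((outer, inner))
    return pairs


if __name__ == "__main__":
    for outer, inner in find_nested(parse_blocks(SAMPLE)):
        print(f'"{inner["text"]}" lies inside "{outer["text"]}"')
```

With the "PD" output above, the three blocks sit side by side rather than one inside another, so nothing should be reported.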