Re: [NTG-context] ActualText

19 Sep 2009


Barry Schwartz wrote:
...
Also, I noticed when playing around with the examples from the "Th"
ligature discussion that searching and extraction didn't work with
small caps, though it did work with the ligature. With ActualText tags

Hm, MkIV has an analyser that maps glyph names to Unicode, and as far as I
know small caps should work unless the glyph name cannot be interpreted. (As I
don't have the font, I cannot see what happens or what goes wrong here.)
...
these things always work, regardless of the ToUnicode map's
contents. The way Cairo's PDF backend handles this is to use an
ActualText tag for any glyphs that aren't included in the font's
encoding. What I did in my modified ant is to generate a ToUnicode map
from the Adobe glyph naming convention
(http://www.adobe.com/devnet/opentype/archives/glyph.html) and then

Thanks for the pointer.
...
put an ActualText tag on anything that happens not to match what you
would get from the ToUnicode mapping.

Hm, if one knows the character (say c), then why not adapt the ToUnicode
vector?
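
For illustration, here is a minimal Python sketch of the approach Barry describes: derive the text for a glyph from its name using the Adobe glyph naming convention, and flag the glyph for an ActualText tag when that derived text differs from the character that was actually typeset. It is not the code from ant, Cairo, or MkIV. The names AGL_EXCERPT, glyph_name_to_text and needs_actual_text are hypothetical, and the tiny table stands in for the full Adobe Glyph List, which a real tool would load.

```python
import re

# Hypothetical, tiny excerpt of the Adobe Glyph List (glyph name -> text).
AGL_EXCERPT = {"a": "a", "T": "T", "h": "h", "f": "f", "i": "i"}


def _is_scalar(cp):
    """True if cp is a Unicode scalar value (no surrogates, within range)."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def component_to_text(component, agl):
    """Map one name component to text: AGL lookup, then uniXXXX, then uXXXX."""
    if component in agl:
        return agl[component]
    m = re.fullmatch(r"uni((?:[0-9A-F]{4})+)", component)
    if m:
        digits = m.group(1)
        cps = [int(digits[i:i + 4], 16) for i in range(0, len(digits), 4)]
        return "".join(map(chr, cps)) if all(map(_is_scalar, cps)) else ""
    m = re.fullmatch(r"u([0-9A-F]{4,6})", component)
    if m:
        cp = int(m.group(1), 16)
        return chr(cp) if _is_scalar(cp) else ""
    return ""  # the name cannot be interpreted


def glyph_name_to_text(name, agl):
    """Drop any suffix after the first '.', then map each '_'-joined component."""
    base = name.split(".", 1)[0]
    return "".join(component_to_text(part, agl) for part in base.split("_"))


def needs_actual_text(name, intended, agl):
    """True when the name-derived mapping would not yield the intended text."""
    return glyph_name_to_text(name, agl) != intended


print(glyph_name_to_text("a.sc", AGL_EXCERPT))   # small-cap variant of "a"
print(glyph_name_to_text("T_h", AGL_EXCERPT))    # "Th" ligature
print(needs_actual_text("glyph123", "a", AGL_EXCERPT))
```

With a name such as a.sc the suffix is dropped and the mapping gives "a", so no tag is needed. A name that cannot be interpreted, such as glyph123, maps to nothing and would get an ActualText tag.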

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
      tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
                                              | www.pragma-pod.nl
-----------------------------------------------------------------