Barry Schwartz wrote:
> Also, I noticed when playing around with the examples from the "Th" ligature discussion that searching and extraction didn't work with small caps, though it did work with the ligature. With ActualText tags

hm, mkiv has an analyser for names->unicode and afaik small caps should work, unless the glyph name cannot be interpreted (as i don't have the font i cannot see what happens or what goes wrong here)

> these things always work, regardless of the ToUnicode map's contents. The way Cairo's PDF backend handles this is to use an ActualText tag for any glyphs that aren't included in the font's encoding. What I did in my modified ant is to generate a ToUnicode map from the Adobe glyph naming convention (http://www.adobe.com/devnet/opentype/archives/glyph.html) and then

thanks for the pointer

> put an ActualText tag on anything that happens not to match what you would get from the ToUnicode mapping.
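
For illustration only, the approach described in the quoted text might be sketched roughly as below. This is not the code from the modified ant, from Cairo, or from mkiv. The names `AGL_SUBSET`, `component_to_text`, `glyph_name_to_text` and `needs_actual_text` are hypothetical, the glyph table is a tiny stand-in for the full Adobe Glyph List, and the single-letter shortcut stands in for the list's "A".."Z" and "a".."z" entries.

```python
import re

# Hypothetical stand-in for the Adobe Glyph List; a real tool would load the full list.
AGL_SUBSET = {"space": " ", "hyphen": "-", "ampersand": "&", "fi": "\ufb01"}

UNI_NAME = re.compile(r"uni((?:[0-9A-F]{4})+)")  # uniXXXX, uniXXXXYYYY, ...
U_NAME = re.compile(r"u([0-9A-F]{4,6})")         # uXXXX .. uXXXXXX


def _is_scalar(value):
    # Unicode scalar values exclude the surrogate range D800..DFFF.
    return value <= 0x10FFFF and not 0xD800 <= value <= 0xDFFF


def component_to_text(component):
    """Map one glyph-name component to text, or '' if it cannot be interpreted."""
    m = UNI_NAME.fullmatch(component)
    if m:
        digits = m.group(1)
        values = [int(digits[i:i + 4], 16) for i in range(0, len(digits), 4)]
        return "".join(map(chr, values)) if all(map(_is_scalar, values)) else ""
    m = U_NAME.fullmatch(component)
    if m:
        value = int(m.group(1), 16)
        return chr(value) if _is_scalar(value) else ""
    if component in AGL_SUBSET:
        return AGL_SUBSET[component]
    # Assumption: plain ASCII letters name themselves, as they do in the full list.
    if len(component) == 1 and component.isascii() and component.isalpha():
        return component
    return ""


def glyph_name_to_text(name):
    """Drop any '.suffix', split ligature components on '_', and map each one."""
    base = name.split(".", 1)[0]
    return "".join(component_to_text(c) for c in base.split("_"))


def needs_actual_text(glyph_name, intended_text):
    """True when the name-derived ToUnicode entry would not give the intended text."""
    return glyph_name_to_text(glyph_name) != intended_text
```

With these assumptions, a ligature named `T_h` and a small cap named `a.sc` would both get usable ToUnicode entries ("Th" and "a"), while a glyph with an uninterpretable name such as `glyph123` would be the kind of case that gets an ActualText tag.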

hm, if one knows the character (say c) then why not adapt the tounicode vector

Hans

-----------------------------------------------------------------
                                          Hans Hagen | PRAGMA ADE
              Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
     tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
                                             | www.pragma-pod.nl
-----------------------------------------------------------------