Arthur Reutenauer wrote:
can you explain in more detail what you mean with 'actual text tags'?
He means "ActualText tags" :-) See the PDF spec section 14.9.4, page 623. It's a more generic way to support searching than ToUnicode vectors: you just specify the actual string of underlying Unicode characters. The PDF spec uses hyphenated "ck" in German as an example: you typeset "Druk-ker" but you want to search for "Drucker". You can't do that with ToUnicode vectors.
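As a rough illustration of the "Drucker" case (a sketch, not taken from the thread or the spec): the visible, hyphenated text is wrapped in a marked-content span whose ActualText entry carries the string that should be found. It assumes a pdfTeX-based engine where \pdfliteral is available and a plain ASCII replacement string; whether a given viewer uses ActualText for searching or copying varies.

```tex
% Sketch only: assumes \pdfliteral is available (pdfTeX-based engine).
\starttext
% Visible text is the hyphenated form "Druk-ker";
% /ActualText holds the word that searching should match.
\pdfliteral{/Span <</ActualText (Drucker)>> BDC}Druk-ker\pdfliteral{EMC}
\stoptext
```

The BDC and EMC operators have to stay balanced around the typeset material, so the two literals belong together as a pair.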
Anyway, this needs support at the engine level and I don't think there is any; actually it would be nice to add that to LuaTeX.
hm, if done with words it's probably doable with an unadapted engine (esp. when we have a cleaner pdfliteral model, which is on the agenda)

\starttext
\dorecurse{100}{test }
\pdfliteral{/Span <> BDC}arthur\pdfliteral{EMC}
\dorecurse{100}{test }
\stoptext

not that hard to implement if we add a span around each word

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com | www.pragma-pod.nl
-----------------------------------------------------------------