Idris Samawi Hamid ادريس سماوي حامد wrote:
It appears to me that zwj, zwnj, etc. should be invisible in the typeset output, as explained above, but should still be encoded in the output pdf. Think of pdf text extraction, converting between Arabic and Farsi typesetting conventions, etc.
we can do that later (we can use an attribute to keep track of preceding/following special characters and inject them into the output later on)
and of course others need to be handled as well
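
A rough illustration of the attribute idea, in Python rather than in the Lua/TeX code that would actually do this. It is only a sketch under assumptions: the names `Glyph`, `strip_and_tag`, `visible` and `encoded` are made up for the example and are not part of any existing interface. The invisible characters are removed from the stream that gets typeset, remembered as attributes on the neighbouring glyphs, and put back when the text for the output file is assembled.

```python
from dataclasses import dataclass

ZWJ = "\u200D"   # zero width joiner
ZWNJ = "\u200C"  # zero width non-joiner
SPECIALS = {ZWJ, ZWNJ}  # assumption: extend with other invisible characters as needed


@dataclass
class Glyph:
    """Hypothetical stand-in for a glyph node carrying attributes."""
    char: str
    before: str = ""  # invisible characters that preceded this glyph
    after: str = ""   # invisible characters that trailed the last glyph


def strip_and_tag(text):
    """Drop the special characters, recording them on the adjacent glyphs."""
    glyphs = []
    pending = ""
    for ch in text:
        if ch in SPECIALS:
            pending += ch  # remember it, but emit no glyph
        else:
            glyphs.append(Glyph(ch, before=pending))
            pending = ""
    if pending and glyphs:
        # specials at the very end have no following glyph, so hang them
        # on the last one; a string of specials only is simply lost here
        glyphs[-1].after = pending
    return glyphs


def visible(glyphs):
    """What the typesetter would see: no invisible characters."""
    return "".join(g.char for g in glyphs)


def encoded(glyphs):
    """What would be written to the output: specials injected again."""
    return "".join(g.before + g.char + g.after for g in glyphs)


# a Persian word with a zwnj between its two parts, written with escapes
sample = "\u0645\u06CC" + ZWNJ + "\u062E\u0648\u0627\u0647\u0645"
glyphs = strip_and_tag(sample)
```

With this, `visible(glyphs)` gives the word without the zwnj, while `encoded(glyphs)` gives back the original string, which is the text one would want to get from extraction.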
Even lsep's and psep's should be present in the output pdf (e.g. \par => psep). That would make text extraction much more useful, etc.
rather useless in pdf; at some point i might add proper structure to the pdf output but it has a rather low priority (never needed it)

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
| www.pragma-pod.nl
-----------------------------------------------------------------