Re: [NTG-context] ec encoding and tcaron
Adam Lindsay wrote:
Am I right in thinking the main array in pdfr-ec maps from slots in the encoding to the unicode code point? Is there anything else I need to be mindful of?
According to the Adobe specification
http://partners.adobe.com/public/developer/en/acrobat/5411.ToUnicode.pdf
both existing pdfr-* files already violate at least one of the following two requirements:

    /CMapType 2 def
    ...
    1 begincodespacerange
    <0000> <FFFF>
    endcodespacerange

You can either define a range (for example for A-Z, 0-9, ...) or single characters.

Mojca
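A minimal sketch of the "range or single characters" choice, assuming the encoding is available as a plain slot-to-code-point mapping. The function names and the sample slots are hypothetical and are not taken from the pdfr-* files. Consecutive slots that map to consecutive code points are emitted as a bfrange entry, everything else as bfchar entries. It only handles one-byte slots and code points up to U+FFFF.

```python
def hex_token(value, digits=4):
    """Format an integer as a CMap hex string such as <0041>."""
    return f"<{value:0{digits}X}>"


def group_runs(slot_to_unicode):
    """Split a slot -> code point mapping into (start, end, first_cp) runs.

    A run grows only while both the slot and the code point advance by one.
    """
    runs = []
    for slot in sorted(slot_to_unicode):
        cp = slot_to_unicode[slot]
        if runs:
            start, end, first_cp = runs[-1]
            if slot == end + 1 and cp == first_cp + (slot - start):
                runs[-1] = (start, slot, first_cp)
                continue
        runs.append((slot, slot, cp))
    return runs


def cmap_mapping_lines(slot_to_unicode, digits=4):
    """Return the bfchar and bfrange sections as a list of text lines."""
    runs = group_runs(slot_to_unicode)
    singles = [r for r in runs if r[0] == r[1]]
    ranges = [r for r in runs if r[0] != r[1]]
    lines = []
    if singles:
        lines.append(f"{len(singles)} beginbfchar")
        for start, _, cp in singles:
            lines.append(f"{hex_token(start, digits)} {hex_token(cp)}")
        lines.append("endbfchar")
    if ranges:
        lines.append(f"{len(ranges)} beginbfrange")
        for start, end, cp in ranges:
            lines.append(
                f"{hex_token(start, digits)} {hex_token(end, digits)} {hex_token(cp)}"
            )
        lines.append("endbfrange")
    return lines


# Illustrative sample only: A-C plus one accented slot (values are assumptions;
# check them against the real encoding before use).
sample = {0x41: 0x0041, 0x42: 0x0042, 0x43: 0x0043, 0xB4: 0x0165}
print("\n".join(cmap_mapping_lines(sample)))
```

The `digits` parameter sets the width of the source codes and should match whatever codespacerange the file declares.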
Mojca Miklavec said this at Wed, 24 Aug 2005 23:41:47 +0200:
According to the Adobe specification http://partners.adobe.com/public/developer/en/acrobat/5411.ToUnicode.pdf
Thanks for the link, Mojca. I'm not sure about the invalidity you cite, but it does look like the reference I need to study!

--
Adam T. Lindsay, Computing Dept.     atl@comp.lancs.ac.uk
Lancaster University, InfoLab21      +44(0)1524/510.514
Lancaster, LA1 4WA, UK               Fax:+44(0)1524/510.492
Adam Lindsay wrote:
Mojca Miklavec said this at Wed, 24 Aug 2005 23:41:47 +0200:
According to the Adobe specification http://partners.adobe.com/public/developer/en/acrobat/5411.ToUnicode.pdf
Thanks for the link, Mojca. I'm not sure about the invalidity you cite, but it does look like the reference I need to study!
Sorry, this paragraph confused me a bit (I had answered you just a couple of minutes before your post):

    Note from the above excerpt that /CMapType is set to 2 in "ToUnicode" mapping files. Because a "ToUnicode" mapping file is used to convert from CIDs (which begin at decimal 0, which is expressed as 0x0000 in hexadecimal notation) to Unicode code points, the following "codespacerange" definition, without exception, shall always be used:

    1 begincodespacerange
    <0000> <FFFF>
    endcodespacerange

I have to look closer as well before stating my next stupidity. (I don't know whether CMapType 1 or 2 is the better choice, or whether the range should run from <0000> to <FFFF> only for Eastern languages, where the number of glyphs certainly exceeds 256 ;)

Sorry again.

Mojca
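The two requirements quoted from the specification can be checked mechanically. The following is only a rough sketch: the function name is hypothetical, the regular expressions assume a single codespacerange entry, and it does not settle whether these requirements really apply to one-byte encodings such as the one in pdfr-ec.

```python
import re


def check_tounicode_requirements(cmap_text):
    """Report whether a CMap text meets the two quoted requirements."""
    # Requirement 1: the file declares /CMapType 2.
    has_type2 = re.search(r"/CMapType\s+2\s+def", cmap_text) is not None

    # Requirement 2: the code space runs from <0000> to <FFFF>.
    match = re.search(
        r"begincodespacerange\s*<([0-9A-Fa-f]+)>\s*<([0-9A-Fa-f]+)>\s*endcodespacerange",
        cmap_text,
    )
    has_full_range = (
        match is not None
        and match.group(1).upper() == "0000"
        and match.group(2).upper() == "FFFF"
    )
    return {"cmaptype_2": has_type2, "codespacerange_0000_ffff": has_full_range}


conforming = """
/CMapType 2 def
1 begincodespacerange
<0000> <FFFF>
endcodespacerange
"""
print(check_tounicode_requirements(conforming))
```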
participants (2)
- Adam Lindsay
- Mojca Miklavec