<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;">
Hi Karl,
<div class=""><br class="">
</div>
<div class="">Thanks for looking at this.</div>
<div class=""><br class="">
<blockquote type="cite" class="">On Jun 13, 2017, at 9:10 AM, Karl Berry <<a href="mailto:karl@freefriends.org" class="">karl@freefriends.org</a>> wrote:<br class="">
<br class="">
rm> ... test the full name<br class="">
(including œפער.œפעשs) first, for a datbase entry. If found, use it.<br class="">
Otherwise, try again using just the prefix (as at present).<br class="">
<br class="">
That surely sounds sensible.<br class="">
<br class="">
Or in case a name is multiply qualified; e.g.,<br class="">
delta.sc.ipa (occurs in cmu-tipx.enc ) also<br class="">
omega.sc.ipa q.sc.ipa f.sc.ipa<br class="">
then drop off the qualifications from the end.<br class="">
So test in order: delta.sc.ipa delta.sc delta<br class="">
<br class="">
Ack.<br class="">
<br class="">
Thanh, can you confirm that we should go ahead with this plan?<br class="">
</blockquote>
<div class=""><br class="">
</div>
<div class="">The point is that Unicode is not about glyphs, but characters.</div>
<div class="">This is stated very clearly in the Unicode documentation, and on numerous websites.</div>
<div class="">Then ‘character’ should be interpreted as 'how a (collection of) glyph(s) is used’.</div>
<div class=""><br class="">
</div>
<div class="">Thus 'a' and 'a.sc’ are used differently; the latter usually for some form of emphasis,</div>
<div class="">as in section headings or running headers, rather than the body text.</div>
<div class="">So it’s not just a matter of a different font style for these words.</div>
<div class="">It’s quite reasonable for Copy/Paste and screen- or Braille-readers to be able to</div>
<div class="">detect this difference, via the /ToUnicode map. </div>
<div class=""><br class="">
</div>
<div class="">Using just the glyph name 'a' is fine as a fall-back, but it should be up to the document</div>
<div class="">author or package-writer to enrich the full name with a more descriptive Unicode point,</div>
<div class="">when this is appropriate. At present pdftex prevents it, when using \pdfglyphtounicode .</div>
<div class=""><br class="">
</div>
<div class="">The only alternative, so far as I can see, is to provide an explicit CMap file, after having </div>
<div class="">temporarily set \pdfgentounicode=0 when the font is loaded. This is hard to get correct</div>
<div class="">and to find the right place to hook-in the relevant coding.</div>
<div class=""><br class="">
</div>
<div class="">Actually, another possibility is editing both the font and its encoding file - - something that </div>
<div class="">I’m sure we don’t want people doing.</div>
<br class="">
<blockquote type="cite" class=""><br class="">
Thanks,<br class="">
Karl<br class="">
</blockquote>
<div class=""><br class="">
</div>
<div class=""><br class="">
</div>
Hope this helps.</div>
<div class=""><br class="">
</div>
<div class=""><br class="">
</div>
<div class=""> Cheers,</div>
<div class=""><br class="">
</div>
<div class=""><span class="Apple-tab-span" style="white-space:pre"></span>Ross<br class="">
<br class="">
<div class=""><br class="">
Dr Ross Moore<br class="">
Mathematics Dept | 12 Wally’s Walk, 734<br class="">
Macquarie University, NSW 2109, Australia<br class="">
T: +61 2 9850 8955 | F: +61 2 9850 8114<br class="">
M:+61 407 288 255 | E: <a href="mailto:ross.moore@mq.edu.au" class="">ross.moore@mq.edu.au</a><br class="">
<br class="">
http://www.maths.mq.edu.au<br class="">
<br class="">
<span style="font-size: 12px; line-height: normal;"><a href="http://mq.edu.au/" target="_blank" style="font-size: 12px; line-height: normal;" class=""><span><br class="Apple-interchange-newline">
<span><img height="58" width="260" apple-inline="yes" id="E3C289AB-B1BF-40E4-892A-0D09323EF493" apple-width="yes" apple-height="yes" src="cid:image001.png@01D030BE.D37A46F0" class=""></span></span></a></span><br class="">
<br class="">
<br class="">
CRICOS Provider Number 00002J. Think before you print. <br class="">
Please consider the environment before printing this email.<br class="">
<br class="">
This message is intended for the addressee named and may <br class="">
contain confidential information. If you are not the intended <br class="">
recipient, please delete it and notify the sender. Views expressed <br class="">
in this message are those of the individual sender, and are not <br class="">
necessarily the views of Macquarie University.<br class="">
</div>
<br class="">
</div>
</body>
</html>