Hans Hagen (pragma@wxs.nl) wrote:
in xtag-utf.tex in .../tex/context/base (at least in my version and the beta)
On my SuSE 8.0 I didn't find it, but fortunately it's in the beta which I downloaded :-) So here I see something like: \defineUTFcharacter amacron 1 1 which corresponds to the Unicode code of amacron: U+0101 and it's according to the output of Vim's function: "ga" which shows: <ā> 257, Hex 0101, Octal 401. Now, it just a question of little work to slowly populate this vector with the values for different Unicode characters.
UTF-8_and_Unicode_FAQ has some test files and I'm sure this step is not a problem.
So, where can i find that doc?
The FAQ document is at: http://www.cl.cam.ac.uk/~mgk25/unicode.html, and the example files are under: http://www.cl.cam.ac.uk/~mgk25/unicode.html#examples Pls. take a look http://www.macchiato.com/unicode/Unicode_transcriptions.html under the example's list. There is also Unicode converter: http://www.macchiato.com/unicode/convert.html
a) c4 81 -> amacron b) 0101 -> amacron
so, c4 is the trigger, and 81 the character; this means that the function attached to c4 has to map the 81 onto \amacron
I'm not sure whether c4 is the trigger for the 81 character. c4 81 is two-byte representation in memory (that's what you'll see in some hexadecimal editor) of Unicode amacron character with the code U+0101, or simply said: utf-8 code for amacron :-)
can you make me a file with a list like:
amacron : 01/01 : c4/c8 : <utfcode> ^^^^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^^ normal ascii real utf
So, the line for amacron should look like: amacron : 01/01 c4/c8 since c4/c8 is utfcode for amacron. Is this OK? -- Gour gour@mail.inet.hr Registered Linux User #278493