Re: [NTG-context] DocBookInContext & multi-languages (newbie) / utf

2 Dec 2002

      Hans Hagen (pragma@wxs.nl) wrote:
...
in xtag-utf.tex in .../tex/context/base (at least in my version and the 
beta)
On my SuSE 8.0 I didn't find it, but fortunately it's in the beta which I
downloaded :-)

So here I see something like:

\defineUTFcharacter amacron	1  1

which corresponds to the Unicode code of amacron: U+0101 and it's according to
the output of Vim's function: "ga" which shows:

<ā> 257, Hex 0101, Octal 401.

Now, it just a question of little work to slowly populate this vector with the
values for different Unicode characters.
...
...
UTF-8_and_Unicode_FAQ has some test files and I'm sure this step is not a
problem.
So, where can i find that doc?
The FAQ document is at: http://www.cl.cam.ac.uk/~mgk25/unicode.html,
and the example files are under:

http://www.cl.cam.ac.uk/~mgk25/unicode.html#examples

Pls. take a look http://www.macchiato.com/unicode/Unicode_transcriptions.html

under the example's list.

There is also Unicode converter: http://www.macchiato.com/unicode/convert.html
...
...
a) c4 81 -> amacron
b) 0101  -> amacron
so, c4 is the trigger, and 81 the character; this means that the function 
attached to c4 has to map the 81 onto \amacron
I'm not sure whether c4 is the trigger for the 81 character.

c4 81 is two-byte representation in memory (that's what you'll see in some
hexadecimal editor) of Unicode amacron character with the code U+0101, or
simply said: utf-8 code for amacron :-)
...
can you make me a file with a list like:
amacron : 01/01 : c4/c8 : <utfcode>
^^^^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^^
normal ascii              real utf
So, the line for amacron should look like:

amacron	:	01/01	c4/c8

since c4/c8 is utfcode for amacron.

Is this OK?

-- 
Gour
gour@mail.inet.hr
Registered Linux User #278493