On Mon, Dec 02, 2002 at 06:40:30PM +0100, Gour wrote:
So here I see something like:
\defineUTFcharacter amacron 1 1
which corresponds to the Unicode code of amacron: U+0101 and it's according to the output of Vim's function: "ga" which shows:
<ā> 257, Hex 0101, Octal 401.
Now, it just a question of little work to slowly populate this vector with the values for different Unicode characters.
To save yourself time you could look at two C programmes that indicate CSXp, UTF-8 Roman, and UTF-8 Devanagari codings: `csxp2ur' -- converts CSXp --> UTF-8 Roman `ur2ud.c' -- converts UTF-8 Roman --> UTF-8 Devanagari Both are from: ftp://bombay.oriental.cam.ac.uk/pub/john/software/programs/ Regards, Richard -- Richard Mahoney | E-mail: rbm49@ext.canterbury.ac.nz 78 Jeffreys Road | r.mahoney@comnet.net.nz Fendalton | Telephone: 0064-3-351-5831 CHRISTCHURCH 8005 | Cellular: 0064-25-829-986 NEW ZEALAND | http://homepages.comnet.co.nz/~r-mahoney