[NTG-context] Re: DocBookInContext ... [CSX+, UTF-8 Roman, and Nagari Codings]

3 Dec 2002

      On Mon, Dec 02, 2002 at 06:40:30PM +0100, Gour wrote:
...
So here I see something like:
\defineUTFcharacter amacron	1  1
which corresponds to the Unicode code of amacron: U+0101 and it's according to
the output of Vim's function: "ga" which shows:
<ā> 257, Hex 0101, Octal 401.
Now, it just a question of little work to slowly populate this vector with the
values for different Unicode characters.
To save yourself time you could look at two C programmes that indicate
CSXp, UTF-8 Roman, and UTF-8 Devanagari codings:

 `csxp2ur' -- converts CSXp --> UTF-8 Roman

 `ur2ud.c' -- converts UTF-8 Roman --> UTF-8 Devanagari

Both are from:

  ftp://bombay.oriental.cam.ac.uk/pub/john/software/programs/

Regards,

 Richard

-- 
Richard Mahoney    |  E-mail: rbm49@ext.canterbury.ac.nz
78 Jeffreys Road   |          r.mahoney@comnet.net.nz
Fendalton          |  Telephone: 0064-3-351-5831
CHRISTCHURCH 8005  |  Cellular: 0064-25-829-986
NEW ZEALAND        |  http://homepages.comnet.co.nz/~r-mahoney