Hans Hagen (pragma@wxs.nl) wrote:
sure, but for that i need input on how those vector look like; xtag-utf is a starting point, but i also need the second 'bank', so whoi can
Where can I get some more info how this xtag-utf vector should look like?
(1) provide me with proper test files (can be simple text with utf 8 text)
UTF-8_and_Unicode_FAQ has some test files and I'm sure this step is not a problem.
(2) provide the mapping list in terms of <nr><nr> => \namedglyph
I need some explanation: e.g. amacron (small latin letter "a" with macron") has Unicode code U+0101. When I look within Vim (g8 function) it shows me that it has "c4 81" hex value in UTF-8 encoding. In this way it's possible to get Unicode code & hex value in UTF-8 encoding. So, I'm interested, what would be the correct entry for the above mentioned list in the case for amacron: a) c4 81 -> amacron b) 0101 -> amacron or something else? Please, give me some more info and I'd glad to help since I'm sure that utf-8 support for ConTeXt is the right way to go. With package like DocBookInConTeXt one can author directly in XML and have all the advantages of using standard DTD, then one can map the document in ConTeXt and take advantage of its capabilities, and get the superb TeX quality output. With the utf-8 support, to get the westernized transliteration of Sanskrit mentioned in other thread, is piece of cake.
later some kind of language dependency needs to be brought in
Sure.
[so, the framework is already there]
Nice to hear that. Let's move forward :-) Sincerely, Gour -- Gour gour@mail.inet.hr Registered Linux User #278493