Duncan Hothersall wrote:
I think maybe it does. Is there anyone who is running the *minimal install* (from Hans' zip files) on either windows or linux who could test this for me? I just need you to try out a unicode accented character within an <mtext> element inside MathML. Here's my template again - put an unicode accented char where 'HERE' appears:
\useXMLfilter[utf]\usemodule[mathml] \starttext\startXMLdata <formula><math><mtext>HERE</mtext></math></formula> \stopXMLdata\stoptext
It's usually my first port of call, but AFAIK it's not possible to control the way the web browser re-encodes stuff before it is submitted, so the results are not reliable. This is a real shame - TeX-Live is how I usually confirm all my queries.
I'm not sure about it, but my experience is that if I ask for an UTF-8 encoded page (at least in Mozilla), my input is also submitted as UTF-8. (When I worked with phpMyAdmin for example, I had to use the same encoding as the database which was sometimes annoying - the pages had to be shown in the wrong encoding in order to be able to work with the data properly.) The rest of the contextgarden pages already has UTF-8 encoding, but I'm not sure if that wouldn't disturb some newbies forgetting to add \enableregime[utf] at the beginning of the document. I tried this out: \enableregime[utf] \useXMLfilter[utf]\usemodule[mathml] \starttext äöüčćšđž % rendered properly \startXMLdata <formula><math><mtext>äöüčćšđž</mtext></math></formula> \stopXMLdata\stoptext It doesn't work here either, but the problem doesn't seem to be in unicode, but in the rendering of accented characters. This example: \usemodule[mathml] \starttext\startXMLdata <formula><math><mtext>\"{a}\"{o}\"{u}\v{c}\v{s}\v{z}</mtext></math></formula> \stopXMLdata\stoptext fails as well. An interesting observation: I tested on live.contextgarden.com, on the latest ConTeXt in MikTeX distribution and in an old minimal ConTeXt distribution for Windows (6.12.2004). The results from MikTeX and live.contextgarden.net were equal. \v{c} resulted in something like "leftdoubleguillemont" overlapped with "c", \"{a}, \"{o} and \"{u} resulted in a dash over the letter a/o/u. In the minimal ConTeXt distribution the line \"{a}\"{o}\"{u}\v{c}\v{s}\v{z} resulted in "{a}"{o}"{u}v {c}v {s}v {z} (literally). The example with utf didn't even mind to compile there. Mojca