On Wed, May 20, 2015 at 4:15 PM, David Kastrup <dak@gnu.org> wrote:Hans Hagen <pragma@wxs.nl> writes:
> (Concerning parsing logs: as the cnf is under user control you cannot
> assume that the log lines are the same always, as some users can set
> them different; i always did. So log file parsers should be flexible
> in this respect.)
Standard TeX is the most fun in that respect. It wraps after 79 bytes,
never mind whether you are in the middle of a UTF-8 character or not.
That's sort of ugly to process with a UTF-8-aware system.infact I see different output in pdftex luatex and xetex:Hello\message{%1xxxxxxxxxx%2xxxxxxxxx%3xxxxxxxxx%4xxxxxxxxx%5xxxxxxxxx%6xxxxxxxxx%7xxxxxxxxx%8xxxxxx鹿xx%9xxxxxxxxx%10xxxxxxxx%11xxxxxxxx%12xxxxxxxx%13xxxxxxxx%14xxxxxxxx%15xxxxxxxx%16xxxxxxxx%}\byexetex and luatex correctly display 鹿 but luatex has this off-by-one "bug" that I still have to catch.--luigi