On Wed, May 20, 2015 at 4:15 PM, David Kastrup <dak@gnu.org> wrote:
Hans Hagen <pragma@wxs.nl> writes:

> (Concerning parsing logs: as the cnf is under user control you cannot
> assume that the log lines are the same always, as some users can set
> them different; i always did. So log file parsers should be flexible
> in this respect.)

Standard TeX is the most fun in that respect.  It wraps after 79 bytes,
never mind whether you are in the middle of a UTF-8 character or not.

That's sort of ugly to process with a UTF-8-aware system.

infact I see different output in pdftex  luatex and xetex:
 
Hello\message{%
1xxxxxxxxxx%
2xxxxxxxxx%
3xxxxxxxxx%
4xxxxxxxxx%
5xxxxxxxxx%
6xxxxxxxxx%
7xxxxxxxxx%
8xxxxxx鹿xx%
9xxxxxxxxx%
10xxxxxxxx%
11xxxxxxxx%
12xxxxxxxx%
13xxxxxxxx%
14xxxxxxxx%
15xxxxxxxx%
16xxxxxxxx%
}
\bye

xetex and luatex correctly display 鹿 but luatex has this off-by-one "bug" that I still have to catch.
--
luigi