Taco Hoekwater
David Kastrup wrote:
Here at the DANTE conference I just learnt that Werner Lemberg is creating a large corpus of two separate "all hyphenations" and "main hyphenations" lists (about 400000 words IIRC) for German. So indeed it would appear that if LuaTeX offered hyphenation according to prioritized patterns, the data to make it typeset better documents in German would be reasonably well available.
If there are two 'hyphenation levels', wouldn't it be easier if luatex supported running through two (or even more) separate pattern sets, and added the 'hitcount' to the discretionary?
Easier on what account?
So breakpoint that appear in both sets of patterns would get an internal priority value of 2 instead of 1?
Main advantage: no need for a patched or postprocessed patgen.
Postprocessing is an obvious choice here.
Disadvantage: wastes a few CPU cycles because of multiple passes.
Well, hyphenation is not the fastest operation in the world. Doubling its runtime when one could instead add what amounts to an attribute to the final chosen point seems a bit pointless. On the other hand, running several patterns through, adding the valid points and making a decision based on that would allow to, say, choose a hyphen when it would look good in either English or German, or choose it when it's ok in 4 out of 5 selected European languages. -- David Kastrup, Kriemhildstr. 15, 44793 Bochum