[Dev-luatex] generalized hyphenation

David Kastrup dak at gnu.org
Thu Mar 6 22:56:25 CET 2008

Taco Hoekwater <taco at elvenkind.com> writes:

> David Kastrup wrote:
>> Here at the DANTE conference I just learnt that Werner Lemberg is
>> creating a large corpus of two separate "all hyphenations" and "main
>> hyphenations" lists (about 400000 words IIRC) for German.  So indeed
>> it would appear that if LuaTeX offered hyphenation according to
>> prioritized patterns, the data to make it typeset better documents in
>> German would be reasonably well available.
> If there are two 'hyphenation levels', wouldn't it be easier if luatex
> supported running through two (or even more) separate pattern sets,
> and added the 'hitcount' to the discretionary?

Easier on what account?

> So breakpoint that appear in both sets of patterns would get an
> internal priority value of 2 instead of 1?
> Main advantage: no need for a patched or postprocessed patgen.

Postprocessing is an obvious choice here.

> Disadvantage: wastes a few CPU cycles because of multiple passes.

Well, hyphenation is not the fastest operation in the world.  Doubling
its runtime when one could instead add what amounts to an attribute to
the final chosen point seems a bit pointless.

On the other hand, running several patterns through, adding the valid
points and making a decision based on that would allow to, say, choose a
hyphen when it would look good in either English or German, or choose it
when it's ok in 4 out of 5 selected European languages.

David Kastrup, Kriemhildstr. 15, 44793 Bochum

More information about the dev-luatex mailing list