On 2/3/2022 10:01 PM, Mojca Miklavec wrote:
On Thu, 3 Feb 2022 at 21:41, Hans Hagen wrote:
I have also merged the Serbian hyphenation patterns, so there is no need to switch the language in order to have hyphenation in transliterated text. That was possible because cyrillic and latin scripts use different code points, and there are no conflicts in patterns. So I suggest merging the patterns for Serbian cyrillic and latin.
I'd like to hear Arthur / Mojca on that .... we can of course load them both but if that is an upstream merge i'll wait for that
Yes, loading both patterns at once is definitely the correct approach. That's what the rest of the TeX world already does (at least LuaTeX and XeTeX; pdfTeX not of course), see https://github.com/hyphenation/tex-hyphen/blob/master/hyph-utf8/tex/generic/...
We have two sets of Cyrillic patterns (and several Latin ones as well), so composing a single file was a bit of a (somewhat political) challenge. Now at least in theory the users are free to choose which of the two sets of patterns they want.
I never checked what ConTeXt was doing with the Serbian patterns. Personally I would suggest taking hyph-sh-cyrl.pat.txt and hyph-sh-latn.pat.txt. we currently do this:
{ "sr", "hyph-sr", "serbian", false, { "hyph-sr-cyrl", "hyph-sr-latn" }, }, so you suggest to replace that by the "sh" variants Hans ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl -----------------------------------------------------------------