[NTG-context] Transliteration

Hans Hagen j.hagen at xs4all.nl
Thu Feb 3 22:11:17 CET 2022


On 2/3/2022 10:01 PM, Mojca Miklavec wrote:
> On Thu, 3 Feb 2022 at 21:41, Hans Hagen wrote:
>>
>>> I have also merged the Serbian hyphenation patterns, so there is no need
>>> to switch the language in order to have hyphenation in transliterated text.
>>> That was possible because cyrillic and latin scripts use different code
>>> points, and there are no conflicts in patterns.
>>> So I suggest merging the patterns for Serbian cyrillic and latin.
>>
>> I'd like to hear Arthur / Mojca on that .... we can of course load them
>> both but if that is an upstream merge i'll wait for that
> 
> Yes, loading both patterns at once is definitely the correct approach.
> That's what the rest of the TeX world already does (at least LuaTeX
> and XeTeX; pdfTeX not of course), see
>      https://github.com/hyphenation/tex-hyphen/blob/master/hyph-utf8/tex/generic/hyph-utf8/loadhyph/loadhyph-sr-latn.tex
> 
> We have two sets of Cyrillic patterns (and several Latin ones as
> well), so composing a single file was a bit of a (somewhat political)
> challenge.
> Now at least in theory the users are free to choose which of the two
> sets of patterns they want.
> 
> I never checked what ConTeXt was doing with the Serbian patterns.
> Personally I would suggest taking hyph-sh-cyrl.pat.txt and hyph-sh-latn.pat.txt.
we currently do this:

     { "sr",  "hyph-sr",            "serbian", false, { "hyph-sr-cyrl", 
"hyph-sr-latn" }, },

so you suggest to replace that by the "sh" variants

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
        tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------


More information about the ntg-context mailing list