On Fri, Oct 29, 2010 at 13:18, Steffen Wolfrum wrote:
Hi all,
I am just about to typeset a book of a russian author written in english, but with a lot of russian literature listed in the bibliography: The titles of theses sources are russian but in latin transliteration, like this ... O koordinacii mezhdunarodnyh i vneshnejekonomicheskih svjazej subjektov Rossijskoj Federacii
But even though I assigned "\language[ru]" the word "vneshnejekonomicheskih" eg. does not get hyphenated. And there are some dozen titles more that show the same problem ...
Is this (to not hyphenate) because of the transliteration? Do I have to choose another \language key?
Dear Steffen, The Russian patterns only cover the Cyrillic part. Serbian patterns are the only ones that cover both scripts, but even then the patterns themselves are seen as two different languages by TeX. The best thing to do would be to transliterate Russian patterns into Latin script (under one condition: transliteration needs to be one-to-one; if one cyrillic glyph transliterates into two latin characters, that doesn't help you). If you use LuaTeX you may then load the patterns on the fly. Another "easy" option would be to load any other slavic patterns as Jano suggested and then add exceptions where needed. I'm not sure if transliterated patterns belong to hyph-utf8. (If nothing else, Russian is transliterated differently into Slovenian for example, so one would formally then need "transliteration from Russian to any other given language written in Cyrillic script"). [still under assumption that you use LuaTeX and that transliteration is one-to-one] By far the easiest and most portable solution would be if you could convince Taco to implement something like "latin a is equivalent to cyrillic a as far as hyphenation is concerned" (which could also solve many other problems that we have). Actually, you can already do that by redefining \lccode of latin a to point to cyrillic a (and do that for the whole alphabet), but then you need to make sure that you don't use any commands for lowercasing/uppercasing words. If you need details, I can help you out, but first exact transliteration rules are needed. Mojca