[Pkg-isocodes-devel] [pkg-isocodes-Bugs][314422] tt.po uses Latin script although the glibc locale tt_RU.UTF-8 uses cyrillic script

pkg-isocodes-bugs at alioth.debian.org pkg-isocodes-bugs at alioth.debian.org
Fri Nov 1 23:29:04 UTC 2013


pkg-isocodes-Bugs item #314422 was changed at 2013-11-02 00:29 by Tobias Quathamer
You can respond by visiting: 
https://alioth.debian.org/tracker/?func=detail&atid=413077&aid=314422&group_id=30316

>Status: Closed
Priority: 3
Submitted By: Mike FABIAN (mfabian-guest)
>Assigned to: Tobias Quathamer (toddy)
Summary: tt.po uses Latin script although the glibc locale tt_RU.UTF-8 uses cyrillic script 
Part: General


Initial Comment:

See also https://bugzilla.redhat.com/show_bug.cgi?id=1005174 which is caused by
this problem.

glibc has the following locales for Tatar (tt):

    $ locale -a | grep ^tt
    tt_RU
    tt_RU.utf8
    tt_RU.utf8 at iqtelif
    tt_RU at iqtelif

tt_RU.UTF-8 at iqtelif uses Latin script:

    $ LANG=tt_RU.UTF-8 at iqtelif date
    Com Sin  6 13:00:25 CEST 2013

and tt_RU.UTF-8 uses cyrillic script:

    $ LANG=tt_RU.UTF-8 date
    Җом Сен  6 13:01:34 CEST 2013

This causes the problem, that gnome-initial-setup, which uses
the available glibc locales to offer a list of languages
to the user to choose, displays Tatar twice,
once for tt_RU.utf8 at iqtelif and once for tt_RU.UTF-8.

That is good, but the languages in this list are shown as language
endonyms (with territory) and to get these endonyms, the iso-codes
package is used.

As there is only one translation for Tatar in iso-codes-3.46 in the files:

./iso_639_3/tt.po
./iso_639/tt.po
./iso_3166/tt.po

which uses Latin script, both lines displayed by gnome-initial-setup
for the locales tt_RU.utf8 at iqtelif and tt_RU.UTF-8 end up to be
displayed identically as 

    Tatarça (Urıs Patşahlıq)

So probably the translations using Latin script should be moved to
tt at iqtelif.po. That would make the two entries for tt_RU.utf8 at iqtelif and tt_RU.UTF-8 in gnome-initial-setup distinct, only the line for tt_RU.utf8 at iqtelif
would display 

    Tatarça (Urıs Patşahlıq)

then, the line for tt_RU.UTF-8 would fallback to English because of the
lack of a translation for Tatar using Cyrillic.

If possible, translations using the Cyrillic script should be added.

https://en.wikipedia.org/wiki/Tatar_language#Writing_system

seems to say that Cyrillic script is more commonly used for Tatar, at
least at the moment.



----------------------------------------------------------------------

>Comment By: Tobias Quathamer (toddy)
Date: 2013-11-02 00:29

Message:
Hi,

thanks a lot for the bug report. This should be fixed now and will be part of the next release. Please note that an automatic transliteration of the latin script into cyrillic has been added. If this results in strange or wrong words, please do not hesitate to open another bug report.

Regards,
Tobias

----------------------------------------------------------------------

You can respond by visiting: 
https://alioth.debian.org/tracker/?func=detail&atid=413077&aid=314422&group_id=30316



More information about the Pkg-isocodes-devel mailing list