[Koha-translate] Language, Script, Country, Encoding

koha-translate

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Koha-translate] Language, Script, Country, Encoding - an Explanation

From:	Dorian Meid
Subject:	[Koha-translate] Language, Script, Country, Encoding - an Explanation
Date:	Sun, 20 Jan 2008 15:53:04 +0100

I recognize slight uncertainties when submitting the metadata foryour translations. So I wanted to explain the basics a little.Koha uses RFC4646 http://rfc.net/rfc4646.html for languageidentification.It states, that a language is identified by several tags, separatedby a hyphen:


Language tag - Script tag - Region/Country tag

The language tag is written in lowercase, the script tag is writtenin lowercase with the first letter in uppercase and the region orcountry tag is written in uppercase.

Example: zh-Hans-CN
zh is Chinese, Hans is the simplified Chinese script and CN is China.

The language is how you speak or what word you use to name a thing.

The language tags are standardised in ISO 639-1 or ISO 639-2 http://www.loc.gov/standards/iso639-2/php/code_list.phpAs we are in a library environment it may be useful to mention thedifference between ISO 639-2/T and ISO 639-2/B.T refers the terminology code and B refers the bibliographic code,e.g. german has the tag "deu" in ISO 639-2/T and "ger" in ISO 639-2/B.The reason for this inconvenience is that some libraries assignedsome tags for languages (the B-tags) before the ISO (T)standardisation was made.The T and B differences are only in the three-letter tags of ISO639-2. So far we use the two-letter tags of ISO 639-1, but RFC4646allows also 639-2.

The script is how your characters look like or what you paint toproduce a specific sound.The script tags are standardised in ISO 15942 http://www.unicode.org/iso15924/codelists.htmlYou have to add the script tag if your language can be written inmore than one script, e.g. Hans for simplified Chinese or Hant fortraditional chinese, or if the specified language is not written inthe normal script e.g. de-Latf-DE for German in Fraktur.You should, but don't have to omit the script tag if there is onlyone commonly used script for your language.

The region or country is where the language is spoken, this isimportant because there often are differences between countries,which basically share the same language, e.g. British English andAmerican English.The region/country tag is either a two letter Country code assandardised in ISO 3166-1 http://www.iso.org/iso/country_codes/iso_3166_code_lists.htm or a three digit Region code as standardisedin UN M.49 http://unstats.un.org/unsd/methods/m49/m49.htmNormally we use the ISO letter code, but the UN region code can behandy when specifying a language spoken in more than one country,e.g. es-005 (Spanish as spoken in South America).

When given a script tag we know how your script should look like, butcomputers are dumb. They don't know written characters, the just knowbytes. The assignment of written (visual) characters to byte valuesis called character encoding. There are many different characterencodings and to make it even worse there are some scripts, which canbe successfully encoded in different ways.Normal character encodings are capable of assigning 128 or 256characters. Unicode is capable of several billions of characters andcan encode all used scripts, so it is the preffered choice for Kohathemes and translations.So please use UTF-8 for your document character encoding http://www.unicode.org/standard/WhatIsUnicode.htmlIf you can't use UTF-8 or don't know how to use it please ask thelist or at least specifiy the encoding you are using, so we cantranscode your document.


Hope that helps.
Maybe this should be added to the readme on translate or the wiki.

Dorian Meid

[Prev in Thread]

Current Thread

[Next in Thread]

[Koha-translate] Language, Script, Country, Encoding - an Explanation, Dorian Meid <=

Prev by Date: Re: [Koha-translate] Some more questions concerning translation strings
Next by Date: Re: [Koha-translate] 2 questions and a comment
Previous by thread: [Koha-translate] Start Russian translation of Koha 3.0
Next by thread: [Koha-translate] Lao translation
Index(es):
- Date
- Thread