Using Language Identifiers (RFC 3066)

 

Using Language Identifiers (RFC 3066)

Language identifiers as specified by RFC 3066, can have the form language, language-country, language-country-variant and some other specialized forms. The guidelines for choosing between language and language-country are ambiguous.

To clarify which form should be used, John Cowan and I have posted this list for review. This is currently a draft document. It will be continually revised as we get feedback from linguists and internationalization experts. (Although, I have a day job, so allow some time for me turn it around.) The topic is being discussed on the W3C www-international mail list and the IETF ietf-lang mail list. (The feedback thread starts www-international 2004OctDec-0131 and ietf-lang 2004-12-002431.)

This page is intended to list combinations of language code and country code that have been found useful in distinguishing language variants from each other. Please write the authors if you think either that there is no variation between two code pairs listed, or that there are additional code pairs that are useful in distinguishing written variants.

Send comments to Tex Texin and John Cowan. To improve turn-around of edits, please make sure the mail uses the subject: "Language Identifier List Comments". This will help us easily identify the mails and process them. Please use a different subject heading for other discussions.

There are a number of suggestions for deciding whether to use a one-level (language only) or two-level (language-region) tag. They require some discussion and will be added here shortly.

Language codes used as primary subtags are from ISO 639Country codes used as secondary subtags (or tertiary if script tags are secondary) are from ISO 3166. Note also that several unique language tags are also defined in the IANA Language Tag Registry. Script codes, generally 4 letters in length, such as "Hans" for Simplified Chinese, "Hant" for Traditonal Chinese, "Cyrl" for Cyrillic, etc. are defined by ISO 15924 Codes For The Representation Of Names Of Scripts.

DRAFT

Proposed List of 1-level Language Identifiers

Some languages have no significant variations, and therefore can be adequately represented by a language subtag alone, as opposed to a language subtag and country subtag. We would like to clarify which languages are in this category.

Proposed List of 2-level Language Identifiers

Some languages differ by region and therefore a language subtag and country subtag may be used to identify the language more precisely. Note that depending on the purpose or intended audience of the content, a very precise label may not be needed and a one-level language tag may be adequate. For example, using "en" for this page may be sufficient instead of en-US. If the tag is being used with an application (as opposed to labeling content) the requirement for precision may also vary.

John Cowan provided the original data for this list. The list has since been amended based on suggestions from a number of folks and considerable research.

Table Legend

In the following table, the language codes are followed by a parenthesized expression which contains the language name and if the region code is different from the region represented by the row, a region name. For example, en-US (English, U.S.).

Tags will look different depending on their source and whether they have been deprecated. Their appearance is as follows:

ISO 639 tags IANA registry tags deprecated tags

Region
Code
Region NameLanguages
[Format:Tag (Language Name, Region Name)]
(none)No region assignedeo (Esperanto) ia (Interlingua) ie (Interlingue) io (Ido) vo (Volapük)
ADAndorraca (Catalan) fr-AD (French)
AEUnited Arab Emiratesar-AE (Arabic)
AFAfghanistanfa-AF (Persian) ps (Pushto) ug (Uighur) uz-AF (Uzbek)
AGAntigua And Barbudaen-AG (English)
AIAnguillaen-AI (English)
ALAlbaniasq (Albanian)
AMArmeniahy (Armenian)
ANNetherlands Antillesnl-AN (Dutch)
AOAngolakg (Kongo) kj (Kuanyama) pt-AO (Portuguese)
ARArgentinacy-AR (Welsh) es-AR (Spanish) gn (Guarani)
ASAmerican Samoaen-AS (English) sm (Samoan)
ATAustriade-AT (German) hu (Hungarian)
AUAustraliaen-AU (English) YI (Yiddish)
AWArubanl-AW (Dutch)
AXÅland Islandssv-AX (Swedish)
AZAzerbaijanav (Avaric) az (Azerbaijani) az-Arab (Azerbaijani) az-Cyrl (Azerbaijani) az-Latn (Azerbaijani) os (Ossetian)
BABosnia And Herzegovinabs (Bosnian) hr-BA (Croatian) sr-BA (Serbian)
BBBarbadosen-BB (English)
BDBangladeshbn-BD (Bengali)
BEBelgiumde-BE (German) en-BE (English) fr-BE (French) nl-BE (Dutch) wa (Walloon) yi (Yiddish)
BFBurkina Fasobm (Bambara) fr-BF (French) ha (Hausa)
BGBulgariabg (Bulgarian) cu (Church Slavic) tr-BG (Turkish)
BHBahrainar-BH (Arabic)
BIBurundifr-BI (French) rn (Rundi)
BJBeninfr-BJ (French)
BMBermudaen-BM (English)
BNBrunei Darussalamen-BN (English) ms-BN (Malay)
BOBoliviaay (Aymara) es-BO (Spanish)
BRBrazilpt-BR (Portuguese)
BSBahamasen-BS (English)
BTBhutandz (Dzongkha) ne (Nepali)
BWBotswanaen-BW (English) sn (Shona) tn-BW (Tswana)
BYBelarusbe (Belarusian) cu (Church Slavic) yi (Yiddish)
BZBelizeen-BZ (English)
CACanadacr (Cree) en-CA (English) fr-CA (French) iu (Inuktitut) oj (Ojibwa) yi (Yiddish)
CCCocos (Keeling) Islandsms-CC (Malay)
CDCongo, The Democratic Republic Of Thefr-CD (French) kg (Kongo) ln-CD (Lingala) lu (Luba-Katanga)
CFCentral African Republicfr-CF (French) sg (Sango)
CGCongofr-CG (French) kg (Kongo) ln-CG (Lingala)
CHSwitzerlandde-CH (German) fr-CH (French) it-CH (Italian) rm (Raeto-Romance)
CICôte D'ivoireak (Akan) bm (Bambara) fr-CI (French)
CKCook Islandsen-CK (English)
CLChileay (Aymara) es-CL (Spanish)
CMCameroonen-CM (English) fr-CM (French)
CNChinabo (Tibetan) i-hak (Hakka) ii (Sichuan Yi) za (Zhuang) zh-CN (Chinese) zh-gan (Chinese) zh-guoyu (Chinese) zh-hakka (Chinese) zh-Hans (Chinese) zh-Hant (Chinese) zh-wuu (Chinese) zh-xiang (Chinese) zh-yue (Chinese)
COColombiaes-CO (Spanish)
CRCosta Ricaes-CR (Spanish)
CSSerbia And Montenegrocu (Church Slavic) hu (Hungarian) sq (Albanian) sr (Serbian) sr-Cyrl (Serbian) sr-Latn (Serbian)
CUCubaes-CU (Spanish)
CVCape Verdept-CV (Portuguese)
CXChristmas Islandms-CC (Malay,Cocos (Keeling) Islands)
CYCyprusel-CY (Greek) tr-CY (Turkish)
CZCzech Republiccs (Czech)
DEGermanyda-DE (Danish) da-DE (Danish) de-1901 (German) de-1996 (German) de-AT-1901 (German) de-AT-1996 (German) de-CH-1901 (German) de-CH-1996 (German) de-DE (German) de-DE-1901 (German) de-DE-1996 (German) dsb (Lower Sorbian) fy-DE (Frisian) hsb (Upper Sorbian) lb (Luxembourgish) nds (Low German) wen (Sorbian) yi (Yiddish)
DJDjiboutiaa-DJ (Afar) fr-DJ (French) so-DJ (Somali)
DKDenmarkda-DK (Danish) de-DK (German)
DMDominicaen-DM (English)
DODominican Republices-DO (Spanish)
DZAlgeriaar-DZ (Arabic)
ECEcuadores-EC (Spanish)
EEEstoniaet (Estonian) yi (Yiddish)
EGEgyptar-EG (Arabic)
EREritreaaa-ER (Afar) byn (Blin) en-ER (English) gez-ER (Geez) ti-ER (Tigrinya) tig (Tigre)
ESSpainan (Aragonese) ca (Catalan) es-ES (Spanish) eu (Basque) gl (Galician)
ETEthiopiaaa-ET (Afar) am (Amharic) en-ET (English) gez-ET (Geez) om-ET (Oromo) sid (Sidamo) so-ET (Somali) ti-ET (Tigrinya) wal (Walamo)
FIFinlandfi-FI (Finnish) smn (Inari Sami) sv-FI (Swedish)
FJFijien-FJ (English) fj (Fijian)
FKFalkland Islands (Malvinas)en-FK (English)
FMMicronesia, Federated States Ofen-FM (English)
FOFaroe Islandsda-FO (Danish) fo (Faroese)
FRFrancebr (Breton) co (Corsican) de-FR (German) fr-FR (French) oc (Occitan)
GAGabonfr-GA (French)
GBUnited Kingdomcy-GB (Welsh) en-boont (English) en-GB (English) en-GB-oed (English) en-scouse (English) fr-GB (French) ga-GB (Irish) gd (Gaelic) gv (Manx) kw (Cornish)
GDGrenadaen-GD (English)
GEGeorgiaab (Abkhazian) ka (Georgian) os (Ossetian)
GFFrench Guianafr-GF (French)
GHGhanaak (Akan) ee (Ewe) en-GH (English) tw (Twi)
GIGibraltaren-GI (English)
GLGreenlandda-GL (Danish) kl (Kalaallisut)
GMGambiabm (Bambara) en-GM (English) wo (Wolof)
GNGuineafr-GN (French)
GPGuadeloupefr-GP (French)
GQEquatorial Guineaes-GQ (Spanish)
GRGreeceel-GR (Greek)
GTGuatemalaes-GT (Spanish)
GUGuamch-GU (Chamorro) en-GU (English)
GWGuinea-Bissaupt-GW (Portuguese)
GYGuyanaen-GY (English)
HKHong Kongen-HK (English) zh-Hant (Chinese) zh-Hant-HK (Chinese) zh-HK (Chinese)
HNHondurases-HN (Spanish)
HRCroatiahr-HR (Croatian) it-HR (Italian)
HTHaitifr-HT (French) ht (Haitian)
HUHungaryde-HU (German) hu-HU (Hungarian) sk-HU (Slovak) sr-HU (Serbian)
IDIndonesiaid (Indonesian) jv (Javanese) su (Sundanese)
IEIrelanden-IE (English) ga-IE (Irish)
ILIsraelar-IL (Arabic) en-IL (English) he (Hebrew) yi (Yiddish)
INIndiaar-IN (Arabic) as (Assamese) bh (Bihari) bn-IN (Bengali) en-IN (English) gu (Gujarati) hi (Hindi) kn (Kannada) kok (Konkani) ks (Kashmiri) ml (Malayalam) mr (Marathi) ne (Nepali) or (Oriya) pa (Panjabi) pi (Pali) sa (Sanskrit) sd-IN (Sindhi) ta-IN (Tamil) te (Telugu) ur-IN (Urdu)
IOBritish Indian Ocean Territoryen-IO (English)
IQIraqar-IQ (Arabic) ku (Kurdish)
IRIran, Islamic Republic Ofae (Avestan) fa-IR (Persian) ku (Kurdish)
ISIcelandis (Icelandic)
ITItalyco (Corsican) de-IT (German) fr-IT (French) it-IT (Italian) sc (Sardinian)
JMJamaicaen-JM (English)
JOJordanar-JO (Arabic)
JPJapanja (Japanese)
KEKenyaen-KE (English) ki (Kikuyu) om-KE (Oromo) so-KE (Somali) sw-KE (Swahili)
KGKyrgyzstanky (Kirghiz) ug (Uighur)
KHCambodiakm (Khmer)
KIKiribatien-KI (English)
KMComorosfr-KM (French)
KNSaint Kitts And Nevisen-KN (English)
KPKorea, Democratic People's Republic Ofko-KP (Korean)
KRKorea, Republic Ofko-KR (Korean)
KWKuwaitar-KW (Arabic)
KYCayman Islandsen-KY (English)
KZKazakhstanav (Avaric) kk (Kazakh) os (Ossetian) ug (Uighur)
LALao People's Democratic Republiclo (Lao)
LBLebanonar-LB (Arabic) fr-LB (French)
LCSaint Luciaen-LC (English)
LILiechtensteinde-LI (German)
LKSri Lankasi (Sinhala)
LRLiberiaen-LR (English)
LSLesothoen-LS (English) st (Southern Sotho) xh (Xhosa)
LTLithuanialt (Lithuanian) yi (Yiddish)
LULuxembourgde-LU (German) fr-LU (French) lb (Luxembourgish)
LVLatvialv (Latvian) yi (Yiddish)
LYLibyan Arab Jamahiriyaar-LY (Arabic)
MAMoroccoar-MA (Arabic)
MCMonacofr-MC (French)
MDMoldova, Republic Ofmo (Moldavian) ro (Romanian) tr (Turkish) uk (Ukrainian) yi (Yiddish)
MGMadagascarfr-MG (French) mg (Malagasy)
MHMarshall Islandsen-MH (English) mh (Marshallese)
MKMacedonia, The Former Yugoslav Republic Ofcu (Church Slavic) mk (Macedonian)
MLMalibm (Bambara) fr-ML (French)
MMMyanmarmy (Burmese)
MNMongoliamn (Mongolian)
MOMacaozh-Hant (Chinese) zh-MO (Chinese)
MPNorthern Mariana Islandsch-MP (Chamorro) en-MP (English)
MQMartiniquefr-MQ (French)
MRMauritaniaar-MR (Arabic) wo (Wolof)
MSMontserraten-MS (English)
MTMaltaen-MT (English) mt (Maltese)
MUMauritiusen-MU (English)
MVMaldivesdv (Divehi)
MWMalawien-MW (English) ny (Chichewa)
MXMexicoes-MX (Spanish)
MYMalaysiajv (Javanese) ms-MY (Malay)
MZMozambiquept-MZ (Portuguese) sn (Shona)
NANamibiaen-NA (English) hz (Herero) ng (Ndonga)
NCNew Caledoniafr-NC (French)
NENigerff-NE (Fulah) fr-NE (French) ha (Hausa) kr (Kanuri)
NFNorfolk Islanden-NF (English)
NGNigeriaen-NG (English) ff-NG (Fulah) ha (Hausa) ig (Igbo) kr (Kanuri) yo (Yoruba)
NINicaraguaes-NI (Spanish)
NLNetherlandsfy-NL (Frisian) li (Limburgan) nl-NL (Dutch)
NONorwaynb (Bokmål) nn (Nynorsk) no (Norwegian) no-bok (Norwegian) no-nyn (Norwegian)
NPNepalne (Nepali)
NRNauruen-NR (English) na (Nauru)
NUNiueen-NU (English)
NZNew Zealanden-NZ (English) mi (Maori)
OMOmanar-OM (Arabic)
PAPanamaes-PA (Spanish)
PEPeruay (Aymara) es-PE (Spanish) qu (Quechua)
PFFrench Polynesiafr-PF (French) ty (Tahitian)
PGPapua New Guineaen-PG (English) ho (Hiri Motu)
PHPhilippinesen-PH (English) tl (Tagalog)
PKPakistanen-PK (English) sd-PK (Sindhi) ur-PK (Urdu)
PLPolandde-PL (German) pl (Polish) yi (Yiddish)
PMSaint Pierre And Miquelonfr-PM (French)
PNPitcairnen-PN (English)
PRPuerto Ricoen-PR (English) es-PR (Spanish)
PSPalestinian Territory, Occupiedar-PS (Arabic)
PTPortugalpt-PT (Portuguese)
PWPalauen-PW (English)
PYParaguayes-PY (Spanish) gn (Guarani)
QAQatarar-QA (Arabic)
RERéunionfr-RE (French)
RORomaniacu (Church Slavic) hu (Hungarian) ro (Romanian) YI (Yiddish)
RURussian Federationav (Avaric) ba (Bashkir) ce (Chechen) cu (Church Slavic) cv (Chuvash) kv (Komi) os (Ossetian) ru-RU (Russian) tt (Tatar) yi (Yiddish)
RWRwandaen-RW (English) fr-RW (French) rw (Kinyarwanda)
SASaudi Arabiaar-SA (Arabic)
SBSolomon Islandsen-SB (English)
SCSeychellesen-SC (English) fr-SC (French)
SDSudanar-SD (Arabic) din (Dinka) ha (Hausa)
SESwedenfi-SE (Finnish) se (Northern Sami) sma (S. Sami) sme (N. Sami) sv-SE (Swedish)
SGSingaporebn-SG (Bengali) en-SG (English) ms-SG (Malay) ta-SG (Tamil) zh-Hans-SG (Chinese) zh-SG (Chinese)
SHSaint Helenaen-SH (English)
SISloveniahu-SI (Hungarian) it-SI (Italian)
SKSlovakiahu (Hungarian) sk (Slovak)
SLSierra Leoneen-SL (English)
SMSan Marinoit-SM (Italian)
SNSenegalff-SN (Fulah) wo (Wolof)
SOSomaliaar-SO (Arabic) en-SO (English) so-SO (Somali)
SRSurinamejv (Javanese) nl-SR (Dutch)
STSao Tome And Principept-ST (Portuguese)
SVEl Salvadores-SV (Spanish)
SYSyrian Arab Republicar-SY (Arabic) syr (Syriac)
SZSwazilanden-SZ (English) ss-SZ (Swati)
TCTurks And Caicos Islandsen-TC (English)
TDChadar-TD (Arabic) fr-TD (French)
TGTogoee (Ewe) fr-TG (French) ha (Hausa)
THThailandsi (Sinhala) th (Thai)
TJTajikistanos (Ossetian) tg (Tajik) ug (Uighur)
TKTokelauen-TK (English)
TLTimor-Lestept-TL (Portuguese)
TMTurkmenistanos (Ossetian) tk (Turkmen)
TNTunisiaar-TN (Arabic)
TOTongaen-TO (English) to (Tongan)
TRTurkeyab (Abkhazian) av (Avaric) ku (Kurdish) tr-TR (Turkish) ug (Uighur)
TTTrinidad And Tobagoen-TT (English)
TVTuvalugil (Gilbertese) tvl (Tuvalu)
TWTaiwan, Province Of Chinazh-Hant (Chinese) zh-min (Chinese) zh-min-nan (Chinese) zh-TW (Chinese)
TZTanzania, United Republic Ofsw-TZ (Swahili)
UAUkraineab (Abkhazian) cu (Church Slavic) hu (Hungarian) os (Ossetian) pl (Polish) ro (Romanian) ru-UA (Russian) uk (Ukrainian) yi (Yiddish)
UGUgandaen-UG (English) lg (Ganda)
UMUnited States Minor Outlying Islandsen-UM (English)
USUnited Statesen-US (English) es-US (Spanish) haw (Hawaiian) ik (Inupiaq) nv (Navajo) oj (Ojibwa) yi (Yiddish)
UYUruguayes-UY (Spanish)
UZUzbekistanos (Ossetian) ug (Uighur) uz-Cyrl (Uzbek) uz-Latn (Uzbek) uz-UZ (Uzbek)
VAHoly See (Vatican City State)fr (French) it (Italian) la (Latin)
VCSaint Vincent And The Grenadinesen-VC (English)
VEVenezuelaes-VE (Spanish)
VGVirgin Islands, Britishen-VG (English)
VIVirgin Islands, U.S.en-VI (English)
VNViet Namvi (Vietnamese)
VUVanuatubi (Bislama) en-VU (English) fr-VU (French)
WFWallis And Futunafr-WF (French)
WSSamoaen-WS (English) sm (Samoan)
YEYemenar-YE (Arabic)
YTMayottefr-YT (French)
YUYugoslaviasr-YU (Serbian)
ZASouth Africaaf (Afrikaans) en-ZA (English) nr (S. Ndebele) ss-ZA (Swati) tn-ZA (Tswana) ts (Tsonga) ve (Venda) xh (Xhosa) yi (Yiddish) zu (Zulu)
ZMZambiaen-ZM (English)
ZWZimbabween-ZW (English) nd (N. Ndebele) sn (Shona) ve (Venda) zu (Zulu)

References

Copyright © 2004, 2005 Tex Texin, John Cowan. All rights reserved.
This page last updated 2005-01-01.
Top of page

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值