Language support for Language Detection
Use this article to learn which natural languages Language Detection supports.
The Language Detection feature can detect a wide range of languages, variants, dialects, and some regional/cultural languages, and returns each detected language with its name and code. The returned language codes conform to the BCP 47 standard, and most of them are ISO 639-1 identifiers.
If you have content expressed in a less frequently used language, you can try Language Detection to see if it returns a code. The response for languages that can't be detected is unknown.
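As a minimal sketch of handling the unknown case in client code, the following Python snippet reads a detection result and returns no code when detection failed. The response shape and the field names `documents`, `detectedLanguage`, `name`, and `code` are assumptions made for illustration; this article doesn't specify them, so check the API reference for the actual schema and for the exact spelling of the unknown marker.

```python
from typing import Optional

# Hypothetical response shape; the field names are illustrative assumptions,
# not taken from this article.
sample_response = {
    "documents": [
        {"id": "1", "detectedLanguage": {"name": "French", "code": "fr"}},
        {"id": "2", "detectedLanguage": {"name": "unknown", "code": "unknown"}},
    ]
}


def detected_code(document: dict) -> Optional[str]:
    """Return the detected language code, or None when detection failed."""
    code = document.get("detectedLanguage", {}).get("code", "")
    # Compare loosely so that "unknown", "Unknown", and "(Unknown)" are
    # all treated as "not detected".
    if code.strip("()").lower() in ("", "unknown"):
        return None
    return code


# Map each document ID to its detected code (None when not detected).
codes = {doc["id"]: detected_code(doc) for doc in sample_response["documents"]}
```

Returning `None` instead of the raw marker keeps downstream code from treating "unknown" as if it were a real language code.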
Languages supported by Language Detection
Language | Language Code |
---|---|
Afrikaans | af |
Albanian | sq |
Amharic | am |
Arabic | ar |
Armenian | hy |
Assamese | as |
Azerbaijani | az |
Bashkir | ba |
Basque | eu |
Belarusian | be |
Bengali | bn |
Bosnian | bs |
Bulgarian | bg |
Burmese | my |
Catalan | ca |
Central Khmer | km |
Chinese | zh |
Chinese Simplified | zh_chs |
Chinese Traditional | zh_cht |
Chuvash | cv |
Corsican | co |
Croatian | hr |
Czech | cs |
Danish | da |
Dari | prs |
Divehi | dv |
Dutch | nl |
English | en |
Esperanto | eo |
Estonian | et |
Faroese | fo |
Fijian | fj |
Finnish | fi |
French | fr |
Galician | gl |
Georgian | ka |
German | de |
Greek | el |
Gujarati | gu |
Haitian | ht |
Hausa | ha |
Hebrew | he |
Hindi | hi |
Hmong Daw | mww |
Hungarian | hu |
Icelandic | is |
Igbo | ig |
Indonesian | id |
Inuktitut | iu |
Irish | ga |
Italian | it |
Japanese | ja |
Javanese | jv |
Kannada | kn |
Kazakh | kk |
Kinyarwanda | rw |
Kirghiz | ky |
Korean | ko |
Kurdish | ku |
Lao | lo |
Latin | la |
Latvian | lv |
Lithuanian | lt |
Luxembourgish | lb |
Macedonian | mk |
Malagasy | mg |
Malay | ms |
Malayalam | ml |
Maltese | mt |
Maori | mi |
Marathi | mr |
Mongolian | mn |
Nepali | ne |
Norwegian | no |
Norwegian Nynorsk | nn |
Odia | or |
Pashto | ps |
Persian | fa |
Polish | pl |
Portuguese | pt |
Punjabi | pa |
Queretaro Otomi | otq |
Romanian | ro |
Russian | ru |
Samoan | sm |
Serbian | sr |
Shona | sn |
Sindhi | sd |
Sinhala | si |
Slovak | sk |
Slovenian | sl |
Somali | so |
Spanish | es |
Sundanese | su |
Swahili | sw |
Swedish | sv |
Tagalog | tl |
Tahitian | ty |
Tajik | tg |
Tamil | ta |
Tatar | tt |
Telugu | te |
Thai | th |
Tibetan | bo |
Tigrinya | ti |
Tongan | to |
Turkish | tr |
Turkmen | tk |
Upper Sorbian | hsb |
Uyghur | ug |
Ukrainian | uk |
Urdu | ur |
Uzbek | uz |
Vietnamese | vi |
Welsh | cy |
Xhosa | xh |
Yiddish | yi |
Yoruba | yo |
Yucatec Maya | yua |
Zulu | zu |
Romanized Indic Languages supported by Language Detection
Language | Language Code |
---|---|
Assamese | as |
Bengali | bn |
Gujarati | gu |
Hindi | hi |
Kannada | kn |
Malayalam | ml |
Marathi | mr |
Odia | or |
Punjabi | pa |
Tamil | ta |
Telugu | te |
Urdu | ur |
Script detection
Language | Script code | Scripts |
---|---|---|
Bengali (Bengali-Assamese) | as | Latn, Beng |
Bengali (Bangla) | bn | Latn, Beng |
Gujarati | gu | Latn, Gujr |
Hindi | hi | Latn, Deva |
Kannada | kn | Latn, Knda |
Malayalam | ml | Latn, Mlym |
Marathi | mr | Latn, Deva |
Oriya | or | Latn, Orya |
Gurmukhi | pa | Latn, Guru |
Tamil | ta | Latn, Taml |
Telugu | te | Latn, Telu |
Arabic | ar | Latn, Arab |
Cyrillic | tt | Latn, Cyrl |
Serbian | sr | Latn, Cyrl |
Unified Canadian Aboriginal Syllabics | iu | Latn, Cans |
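The script detection table above can be encoded as a simple lookup, for example to check whether a given language and script pair is covered before relying on script detection. This is an illustrative sketch only: the names `SCRIPTS_BY_LANGUAGE` and `supports_script` are hypothetical and not part of any SDK, and the data is copied from the table, with scripts given as ISO 15924 codes.

```python
# Language code -> script codes (ISO 15924), copied from the table above.
SCRIPTS_BY_LANGUAGE = {
    "as": ("Latn", "Beng"),
    "bn": ("Latn", "Beng"),
    "gu": ("Latn", "Gujr"),
    "hi": ("Latn", "Deva"),
    "kn": ("Latn", "Knda"),
    "ml": ("Latn", "Mlym"),
    "mr": ("Latn", "Deva"),
    "or": ("Latn", "Orya"),
    "pa": ("Latn", "Guru"),
    "ta": ("Latn", "Taml"),
    "te": ("Latn", "Telu"),
    "ar": ("Latn", "Arab"),
    "tt": ("Latn", "Cyrl"),
    "sr": ("Latn", "Cyrl"),
    "iu": ("Latn", "Cans"),
}


def supports_script(language_code: str, script: str) -> bool:
    """Return True if the table lists this script for the language code."""
    return script in SCRIPTS_BY_LANGUAGE.get(language_code, ())
```

For example, `supports_script("hi", "Deva")` is true, while `supports_script("en", "Latn")` is false because English doesn't appear in the script detection table.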