Supported text embedding languages

All text embedding models support and have been evaluated on English-language text. Additionally, the text-multilingual-embedding-002 model supports and has been evaluated on the languages listed on this page.

Evaluated languages

  • Arabic (ar)
  • Bengali (bn)
  • English (en)
  • Spanish (es)
  • German (de)
  • Persian (fa)
  • Finnish (fi)
  • French (fr)
  • Hindi (hi)
  • Indonesian (id)
  • Japanese (ja)
  • Korean (ko)
  • Russian (ru)
  • Swahili (sw)
  • Telugu (te)
  • Thai (th)
  • Yoruba (yo)
  • Chinese (zh)

Supported languages

  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bulgarian
  • Burmese
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese
  • Corsican
  • Czech
  • Danish
  • Dutch
  • English
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scottish Gaelic
  • Serbian
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Sotho
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • West Frisian
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

What's next