All text embedding models support and have been evaluated on English-language
text. Additionally, the text-multilingual-embedding-002 model supports the
languages listed under Supported languages and has been evaluated on the
subset listed under Evaluated languages.
Evaluated languages
- Arabic (ar)
- Bengali (bn)
- English (en)
- Spanish (es)
- German (de)
- Persian (fa)
- Finnish (fi)
- French (fr)
- Hindi (hi)
- Indonesian (id)
- Japanese (ja)
- Korean (ko)
- Russian (ru)
- Swahili (sw)
- Telugu (te)
- Thai (th)
- Yoruba (yo)
- Chinese (zh)
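If you route text to the model based on its language, it can help to check first whether that language is in the evaluated set. The following is a minimal sketch, not part of any SDK: the names EVALUATED_LANGUAGE_CODES and is_evaluated_language are hypothetical, and reducing a tag such as zh-CN to its primary subtag is an assumption about how your application labels languages.

```python
# Language codes copied from the "Evaluated languages" list on this page.
EVALUATED_LANGUAGE_CODES = frozenset({
    "ar", "bn", "en", "es", "de", "fa", "fi", "fr", "hi",
    "id", "ja", "ko", "ru", "sw", "te", "th", "yo", "zh",
})


def is_evaluated_language(tag: str) -> bool:
    """Return True if the primary subtag of `tag` is in the evaluated list.

    Hypothetical helper: accepts tags such as "fr", "zh-CN", or "pt_BR"
    and compares only the part before the first "-" or "_".
    """
    primary = tag.strip().replace("_", "-").split("-", 1)[0].lower()
    return primary in EVALUATED_LANGUAGE_CODES


if __name__ == "__main__":
    for tag in ("fr", "zh-CN", "pt_BR"):
        print(tag, is_evaluated_language(tag))
```

A result of False here does not mean the language is unsupported. It only means the language is absent from the evaluated list, so check the Supported languages list as well.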
Supported languages
- Afrikaans
- Albanian
- Amharic
- Arabic
- Armenian
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bulgarian
- Burmese
- Catalan
- Cebuano
- Chichewa
- Chinese
- Corsican
- Czech
- Danish
- Dutch
- English
- Esperanto
- Estonian
- Filipino
- Finnish
- French
- Galician
- Georgian
- German
- Greek
- Gujarati
- Haitian Creole
- Hausa
- Hawaiian
- Hebrew
- Hindi
- Hmong
- Hungarian
- Icelandic
- Igbo
- Indonesian
- Irish
- Italian
- Japanese
- Javanese
- Kannada
- Kazakh
- Khmer
- Korean
- Kurdish
- Kyrgyz
- Lao
- Latin
- Latvian
- Lithuanian
- Luxembourgish
- Macedonian
- Malagasy
- Malay
- Malayalam
- Maltese
- Maori
- Marathi
- Mongolian
- Nepali
- Norwegian
- Pashto
- Persian
- Polish
- Portuguese
- Punjabi
- Romanian
- Russian
- Samoan
- Scottish Gaelic
- Serbian
- Shona
- Sindhi
- Sinhala
- Slovak
- Slovenian
- Somali
- Sotho
- Spanish
- Sundanese
- Swahili
- Swedish
- Tajik
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Urdu
- Uzbek
- Vietnamese
- Welsh
- West Frisian
- Xhosa
- Yiddish
- Yoruba
- Zulu
What's next
- Learn how to get text embeddings.