Core NLP Tools
Multilingual NLP tools trusted by market leaders in the financial, automotive, retail and technology sectors, reaching hundreds of thousands of consumers worldwide.
Personalized solutions are available in the cloud or on-premises, as well as through our NLP API platform.

Core NLP Tools for Lexical Analysis
Available in 77 languages
Try our multilingual core NLP tools for lexical analysis for free on our NLP API platform or request a demo for a personalized solution.

Afrikaans | Burmese | French | Irish Gaelic | Macedonian | Portuguese | Swedish |
---|---|---|---|---|---|---|
Albanian | Catalan | Galician | Italian | Malay | Punjabi | Tagalog |
Amharic | Chinese | Georgian | Japanese | Malayalam | Romanian | Tamil |
Arabic (MSA) | Croatian | German | Kannada | Marathi | Russian | Telugu |
Armenian | Czech | Greek | Kazakh | Mongolian | Serbian | Thai |
Assamese | Danish | Gujarati | Khmer | Nepali | Sindhi | Turkish |
Azeri | Dutch | Hebrew | Korean | Norwegian Bokmål | Sinhala | Ukrainian |
Basque | English | Hindi | Kyrgyz | Norwegian Nynorsk | Slovak | Urdu |
Belarusian | Esperanto | Hungarian | Lao | Oriya | Slovenian | Uzbek |
Bengali | Estonian | Icelandic | Latvian | Persian | Spanish | Vietnamese |
Bulgarian | Finnish | Indonesian | Lithuanian | Polish | Swahili | Zulu |

Word Embeddings
Split text into tokens and embed them into a 300-dimensional vector space taking into account each token’s key linguistic features.

Lemmatization
Identify all potential roots (lemmas) of each word in a sentence, using morphological analysis and carefully curated lexicons.

Decompounding
Identify compound words and extract the lemmas of the simple words that compose them.
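
To make the idea concrete, here is a minimal Python sketch of lexicon-based decompounding. It is an illustration only, not the platform's implementation or API: the `LEXICON` contents, the `split_compound` function and the `min_len` parameter are all hypothetical names chosen for this example, and real decompounders also handle linking elements and ambiguous splits.

```python
# Hypothetical toy lexicon mapping surface forms to lemmas.
# A production lexicon would be far larger and linguistically curated.
LEXICON = {
    "sun": "sun",
    "flower": "flower",
    "seed": "seed",
    "seeds": "seed",
    "tooth": "tooth",
    "brush": "brush",
    "brushes": "brush",
}


def split_compound(word, lexicon=LEXICON, min_len=3):
    """Return the lemmas of the simple words composing `word`.

    Returns an empty list when the word cannot be split into at least
    two parts that are all present in the lexicon.
    """
    word = word.lower()
    # Try every split point that leaves at least `min_len` characters on each side.
    for i in range(min_len, len(word) - min_len + 1):
        head, tail = word[:i], word[i:]
        if head not in lexicon:
            continue
        if tail in lexicon:
            return [lexicon[head], lexicon[tail]]
        # The tail may itself be a compound, so recurse on it.
        rest = split_compound(tail, lexicon, min_len)
        if rest:
            return [lexicon[head]] + rest
    return []


print(split_compound("toothbrushes"))
print(split_compound("sunflowerseeds"))
```

With this toy lexicon, "toothbrushes" splits into the lemmas "tooth" and "brush", and "sunflowerseeds" into "sun", "flower" and "seed". The sketch returns the first split it finds; ranking competing splits is left out.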

Spelling Suggestions
Check the spelling of your text, identifying spelling mistakes and suggesting corrections for them.
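
As a rough illustration of how spelling suggestions can work, the sketch below flags words missing from a wordlist and proposes close matches using Python's standard `difflib` module. This is an assumption-laden toy, not the platform's method: `WORDLIST`, `spelling_suggestions` and the `cutoff` value of 0.7 are hypothetical choices made for the example.

```python
import re
from difflib import get_close_matches

# Hypothetical miniature wordlist; a real checker relies on a full lexicon.
WORDLIST = ["please", "receive", "recipe", "believe", "the", "package"]


def spelling_suggestions(text, wordlist=WORDLIST, max_suggestions=3):
    """Map each unknown word in `text` to a list of close matches from `wordlist`."""
    known = set(wordlist)
    suggestions = {}
    for word in re.findall(r"[a-z']+", text.lower()):
        if word not in known:
            # get_close_matches ranks candidates by string similarity, best first.
            suggestions[word] = get_close_matches(
                word, wordlist, n=max_suggestions, cutoff=0.7
            )
    return suggestions


print(spelling_suggestions("Please recieve the package"))
```

Only "recieve" is reported here, with "receive" among its suggestions. String similarity alone ignores context and word frequency, which production spell checkers typically take into account.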

Lexical Dictionaries
Recognize all words in your documents via our linguistically curated wordlists that cover all words of each language, including their morphological and semantic attributes.

Language Identification
Identify the language a text is written in, even when the text mixes several languages. The tool detects the language of the input text and returns a list of sentences, each labeled with its language.

Core NLP Tools for Syntactical Analysis
Available in 21 languages
Try our multilingual core NLP tools for syntactical analysis for free on our NLP API platform or request a demo for a personalized solution.

Bulgarian | Croatian | Dutch | German | Japanese | Slovak | Spanish |
---|---|---|---|---|---|---|
Catalan | Czech | English | Hungarian | Portuguese | Slovenian | Swedish |
Chinese | Danish | French | Italian | Russian | Serbian | Turkish |

Parsing
Produce parse trees describing the structure of the constituents of each sentence.

Segmentation
Split text into sentences accurately, taking natural-language rules into account.

Tokenization
Split text into sequences of linguistically significant units (tokens), making the scope of your analysis more precise.
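
The following Python sketch shows segmentation and tokenization in their simplest rule-based form. It is a toy under stated assumptions, not the platform's implementation: the `segment` and `tokenize` functions and their regular expressions are hypothetical, work only for English-like text, and will wrongly split after abbreviations such as "Dr.", which is exactly the kind of case language-specific rules are meant to handle.

```python
import re


def segment(text):
    """Split text into sentences at ., ! or ? followed by whitespace and a capital letter."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [part for part in parts if part]


def tokenize(sentence):
    """Split a sentence into word tokens (keeping apostrophes inside words) and punctuation."""
    return re.findall(r"\w+(?:'\w+)*|[^\w\s]", sentence)


for sentence in segment("It works. Don't panic, please! Is it fast?"):
    print(tokenize(sentence))
```

Each sentence is tokenized separately, so "Don't panic, please!" becomes the tokens "Don't", "panic", ",", "please" and "!".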

POS Tagging
Determine the part of speech (POS) of each word in a sentence, helping you resolve ambiguity.

Phrase Extraction
Extract the relevant multi-word noun, verb, adjective or adverbial phrases using morphological and syntactic analysis.

Personalized Solutions
We provide solutions to market leaders in the financial, automotive, retail and technology sectors, reaching hundreds of thousands of consumers worldwide.
Whether you’re looking for a specific implementation for one of our tools or a solution in a language we don’t yet have available, our team of experts will provide you with the perfect solution to your needs.

Bitext NLP API Platform
Discover our NLP API platform where you will find a wide variety of multilingual NLP tools and solutions for chatbots that will help you create the best customer experience for your business. Sign up and try it for free!
