Core NLP Tools

Multilingual NLP tools trusted by market leaders in the financial, automotive, retail and technological sectors and reaching hundreds of thousands of international consumers worldwide.

Personalized solutions available both as a cloud or on-premise, as well as through our NLP API platform.

Core NLP Tools for
Lexical Analysis

Available in 77 languages

Try our multilingual core NLP tools for lexical analysis for free on our NLP API platform or request a demo for a personalized solution.

Afrikaans Burmese French Irish Gaelic Macedonian Portuguese Swedish
Albanian Catalan Galician Italian Malay Punjabi Tagalog
Amharic Chinese Georgian Japanese Malayalam Romanian Tamil
Arabic (MSA) Croatian German Kannada Marathi Russian Telugu
Armenian Czech Greek Kazakh Mongolian Serbian Thai
Assamese Danish Gujarati Khmer Nepali Sindhi Turkish
Azeri Dutch Hebrew Korean Norwegian Bokmal Sinhala Ukrainian
Basque English Hindi Kyrgyz Norwegian Nynorsk Slovak Urdu
Belarusian Esperanto Hungarian Lao Oriya Slovenian Uzbek
Bengali Estonian Icelandic Latvian Persian Spanish Vietnamese
Bulgarian Finnish Indonesian Lithuanian Polish Swahili Zulu

 

Afrikaans Estonian Kyrgyz Sindhi
Albanian Finnish Lao Sinhala
Amharic French Latvian Slovak
Arabic (MSA) Galician Lithuanian Slovenian
Armenian Georgian Macedonian Spanish
Assamese German Malay Swahili
Azeri Greek Malayalam Swedish
Basque Gujarati Marathi Tagalog
Belarusian Hebrew Mongolian Tamil
Bengali Hindi Nepali Telugu
Bulgarian Hungarian Norwegian Bokmal Thai
Burmese Icelandic Norwegian Nynorsk Turkish
Catalan Indonesian Oriya Ukrainian
Chinese Irish Gaelic Persian Urdu
Croatian Italian Polish Uzbek
Czech Japanese Portuguese Vietnamese
Danish Kannada Punjabi Zulu
Dutch Kazakh Romanian
English Khmer Russian
Esperanto Korean Serbian
Afrikaans Greek Oriya
Albanian Gujarati Persian
Amharic Hebrew Polish
Arabic (MSA) Hindi Portuguese
Armenian Hungarian Punjabi
Assamese Icelandic Romanian
Azeri Indonesian Russian
Basque Irish Gaelic Serbian
Belarusian Italian Sindhi
Bengali Japanese Sinhala
Bulgarian Kannada Slovak
Burmese Kazakh Slovenian
Catalan Khmer Spanish
Chinese Korean Swahili
Croatian Kyrgyz Swedish
Czech Lao Tagalog
Danish Latvian Tamil
Dutch Lithuanian Telugu
English Macedonian Thai
Esperanto Malay Turkish
Estonian Malayalam Ukrainian
Finnish Marathi Urdu
French Mongolian Uzbek
Galician Nepali Vietnamese
Georgian Norwegian Bokmal Zulu
German Norwegian Nynorsk

Word Embeddings

Split text into tokens and embed them into a 300-dimensional vector space taking into account each token’s key linguistic features.



Lemmatization

Identify all potential roots (lemmas) of each word in a sentence, using morphological analysis and carefully-curated lexicons.



Decompounding

Identify the compound words and extract the lemmas of the simple words that compose them.



Spelling Suggestions

Check the correct spelling of your text, identifying the spelling mistakes suggesting corrections for them.



Lexical Dictionaries

Recognize all words in your documents via our linguistically curated wordlists that cover all words of each language, including their morphological and semantic attributes.



Language Identification

Identify what language a text is written in, including multilingual text. It detects the language of the input text and returns a list of sentences with their respective language.

Core NLP Tools for
Syntactical Analysis

Available in 21 languages

Try our multilingual core NLP tools for syntactical analysis for free on our NLP API platform or request a demo for a personalized solution.

Bulgarian Croatian Dutch German Japanese Slovak Spanish
Catalan Czech English Hungarian Portuguese Slovenian Swedish
Chinese Danish French Italian Russian Serbian Turkish
Bulgarian Dutch Japanese Spanish
Catalan English Portuguese Swedish
Chinese French Russian Turkish
Croatian German Slovak
Czech Hungarian Slovenian
Danish Italian Serbian

Parsing

Produce parse trees describing the structure of the constituents of each sentence.



Segmentation

Split text into sentences accurately, taking into account Natural Language rules.



Tokenization

Split text into sequences of linguistically significant units (tokens), making the scope of your analysis more precise.



POS Tagging

Determine the Part of Speech (POS) of each in a sentence, helping you solve disambiguation.



Phrase Extraction

Extract the relevant multi-word noun, verb, adjective or adverbial phrases using morphological and syntactic analysis.

API

The most comprehensive NLP platform to enhance your AI and Machine Learning services.

Start your FREE TRIAL

DEMO

Do you have questions? Would you like to see a demo with one of our experts?

Do not hesitate to ask and we will contact you shortly.