Bitext | We help AI understand humans

NLP for Arabic – The case of lemmatization

Sep 13, 2022 | Chatbots, Core NLP for AI engines, Lemmatization, NLP

Arabic is a complex language for NLP tasks, even for simple ones like lemmatization. There are several reasons for this. First, Arabic builds words from roots: for example, the word كتاب (kitab, “book”) is derived from ك ت ب (k t b). Many related words are derived from...

How to Automate the Generation of Training Data for Conversational Bots

Aug 24, 2021 | AI, Chatbots, Lemmatization, Machine Learning, NLP, NLU, Synthetic data

Everything looks promising in the world of bots: big players are pushing platforms to build them (Google, Amazon, Facebook, Microsoft, IBM, Apple), large retail companies are adopting them (Starbucks, Domino’s, British Airways), the press is excited about movies becoming...

Knowledge Graph Generation for Financial Databases

Jul 8, 2019 | AI, Lemmatization, NLP, Synthetic data

People who use financial databases know how hard it is to keep information structured and legible. Don’t worry! Knowledge graphs are here to help. Data volumes continue to grow unchecked, and those datasets are hard to process and draw...

Synthetic Training Data for Chatbots

Jun 14, 2019 | Chatbots, conversational, Lemmatization, NLP, Synthetic data, text analysis

What is Training Data? Training data is the data that is used to train an NLU engine. An NLU engine allows chatbots to understand the intent of user queries. The training data is enriched by data labeling or data annotation, with information about entities, slots…...

Natural Language Processing (NLP) and Machine Learning (ML)

Apr 17, 2019 | AI, deep learning, Lemmatization, Machine Learning, NLP, Synthetic data, text analysis

Two concepts, one mission: to make machines understand humans. Natural Language Processing (NLP) and Machine Learning (ML) are all the rage right now as techniques that complement each other, rather than as NLP vs. ML. In this post, we will focus on NLP and how it works...

Decompounding German, Korean and More: a ‘Gesamt + Kunst + Werk’

Feb 5, 2019 | AI, Lemmatization, NLP, Synthetic data, text analysis

It’s true that Germans love their long words. Text processing pipelines, however, may not love them as much. The lack of Python NLP libraries adapted to German makes it difficult to analyze these kinds of words properly. Let us share with you our NLP...
