A collection of free resources for you to download, including guides, white papers, e-books, benchmarks, and case studies.
Arabic Sentiment Text Similarity
Pre Built Training Data
Lexical Data Resources
List of Services
Synonym Data Resources
Twitter Sentiment Analysis
Understanding your results
Create your coding plan
Booleans and POS Tagging
Knowledge Graph Generation for Financial Databases
People who use financial databases are aware of the hardships of ensuring information is...
Evaluate the Quality of your Chatbots and Conversational Agents
It is always important to evaluate the quality of your chatbots and conversational...
Synthetic Training Data for Chatbots
What is Training Data? Training data is the data that is used to train an NLU engine. An...
Natural Language Processing (NLP) and Machine Learning (ML)
Two concepts, one mission: to make machines understand humans. Natural Language...
Bitext’s Customer Support Dataset for free
We have shown in previous posts why Synthetic Training Data is the best way to boost the...
Decompounding German, Korean and More: a ‘Gesamt + Kunst + Werk’
It’s a true story that Germans love their long words. However, this fact may not be so...
Speed Up Your Bot Training with Artificial Data
If you want your chatbot to recognize a specific intent, you need to provide it with a...
Siri Speaking Arabic: What Is Failing?
Almost three years after Apple launched its well-known voice assistant Siri for the...
Artificial training data: how to speed up your bot training
Bots built upon machine learning need long training processes to have the ability to...
Improving Rasa’s results by 30% with artificial training data: Part II
Increasing bot accuracy has never been so easy. How? Generating artificial training...
Benchmark on Amazon Lex
Check out how we improved Amazon Lex accuracy by 50% using our training data
Benchmark on Entity Extraction
This report compares Bitext’s entity extraction software to 3 other engines (CRFSuite, Stanford and SENNA)
Benchmark on Microsoft LUIS
Increase accuracy on the LUIS platform by up to 40% using Bitext training data.
Benchmark on Lemmatization
A brief comparison of stemmers and lemmatizers
Benchmark on Dialogflow
A benchmark based on Dialogflow shows accuracy increases of up to 40%
Chatbot Multilingual Synthetic Data
Deploying a bot capable of engaging in successful conversations for retail.
Download the Case Study about our work with TechCrunch.
Consumer Insights in Minutes
Learn how Movistar saved 75% using Bitext services.
Market research leader saves 65% of its time when looking for insights.
Automating Manual Coding
Learn how a market research leader achieved 65% savings using Bitext.
Discover how one of our customers began saving time and money.
E-books and Cheat Sheets
How Linguistics Can Improve Chatbots
Solving chatbot issues using linguistics.
Lemmatization vs Stemming
Download practical examples of the two methods in different languages.
Lemmatization and POS Tagging for Deep Learning
How both impact Deep Learning.
Anonymization for GDPR Compliance
Are you ready for the GDPR?
Lemmatization for Topic Modeling
Discover how lemmatization impacts Topic Modeling.
How to Solve Chatbot Problems
How to solve 3 common chatbot issues.