Dataset Community

Annotated training data scarcity is the main problem when building Conversational AI models.

At Bitext, we’re working on using synthetic training data generation to solve this, and we’ve published several large datasets (with more than 250K utterances) for training and evaluating intent recognition models for English customer service chatbots on GitHub.

dataset-community-AI-Bitext

Our Customers

Working with 3 of the Top 5 Largest Companies in NASDAQ

Datasets

Customer Service Tagged Training Dataset for Intent Detection

The training dataset has the following specs:

  • Customer Service domain
  • 11 categories or intent groups
  • 27 intents assigned to one of the 11 categories
  • 7 entity/slot types

Customer Service Tagged Evaluation Dataset for Intent Detection

The evaluation dataset has the following specs :

  • Customer Service domain
  • 11 categories or intent groups
  • 27 intents assigned to one of the 11 categories
  • 7 entity/slot types

Customer Service Tagged Evaluation Dataset for Intent Detection-Colloquial

The Colloquial Evaluation dataset has the following specs:

  • Customer Service domain
  • 11 categories or intent groups
  • 27 intents assigned to one of the 11 categories
  • 270,000 utterances assigned to the 27 intents
  • 7 entity/slot types

Customer Service Tagged Evaluation Dataset for Intent Detection-Politeness

The Politeness Evaluation dataset has the following specs:

  • Customer Service domain
  • 11 categories or intent groups
  • 27 intents assigned to one of the 11 categories
  • 270,000 utterances assigned to the 27 intents
  • 7 entity/slot types

Customer Support Intent Detection Training Dataset for RASA

The training dataset for RASA has the following specs:

  • Customer Service domain
  • 11 categories or intent groups
  • 27 intents assigned to one of the 11 categories
  • 260,000 utterances assigned to the 27 intents

Contact us

If you need more information about our dataset or have any questions contact us by clicking on the button bellow

MADRID, SPAIN

Camino de las Huertas, 20, 28223 Pozuelo
Madrid, Spain

SAN FRANCISCO, USA

541 Jefferson Ave Ste 100, Redwood City
CA 94063, USA