Classify open text using custom dictionaries suited to your needs. Forget about Boolean classifiers and test the 90% accuracy that you can achieve using linguistic knowledge.Download a Real Case
Bitext’s categorization service classifies text into different categories according to a predefined taxonomy or dictionary to extract only relevant information according to your needs.
The added value of Bitext platform is the fact that our analysis categorizes in context, beyond isolated keywords. For example, while using the word “like” you can ignore it when acting as a conjunction “This flavor is like the rest” and focus on when it works as a verb “I like this flavor”.
For a reliable categorization process, our tool uses first Deep Linguistic Analysis to detect entities, concepts and verb phrases (e.g. “Barack Obama”, “global warming”, “increase in prices”, “took off”). The linguistic representation of the text is then checked against a user build dictionary containing the taxonomy. When a word or phrase in the text corresponds to a dictionary entry, the category for that entry is assigned to the text.
The categorization service works with a user-supplied taxonomy, but sometimes there is no pre-existing dictionary or thesaurus of categories that can be easily integrated.
But we are here to help: our concept and entity extraction services can be used to analyze documents belonging to the target domain in order to boot-strap the taxonomy building process. By extracting the most relevant concepts, entities, and verb phrases from a corpus of documents, the process of assigning rules to categories can be significantly reduced.
You can also count on our expertise to help you with the creation of your dictionary trough our linguistic consultancy services.
Let us show you how useful this tool can be:
Our cloud services help market research professionals and data scientists perform sentiment analysis, categorization and entity & concept extraction, easily and effectively.
Free trial. No credit card required. No obligation.
José Echegaray 8 , building 3, office 4
Parque Empresarial Las Rozas
28232 Las Rozas
1700 Montgomery Street, Suite 101