Classify open text using custom dictionaries suited to your needs. Forget about Boolean classifiers and test the 90% accuracy that you can achieve using linguistic knowledge.
For a reliable categorization process, our tool first uses Deep Linguistic Analysis to detect entities, concepts and verb phrases (e.g. “Barack Obama”, “global warming”, “increase in prices”, “took off”). The linguistic representation of the text is then checked against a user-built dictionary containing the taxonomy. When a word or phrase in the text corresponds to a dictionary entry, the category for that entry is assigned to the text.
In the domain of mobile phones, a typical example of categorization will take into account concepts such as “screen”, “case”, “cover”, “camera”, “battery” which all belong to the PRODUCT category as nouns only.
Therefore, sentences like “I love the screen on my new Kindle Fire” or “I’ve bought a great new cover for my iPad” will be classified as belonging to the PRODUCT category.
However, sentences like “I hate it when they screen my iPad at security” or “I hope they’re going to cover the new Galaxy Tab in next week’s review” will not, because “screen” and “cover” are analyzed as verbs.
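The noun-versus-verb distinction above can be illustrated with a minimal sketch. It is not our actual engine: it assumes the text has already been tagged with parts of speech by a linguistic analyzer, and the names `DICTIONARY` and `categorize`, as well as the hand-written tags, are hypothetical and only serve the illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical dictionary: (lemma, part of speech) -> category.
# Entries are registered as nouns only, as in the mobile phone example.
DICTIONARY: Dict[Tuple[str, str], str] = {
    ("screen", "NOUN"): "PRODUCT",
    ("case", "NOUN"): "PRODUCT",
    ("cover", "NOUN"): "PRODUCT",
    ("camera", "NOUN"): "PRODUCT",
    ("battery", "NOUN"): "PRODUCT",
}


def categorize(tagged_tokens: List[Tuple[str, str]]) -> Set[str]:
    """Return every category whose dictionary entry matches a (lemma, POS) pair."""
    categories: Set[str] = set()
    for lemma, pos in tagged_tokens:
        category = DICTIONARY.get((lemma.lower(), pos))
        if category is not None:
            categories.add(category)
    return categories


# Tags are written by hand here; in practice they would come from the analyzer.
noun_use = [("I", "PRON"), ("love", "VERB"), ("the", "DET"),
            ("screen", "NOUN"), ("on", "ADP"), ("my", "PRON"),
            ("new", "ADJ"), ("Kindle Fire", "PROPN")]
verb_use = [("I", "PRON"), ("hate", "VERB"), ("it", "PRON"),
            ("when", "ADV"), ("they", "PRON"), ("screen", "VERB"),
            ("my", "PRON"), ("iPad", "PROPN"), ("at", "ADP"),
            ("security", "NOUN")]

print(categorize(noun_use))  # {'PRODUCT'}
print(categorize(verb_use))  # set()
```

Keying the dictionary on the pair (lemma, part of speech) rather than on the bare word is what keeps “screen” as a verb from triggering the PRODUCT category.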
The categorization service works with a user-supplied taxonomy, but sometimes there is no pre-existing dictionary or thesaurus of categories that can be easily integrated.
But we can also help with that: our concept and entity extraction services can be used to analyze documents belonging to the target domain in order to bootstrap the taxonomy-building process. Extracting the most relevant concepts, entities, and verb phrases from a corpus of documents significantly reduces the effort of assigning rules to categories.
You can also count on our expertise to help you with the creation of your dictionary through our linguistic consultancy services.
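As a rough illustration of this bootstrapping step, the sketch below ranks candidate dictionary entries by how many documents they appear in. It assumes the concepts have already been extracted for each document; the function name `rank_candidates`, the `min_docs` threshold, and the sample data are hypothetical, and document frequency is only one simple stand-in for "most relevant".

```python
from collections import Counter
from typing import Iterable, List, Tuple


def rank_candidates(extracted: Iterable[List[str]],
                    min_docs: int = 2) -> List[Tuple[str, int]]:
    """Rank concepts by the number of documents that mention them."""
    doc_freq: Counter = Counter()
    for concepts in extracted:
        # Count each concept once per document, ignoring case.
        doc_freq.update({concept.lower() for concept in concepts})
    ranked = [(term, n) for term, n in doc_freq.items() if n >= min_docs]
    # Most frequent first; ties are broken alphabetically.
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


# Hypothetical extraction output: one list of concepts per document.
corpus_concepts = [
    ["screen", "battery", "camera"],
    ["battery", "Screen", "charger"],
    ["battery", "cover"],
]

print(rank_candidates(corpus_concepts))
# [('battery', 3), ('screen', 2)]
```

The top of such a list gives a starting set of terms to review and assign to categories, instead of building the dictionary from a blank page.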
No. Our approach goes beyond keyword matching, so you will be able to create simple but accurate rules that replace the AND, OR and NOT operators. And forget about NEAR, as it is handled by our parsing engine.
Do you have to write monthly reports that involve managing loads of data? We know this can be very time- and resource-consuming. Text Categorization can help you extract only the information you are interested in and divide that data into the categories you have pre-defined.
Let us show you how useful this tool can be:
Our cloud services help market research professionals and data scientists perform sentiment analysis, categorization and entity & concept extraction, easily and effectively.
Free trial. No credit card required. No obligation.
José Echegaray 8, building 3, office 4
Parque Empresarial Las Rozas
28232 Las Rozas
1700 Montgomery Street, Suite 101