Bitext NAMER Cracks Named Entity Recognition

On November 19, the Beyond Search Web log published a brief analysis of our multilingual NER (Named Entity Recognition system) technology.

The post highlighted the challenges of handling Chinese personal names in English to enable accurate and consistent cross-tabulation for analysts, researchers, and investigators.

Similar issues arise with organizational names, such as “Sun City” (a place and enterprise) or aliases like “Yati New City” for “Shwe Koko”; and, in general, with any language that is written in non-Roman alphabet and needs transliteration.

In fact, these issues affect to all languages that do not use Roman alphabet including Hindi, Malayalam or Vietnamese, since transliteration is not a one-to-one function but a one-to-many and, as a result, it generates ambiguity the hinders the work of analysts.

With real-time data streaming into government software, resolving ambiguities in entity identification is crucial, particularly for investigations into activities like money laundering. The Bitext NAMER addresses these challenges, including:

1. Correctly and identifying generic names.

2. Assigning them a type: person, place, time, organization…

3. Resolving aliases, also known as (AKAs), and psuedonyms.

4. Distinguishing similar names linked to potentially unrelated entities (e.g., “Levo Chan”).

Bitext’s proprietary methods support more than 20 languages, with an additional 30 languages available on request.

Bitext works with three of the top 5 US Big Tech firms.

In summary, Bitext NAMER enriches entity detection. Our unique method enables accurate, multilingual entity detection and normalization for a variety of applications.

More info about Bitext NAMER

admin

Recent Posts

Using Public Corpora to Build Your NER systems

Rationale. NER tools are at the heart of how the scientific community is solving LLM…

2 weeks ago

Open-Source Data and Training Issues

As described in our previous post “Using Public Corpora to Build Your NER systems”, we…

2 weeks ago

Why Semantic Intelligence Is the Missing Link in Active Metadata and Data Governance

The new Forrester Wave™: Data Governance Solutions, Q3 2025 makes one thing clear: governance is…

2 months ago

Bitext NAMER: Slashing Time and Costs in Automated Knowledge Graph Construction

The process of building Knowledge Graphs is essential for organizations seeking to organize, structure, and…

8 months ago

Multilingual Named Entity Recognition for Knowledge Graphs: Supporting 70+ Languages with Precision

In the era of data-driven decision-making, Knowledge Graphs (KGs) have emerged as pivotal tools for…

10 months ago

How LLM Verticalization Reduces Time and Cost in GenAI-Based Solutions

Verticalizing AI21’s Jamba 1.5 with Bitext Synthetic Text Efficiency and Benefits of Verticalizing LLMs –…

11 months ago