Text Analysis Token Counting Bug?

I'm building a fulltext index using the following code:

CREATE FULLTEXT INDEX "IND_<TABLE>" ON "<SCHEMA>"."<TABLE>"("LONGTEXT")

ASYNC

FAST PREPROCESS OFF

SEARCH ONLY OFF

TEXT ANALYSIS ON

LANGUAGE DETECTION ('EN')

CONFIGURATION 'LINGANALYSIS_BASIC';

There is a document in the table with the text in 'LONGTEXT' that has a word, "Berry." Berry appears once in the document, but it shows up with several thousands TA_COUNTER values and multiple records. I expected only one record for this document in $TA_IND<TABLE>. It shows multiple entries.

I"m trying to figure out if there is a bug in the tokenization in HANA I'm on SPS10 that is causing poor Berry to appear multiple times.

Example of the $TA_IND_<TABLE>

DOCID	TA_COUNTER	TA_NORMALIZED
000400062454	2,012	berry
000400062454	2,033	berry

I find it hard to believe that 2,012 or 2,033 berrys' were found even harder to believe that it was found twice in different locations?

From the SAP HANA Text Analysis Developer documentation.

TA_COUNTER: The token counter counts all tokens across the document. The order is only unique for a given processing type (hence the TA_RULE as the key). I checked the parent table as well and there is only one reference to the 000400062454 document identifier.

Text Analysis Token Counting Bug?

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...