Targeted s-gram matching: a novel n-gram matching technique for cross- and monolingual word form variants
We present a novel n-gram based string matching technique, which we call the targeted s-gram matching technique. In the technique, n-grams are classified into categories on the basis of character contiguity in words. The categories are then utilized in matching. The technique was compared with the c...
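The idea in the abstract, grouping n-grams by character contiguity and matching only within a group, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from this record: the function names `s_grams` and `targeted_similarity`, the use of character pairs (digrams), the example categories `((0,), (1, 2))` of skipped-character counts, and the Jaccard-style overlap score. The paper itself may define its categories and similarity measure differently.

```python
def s_grams(word, skip):
    """Return the set of character pairs in `word` whose two characters
    are separated by exactly `skip` intervening characters.
    skip=0 gives ordinary contiguous digrams."""
    step = skip + 1
    return {word[i] + word[i + step] for i in range(len(word) - step)}


def targeted_similarity(a, b, categories=((0,), (1, 2))):
    """Compare two words category by category (hypothetical scoring).

    Each category is a tuple of skip lengths. Pairs are only compared
    with pairs from the same category, so a contiguous digram never
    matches a digram formed by skipping characters.
    """
    shared = 0
    total = 0
    for category in categories:
        grams_a = set().union(*(s_grams(a, k) for k in category))
        grams_b = set().union(*(s_grams(b, k) for k in category))
        shared += len(grams_a & grams_b)
        total += len(grams_a | grams_b)
    # Assumed Jaccard-style ratio summed over categories.
    return shared / total if total else 0.0


if __name__ == "__main__":
    # Example spelling variants; the score depends on the assumed settings.
    print(targeted_similarity("pharmacology", "farmakologia"))
```

Keeping the categories separate is the point of the sketch: a spelling variant that inserts or drops a character can still share skipped pairs with the original word, while contiguous pairs are scored on their own.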
Main Authors: | Ari Pirkola, Heikki Keskustalo, Erkka Leppänen, Antti-Pekka Känsälä, Kalervo Järvelin |
---|---|
Format: | Article |
Language: | English |
Published: | University of Borås, 2002-01-01 |
Series: | Information Research: An International Electronic Journal |
Online Access: | http://informationr.net/ir/7-2/paper126.html |
Similar Items
- The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
  by: Ari Pirkola, et al.
  Published: (2002-01-01)
- Stemming and N-gram matching for term conflation in Turkish texts
  by: F. Çuna Ekmekçioglu, et al.
  Published: (1996-01-01)
- Environmental microbial communications in gram-positive and gram-negative bacteria
  by: P. Srikanth, et al.
  Published: (2023-11-01)
- Green gram and black gram: prospects of cultivation and breeding in Russian Federation
  by: M. A. Vishnyakova, et al.
  Published: (2019-01-01)
- Research on Word Vector Training Method Based on Improved Skip-Gram Algorithm
  by: Yachun Tang
  Published: (2022-01-01)