The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval

In an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English...

Full description

Saved in:
Bibliographic Details
Main Authors: Ari Pirkola, Erkka Leppänen, Kalervo Järvelin
Format: Article
Language:English
Published: University of Borås 2002-01-01
Series:Information Research: An International Electronic Journal
Online Access:http://informationr.net/ir/7-2/paper127.html
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832572648990703616
author Ari Pirkola
Erkka Leppänen
Kalervo Järvelin
author_facet Ari Pirkola
Erkka Leppänen
Kalervo Järvelin
author_sort Ari Pirkola
collection DOAJ
description In an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English cross-language retrieval in several experiments. Query keys were weighted and queries were reduced based on the RATF values of keys. The tests were carried out in TREC and CLEF document collections using the InQuery retrieval system. The TREC tests indicated that the best RATF-based queries delivered substantial and statistically significant performance improvements, and performed as well as syn-structured queries shown to be effective in many CLIR studies. The CLEF tests indicated the limitations of the use of RATF in CLIR. However, the best RATF-based queries performed better than baseline queries also in the CLEF collection.
format Article
id doaj-art-d1f8a36cff3548e1b03cdf89091edfd4
institution Kabale University
issn 1368-1613
language English
publishDate 2002-01-01
publisher University of Borås
record_format Article
series Information Research: An International Electronic Journal
spelling doaj-art-d1f8a36cff3548e1b03cdf89091edfd42025-02-02T08:57:56ZengUniversity of BoråsInformation Research: An International Electronic Journal1368-16132002-01-0172127The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language RetrievalAri PirkolaErkka LeppänenKalervo JärvelinIn an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English cross-language retrieval in several experiments. Query keys were weighted and queries were reduced based on the RATF values of keys. The tests were carried out in TREC and CLEF document collections using the InQuery retrieval system. The TREC tests indicated that the best RATF-based queries delivered substantial and statistically significant performance improvements, and performed as well as syn-structured queries shown to be effective in many CLIR studies. The CLEF tests indicated the limitations of the use of RATF in CLIR. However, the best RATF-based queries performed better than baseline queries also in the CLEF collection.http://informationr.net/ir/7-2/paper127.html
spellingShingle Ari Pirkola
Erkka Leppänen
Kalervo Järvelin
The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
Information Research: An International Electronic Journal
title The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
title_full The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
title_fullStr The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
title_full_unstemmed The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
title_short The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
title_sort ratf formula kwok s formula exploiting average term frequency in cross language retrieval
url http://informationr.net/ir/7-2/paper127.html
work_keys_str_mv AT aripirkola theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval
AT erkkaleppanen theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval
AT kalervojarvelin theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval
AT aripirkola ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval
AT erkkaleppanen ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval
AT kalervojarvelin ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval