The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval
In an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Borås
2002-01-01
|
Series: | Information Research: An International Electronic Journal |
Online Access: | http://informationr.net/ir/7-2/paper127.html |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832572648990703616 |
---|---|
author | Ari Pirkola Erkka Leppänen Kalervo Järvelin |
author_facet | Ari Pirkola Erkka Leppänen Kalervo Järvelin |
author_sort | Ari Pirkola |
collection | DOAJ |
description | In an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English cross-language retrieval in several experiments. Query keys were weighted and queries were reduced based on the RATF values of keys. The tests were carried out in TREC and CLEF document collections using the InQuery retrieval system. The TREC tests indicated that the best RATF-based queries delivered substantial and statistically significant performance improvements, and performed as well as syn-structured queries shown to be effective in many CLIR studies. The CLEF tests indicated the limitations of the use of RATF in CLIR. However, the best RATF-based queries performed better than baseline queries also in the CLEF collection. |
format | Article |
id | doaj-art-d1f8a36cff3548e1b03cdf89091edfd4 |
institution | Kabale University |
issn | 1368-1613 |
language | English |
publishDate | 2002-01-01 |
publisher | University of Borås |
record_format | Article |
series | Information Research: An International Electronic Journal |
spelling | doaj-art-d1f8a36cff3548e1b03cdf89091edfd42025-02-02T08:57:56ZengUniversity of BoråsInformation Research: An International Electronic Journal1368-16132002-01-0172127The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language RetrievalAri PirkolaErkka LeppänenKalervo JärvelinIn an earlier study, we presented a query key goodness scheme, which can be used to separate between good and bad query keys. The scheme is based on the relative average term frequency (RATF) values of query keys. In the present paper, we tested the effectiveness of the scheme in Finnish to English cross-language retrieval in several experiments. Query keys were weighted and queries were reduced based on the RATF values of keys. The tests were carried out in TREC and CLEF document collections using the InQuery retrieval system. The TREC tests indicated that the best RATF-based queries delivered substantial and statistically significant performance improvements, and performed as well as syn-structured queries shown to be effective in many CLIR studies. The CLEF tests indicated the limitations of the use of RATF in CLIR. However, the best RATF-based queries performed better than baseline queries also in the CLEF collection.http://informationr.net/ir/7-2/paper127.html |
spellingShingle | Ari Pirkola Erkka Leppänen Kalervo Järvelin The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval Information Research: An International Electronic Journal |
title | The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval |
title_full | The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval |
title_fullStr | The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval |
title_full_unstemmed | The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval |
title_short | The RATF Formula (Kwok's Formula): Exploiting Average Term Frequency in Cross-Language Retrieval |
title_sort | ratf formula kwok s formula exploiting average term frequency in cross language retrieval |
url | http://informationr.net/ir/7-2/paper127.html |
work_keys_str_mv | AT aripirkola theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval AT erkkaleppanen theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval AT kalervojarvelin theratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval AT aripirkola ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval AT erkkaleppanen ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval AT kalervojarvelin ratfformulakwoksformulaexploitingaveragetermfrequencyincrosslanguageretrieval |