Compounds in dictionary-based Cross-language information retrieval_revised

Bibliographic Details
Format: Article
Language: English
Published: University of Borås 2002-01-01
Series: Information Research: An International Electronic Journal
Online Access: http://informationr.net/ir/7-2/paper128.html
_version_ 1832574037100855296
collection DOAJ
description Compound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view, it matters that many natural languages are highly productive with compounds, so translation resources cannot include entries for all of them. Compounds are also often content-bearing words in a sentence. In Swedish, German and Finnish, roughly one tenth of the words in a text prepared for information retrieval purposes are compounds. Important research questions concerning compound handling in dictionary-based cross-language information retrieval are 1) splitting compounds into components, 2) normalisation of components, 3) translation of components and 4) query structuring for compounds and their components in the target language. This study evaluates the impact of compound processing on the performance of the cross-language information retrieval process, and the results indicate that the effect is clearly positive.
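The four steps named in the description (splitting, normalisation, translation and query structuring) can be illustrated with a minimal sketch. This is not the article's method: the toy German-English lexicon, the single linking element "s", the function names, and the InQuery-style `#syn`/`#uw3` operator strings with a window of 3 are all assumptions chosen for illustration.

```python
# Toy lexicon (German lemma -> English translations); illustrative data only.
LEXICON = {
    "arbeit": ["work", "labour"],
    "platz": ["place", "square"],
}
# Hypothetical set of linking elements to strip during normalisation.
LINKING = ("s",)


def normalise(part):
    """Return the lexicon form of a component, or None if it is unknown."""
    if part in LEXICON:
        return part
    for link in LINKING:
        if part.endswith(link) and part[: -len(link)] in LEXICON:
            return part[: -len(link)]
    return None


def split_compound(word):
    """Split a word into two normalised components, or return None."""
    word = word.lower()
    for i in range(2, len(word) - 1):
        head, tail = normalise(word[:i]), normalise(word[i:])
        if head and tail:
            return [head, tail]
    return None


def structure_query(word):
    """Translate a word and build a structured target-language query string."""
    whole = LEXICON.get(word.lower())
    if whole:  # the word itself has a dictionary entry
        return "#syn(" + " ".join(whole) + ")"
    parts = split_compound(word)
    if parts is None:  # untranslatable: pass the word through unchanged
        return word.lower()
    # Translations of one component form a synonym group; the groups are
    # combined with an unordered-window operator (window size is assumed).
    groups = ["#syn(" + " ".join(LEXICON[p]) + ")" for p in parts]
    return "#uw3(" + " ".join(groups) + ")"


print(structure_query("Arbeitsplatz"))
# prints: #uw3(#syn(work labour) #syn(place square))
```

Grouping the alternative translations of each component keeps a component with many dictionary senses from dominating the query, and the proximity operator approximates the original compound in the target language. A real system would need a full morphological analyser and dictionary instead of the two-entry lexicon used here.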
format Article
id doaj-art-621db8721c584a3aa89fda162cd82c0b
institution Kabale University
issn 1368-1613
language English
publishDate 2002-01-01
publisher University of Borås
record_format Article
series Information Research: An International Electronic Journal
title Compounds in dictionary-based Cross-language information retrieval_revised
url http://informationr.net/ir/7-2/paper128.html