Translating unknown cross-lingual queries in digital libraries using a web-based approach

Jenq Haur Wang, Jei Wen Teng, Pu Jen Cheng, Wen Hsiang Lu, Lee Feng Chien

Research output: Chapter in Book/Report/Conference proceedingConference contribution

30 Citations (Scopus)

Abstract

Users' cross-lingual queries to a digital library system might be short and not included in a common translation dictionary (unknown terms). In this paper, we investigate the feasibility of exploiting the Web as the corpus source to translate unknown query terms for cross-language information retrieval (CLIR) in digital libraries. We propose a Web-based term translation approach to determine effective translations for unknown query terms by mining bilingual search-result pages obtained from a real Web search engine. This approach can enhance the construction of a domain-specific bilingual lexicon and benefit CLIR services in a digital library that only has monolingual document collections. Very promising results have been obtained in generating effective translation equivalents for many unknown terms, including proper nouns, technical terms and Web query terms.

Original languageEnglish
Title of host publicationProceedings of the Fourth ACM/IEEE Joint Conference on Digital Libraries; Global Reach and Diverse Impact, JCDL 2004
PublisherAssociation for Computing Machinery (ACM)
Pages108-116
Number of pages9
ISBN (Print)1581138326, 9781581138320
DOIs
Publication statusPublished - 2004
EventProceedings of the Fourth ACM/IEEE Joint Conference on Digital Libraries; Global reach and Diverse Impact, JCDL 2004 - Tucson, AZ, United States
Duration: 2004 Jun 72004 Jun 11

Publication series

NameProceedings of the ACM IEEE International Conference on Digital Libraries, JCDL 2004

Other

OtherProceedings of the Fourth ACM/IEEE Joint Conference on Digital Libraries; Global reach and Diverse Impact, JCDL 2004
Country/TerritoryUnited States
CityTucson, AZ
Period04-06-0704-06-11

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Computer Science Applications
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'Translating unknown cross-lingual queries in digital libraries using a web-based approach'. Together they form a unique fingerprint.

Cite this