Translating unknown queries with web corpora for cross-language information retrieval

Pu Jen Cheng, Jei Wen Teng, Ruei Cheng Chen, Jenq Haur Wang, Wen Hsiang Lu, Lee Feng Chien

研究成果: Conference contribution

70 引文 斯高帕斯(Scopus)

摘要

It is crucial for cross-language information retrieval (CLIR) systems to deal with the translation of unknown queries1 due to that real queries might be short. The purpose of this paper is to investigate the feasibility of exploiting the Web as the corpus source to translate unknown queries for CLIR. We propose an online translation approach to determine effective translations for unknown query terms via mining of bilingual search-result pages obtained from Web search engines. This approach can alleviate the problem of the lack of large bilingual corpora, translate many unknown query terms, provide flexible query specifications, and extract semantically-close translations to benefit CLIR tasks- especially for cross-language Web search.

原文English
主出版物標題Proceedings of Sheffield SIGIR - Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
發行者Association for Computing Machinery (ACM)
頁面146-153
頁數8
ISBN(列印)1581138814, 9781581138818
DOIs
出版狀態Published - 2004
事件Proceedings of Sheffield SIGIR - Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - Sheffield, United Kingdom
持續時間: 2004 7月 252004 7月 29

出版系列

名字Proceedings of Sheffield SIGIR - Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Other

OtherProceedings of Sheffield SIGIR - Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
國家/地區United Kingdom
城市Sheffield
期間04-07-2504-07-29

All Science Journal Classification (ASJC) codes

  • 一般工程

指紋

深入研究「Translating unknown queries with web corpora for cross-language information retrieval」主題。共同形成了獨特的指紋。

引用此