A document clustering approach for search engines

Chun Wei Tsai, Ting Wen Liang, Jiun Huei Ho, Chu Sing Yang, Ming Chao Chiang

研究成果: Conference contribution

7 引文 斯高帕斯(Scopus)

摘要

This paper presents a new internet search engine system called Document Clustering for Search Engines (DCSE). This system focuses on overcoming the following challenges faced by search engines: (1) relevance of the search results in response to a user query and (2) information coverage. The DCSE system is based upon a meta-search engine that integrates information retrieval (IR), information extraction (IE), genetic algorithm (GA) and document clustering algorithm into a single system. DCSE utilizes information extraction techniques and vector space model (VSM) calculations to determine the relevance of various data, and then categorizes the data via information retrieval and document clustering algorithm in order to better refine the result. Users will receive information that has been calculated and sorted and web links that are ranked according to their relevance. The end result will reduce the amount of time that users spend filtering out irrelevant data.

原文English
主出版物標題2006 IEEE International Conference on Systems, Man and Cybernetics
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1050-1055
頁數6
ISBN(列印)1424401003, 9781424401000
DOIs
出版狀態Published - 2006 一月 1
事件2006 IEEE International Conference on Systems, Man and Cybernetics - Taipei, Taiwan
持續時間: 2006 十月 82006 十月 11

出版系列

名字Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
2
ISSN(列印)1062-922X

Other

Other2006 IEEE International Conference on Systems, Man and Cybernetics
國家/地區Taiwan
城市Taipei
期間06-10-0806-10-11

All Science Journal Classification (ASJC) codes

  • 工程 (全部)

指紋

深入研究「A document clustering approach for search engines」主題。共同形成了獨特的指紋。

引用此