A document clustering approach for search engines

Chun Wei Tsai, Ting Wen Liang, Jiun Huei Ho, Chu Sing Yang, Ming Chao Chiang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

This paper presents a new internet search engine system called Document Clustering for Search Engines (DCSE). This system focuses on overcoming the following challenges faced by search engines: (1) relevance of the search results in response to a user query and (2) information coverage. The DCSE system is based upon a meta-search engine that integrates information retrieval (IR), information extraction (IE), genetic algorithm (GA) and document clustering algorithm into a single system. DCSE utilizes information extraction techniques and vector space model (VSM) calculations to determine the relevance of various data, and then categorizes the data via information retrieval and document clustering algorithm in order to better refine the result. Users will receive information that has been calculated and sorted and web links that are ranked according to their relevance. The end result will reduce the amount of time that users spend filtering out irrelevant data.

Original languageEnglish
Title of host publication2006 IEEE International Conference on Systems, Man and Cybernetics
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1050-1055
Number of pages6
ISBN (Print)1424401003, 9781424401000
DOIs
Publication statusPublished - 2006 Jan 1
Event2006 IEEE International Conference on Systems, Man and Cybernetics - Taipei, Taiwan
Duration: 2006 Oct 82006 Oct 11

Publication series

NameConference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
Volume2
ISSN (Print)1062-922X

Other

Other2006 IEEE International Conference on Systems, Man and Cybernetics
CountryTaiwan
CityTaipei
Period06-10-0806-10-11

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Fingerprint Dive into the research topics of 'A document clustering approach for search engines'. Together they form a unique fingerprint.

  • Cite this

    Tsai, C. W., Liang, T. W., Ho, J. H., Yang, C. S., & Chiang, M. C. (2006). A document clustering approach for search engines. In 2006 IEEE International Conference on Systems, Man and Cybernetics (pp. 1050-1055). [4273986] (Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics; Vol. 2). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSMC.2006.384538