An adaptive page clustering based weighting method for information retrieval

Yi Xian Lin, Hung Yu Kao

Research output: Contribution to conferencePaper

Abstract

With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.

Original languageEnglish
Pages199-204
Number of pages6
DOIs
Publication statusPublished - 2013 Jan 1
Event2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 - Taipei, Taiwan
Duration: 2013 Dec 62013 Dec 8

Other

Other2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013
CountryTaiwan
CityTaipei
Period13-12-0613-12-08

Fingerprint

Information retrieval
Websites
Search engines
Clustering algorithms
Explosions
Data structures
Screening
Pipelines
Internet

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence

Cite this

Lin, Y. X., & Kao, H. Y. (2013). An adaptive page clustering based weighting method for information retrieval. 199-204. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan. https://doi.org/10.1109/TAAI.2013.48
Lin, Yi Xian ; Kao, Hung Yu. / An adaptive page clustering based weighting method for information retrieval. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan.6 p.
@conference{72c80f43cd9341fd9c68ba22947c0006,
title = "An adaptive page clustering based weighting method for information retrieval",
abstract = "With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.",
author = "Lin, {Yi Xian} and Kao, {Hung Yu}",
year = "2013",
month = "1",
day = "1",
doi = "10.1109/TAAI.2013.48",
language = "English",
pages = "199--204",
note = "2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 ; Conference date: 06-12-2013 Through 08-12-2013",

}

Lin, YX & Kao, HY 2013, 'An adaptive page clustering based weighting method for information retrieval', Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan, 13-12-06 - 13-12-08 pp. 199-204. https://doi.org/10.1109/TAAI.2013.48

An adaptive page clustering based weighting method for information retrieval. / Lin, Yi Xian; Kao, Hung Yu.

2013. 199-204 Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan.

Research output: Contribution to conferencePaper

TY - CONF

T1 - An adaptive page clustering based weighting method for information retrieval

AU - Lin, Yi Xian

AU - Kao, Hung Yu

PY - 2013/1/1

Y1 - 2013/1/1

N2 - With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.

AB - With the coming of the era of information explosion, using Internet to obtain information has become the most convenient pipeline information flow. However, the found information mostly based on keyword matching through the search engines, and the search engines do not generally conduct filtering and screening in order to enhance the returns. If the web pages pass a systematic arrangement and are divided into multiple categories or clusters, the users will be guided to obtain real help of information. In this paper, we propose an adaptive web pages clustering algorithm to perform this task. It extracts features to reduce feature dimensions, then filters automatically web pages into its appropriate cluster and enhances the features of the pages to site features for different coefficients to improve the effect. Finally, providing users a more accurate search data model. The experimental results show that compared to the traditional TF-IDF, the proposed approach can find the needed web pages and the topics of the web pages in the corresponding cluster that are highly similar.

UR - http://www.scopus.com/inward/record.url?scp=84899410945&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84899410945&partnerID=8YFLogxK

U2 - 10.1109/TAAI.2013.48

DO - 10.1109/TAAI.2013.48

M3 - Paper

AN - SCOPUS:84899410945

SP - 199

EP - 204

ER -

Lin YX, Kao HY. An adaptive page clustering based weighting method for information retrieval. 2013. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan. https://doi.org/10.1109/TAAI.2013.48