Anchor text mining for translation extraction of query terms

Wen Hsiang Lu, Lee Feng Chein, Hsi Jian Lee

Research output: Contribution to journalConference article

5 Citations (Scopus)

Abstract

This paper presents an approach to automatically extracting the bilingual translations of many Web query terms through mining the Web anchor texts. Some preliminary experiments are conducted on using 109,416 Web pages containing both Chinese and English anchor texts in their in-links to extract Chinese translations of 200 English queries selected from popular query terms in Taiwan. It is found that the effective translations of 75% of the popular query terms can be extracted, in which 87.2% cannot be obtained in common translation dictionaries.

Original languageEnglish
Pages (from-to)388-389
Number of pages2
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
Publication statusPublished - 2001 Oct 22
Event24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - New Orleans, LA, United States
Duration: 2001 Sep 92001 Sep 13

Fingerprint

Anchors
Glossaries
Websites
Experiments
Query
Text mining
World Wide Web

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Hardware and Architecture

Cite this

@article{994334a8903140519b7fec6d58e272e3,
title = "Anchor text mining for translation extraction of query terms",
abstract = "This paper presents an approach to automatically extracting the bilingual translations of many Web query terms through mining the Web anchor texts. Some preliminary experiments are conducted on using 109,416 Web pages containing both Chinese and English anchor texts in their in-links to extract Chinese translations of 200 English queries selected from popular query terms in Taiwan. It is found that the effective translations of 75{\%} of the popular query terms can be extracted, in which 87.2{\%} cannot be obtained in common translation dictionaries.",
author = "Lu, {Wen Hsiang} and Chein, {Lee Feng} and Lee, {Hsi Jian}",
year = "2001",
month = "10",
day = "22",
language = "English",
pages = "388--389",
journal = "SIGIR Forum (ACM Special Interest Group on Information Retrieval)",
issn = "0163-5840",
publisher = "Association for Computing Machinery (ACM)",

}

Anchor text mining for translation extraction of query terms. / Lu, Wen Hsiang; Chein, Lee Feng; Lee, Hsi Jian.

In: SIGIR Forum (ACM Special Interest Group on Information Retrieval), 22.10.2001, p. 388-389.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Anchor text mining for translation extraction of query terms

AU - Lu, Wen Hsiang

AU - Chein, Lee Feng

AU - Lee, Hsi Jian

PY - 2001/10/22

Y1 - 2001/10/22

N2 - This paper presents an approach to automatically extracting the bilingual translations of many Web query terms through mining the Web anchor texts. Some preliminary experiments are conducted on using 109,416 Web pages containing both Chinese and English anchor texts in their in-links to extract Chinese translations of 200 English queries selected from popular query terms in Taiwan. It is found that the effective translations of 75% of the popular query terms can be extracted, in which 87.2% cannot be obtained in common translation dictionaries.

AB - This paper presents an approach to automatically extracting the bilingual translations of many Web query terms through mining the Web anchor texts. Some preliminary experiments are conducted on using 109,416 Web pages containing both Chinese and English anchor texts in their in-links to extract Chinese translations of 200 English queries selected from popular query terms in Taiwan. It is found that the effective translations of 75% of the popular query terms can be extracted, in which 87.2% cannot be obtained in common translation dictionaries.

UR - http://www.scopus.com/inward/record.url?scp=0034785486&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034785486&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:0034785486

SP - 388

EP - 389

JO - SIGIR Forum (ACM Special Interest Group on Information Retrieval)

JF - SIGIR Forum (ACM Special Interest Group on Information Retrieval)

SN - 0163-5840

ER -