TY - GEN
T1 - A web-based unsupervised algorithm for learning transliteration model to improve translation of low-frequency proper names
AU - Shia, Min Shiang
AU - Lin, Jiun Hung
AU - Yu, Scott
AU - Lu, Wen Hsiang
PY - 2005
Y1 - 2005
N2 - In machine translation, cross-language information retrieval, and cross-language question answering, the problems of unknown term translation are difficult to be solved. Although we have proposed several effective Web-based term translation extraction methods exploring Web resources to deal with translation of frequent Web query terms. However, many low-frequency unknown terms are still difficult to be translated by using our previous Web-based term translation extraction methods. Therefore, in this paper we propose a two-stage hybrid translation extraction method, which is composed of our pervious Web-based term translation extraction method and a new Web-based transliteration method to improve translation of low-frequency unknown proper names. Additionally, to construct a good quality transliteration model, we also present a Web-based unsupervised learning algorithm to automatically collect diverse English-Chinese transliteration pairs from the Web. Experimental results showed that our new method can make great improvements for translation of unknown proper names.
AB - In machine translation, cross-language information retrieval, and cross-language question answering, the problems of unknown term translation are difficult to be solved. Although we have proposed several effective Web-based term translation extraction methods exploring Web resources to deal with translation of frequent Web query terms. However, many low-frequency unknown terms are still difficult to be translated by using our previous Web-based term translation extraction methods. Therefore, in this paper we propose a two-stage hybrid translation extraction method, which is composed of our pervious Web-based term translation extraction method and a new Web-based transliteration method to improve translation of low-frequency unknown proper names. Additionally, to construct a good quality transliteration model, we also present a Web-based unsupervised learning algorithm to automatically collect diverse English-Chinese transliteration pairs from the Web. Experimental results showed that our new method can make great improvements for translation of unknown proper names.
UR - http://www.scopus.com/inward/record.url?scp=33847262997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33847262997&partnerID=8YFLogxK
U2 - 10.1109/NLPKE.2005.1598774
DO - 10.1109/NLPKE.2005.1598774
M3 - Conference contribution
AN - SCOPUS:33847262997
SN - 0780393619
SN - 9780780393615
T3 - Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05
SP - 420
EP - 425
BT - Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05
T2 - 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05
Y2 - 30 October 2005 through 1 November 2005
ER -