On modeling remote and local dependencies in language

Yu Sheng Lai, Chung Hsien Wu

研究成果: Paper同行評審

摘要

In this paper, a statistical language model that can model both remote and local dependencies is proposed. This model takes into account the relationship between the predicted word and its preceding words without considering the order of the preceding words. Two primary parameters, the reliability coefficient and the combination factor, are proposed to achieve a better performance of the language model. The reliability coefficients identify the reliabilities of the remote dependencies to the predicted word. The combination factor gives a weight to the combination of the local dependency and the remote dependency. The language model was tested on the task of word clustering and compared to the traditional N-gram language model. A large corpus provided by Academia Sinica, Taiwan, containing 5 million words was used for training and testing. The experimental results show that the proposed model takes littler computation and achieves a better performance for large N compared to the traditional N-gram language model.

原文English
頁面123-136
頁數14
出版狀態Published - 1999

All Science Journal Classification (ASJC) codes

  • 語言與語言學
  • 言語和聽力

指紋

深入研究「On modeling remote and local dependencies in language」主題。共同形成了獨特的指紋。

引用此