TY - GEN
T1 - Learning knowledge from user search
AU - Lee, Yen Kuan
AU - Chuang, Kun Ta
N1 - Funding Information:
This paper was supported in part by Ministry of Science and Technology, R.O.C., under Contract 104-2221-E-006-050. In addition, the authors especially thank the Taiwan Ministry of Economic Affairs and Institute for Information Industry for financially supporting this research : “Plan title : Fundamental Industrial Technology Development Program (3/4)”.
Publisher Copyright:
© Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015.
PY - 2015/10/1
Y1 - 2015/10/1
N2 - In this paper, we introduce the concept of a novel application, called Knowledge Learning from User Search, aiming at identifying timely new knowledge triples from user search log. In the literature, the need of knowledge enrichment has been recognized as the key to the success of knowledge-based search. However, previous work of automatic knowledge extraction, such as Google Knowledge Vault, attempt to identify the unannotated knowledge triples from the full web-scale content in the offline execution. In our study, we show that most people demand a specific knowledge, such as the marriage between Brad Pitt and Angelina Jolie, soon after the information is announced. Moreover, the number of queries of such knowledge dramatically declines after a few days, meaning that the most people cannot obtain the precise knowledge from the execution of the offline knowledge enrichment. To remedy this, we propose the SCKE framework to extract new knowledge triples which can be executed in the online scenario. We model the'Query-Click Page' bipartite graph to extract the query correlation and to identify cohesive pairwise entities, finally statistically identifying the confident relation between entities. Our experimental studies show that new triples can also be identified in the very beginning after the event happens, enabling the capability to provide the up-to-date knowledge summary for most user queries.
AB - In this paper, we introduce the concept of a novel application, called Knowledge Learning from User Search, aiming at identifying timely new knowledge triples from user search log. In the literature, the need of knowledge enrichment has been recognized as the key to the success of knowledge-based search. However, previous work of automatic knowledge extraction, such as Google Knowledge Vault, attempt to identify the unannotated knowledge triples from the full web-scale content in the offline execution. In our study, we show that most people demand a specific knowledge, such as the marriage between Brad Pitt and Angelina Jolie, soon after the information is announced. Moreover, the number of queries of such knowledge dramatically declines after a few days, meaning that the most people cannot obtain the precise knowledge from the execution of the offline knowledge enrichment. To remedy this, we propose the SCKE framework to extract new knowledge triples which can be executed in the online scenario. We model the'Query-Click Page' bipartite graph to extract the query correlation and to identify cohesive pairwise entities, finally statistically identifying the confident relation between entities. Our experimental studies show that new triples can also be identified in the very beginning after the event happens, enabling the capability to provide the up-to-date knowledge summary for most user queries.
UR - http://www.scopus.com/inward/record.url?scp=85085917056&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085917056&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85085917056
T3 - Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015
SP - 248
EP - 262
BT - Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015
A2 - Chen, Sin-Horng
A2 - Wang, Hsin-Min
A2 - Chien, Jen-Tzung
A2 - Kao, Hung-Yu
A2 - Chang, Wen-Whei
A2 - Wang, Yih-Ru
A2 - Wu, Shih-Hung
PB - The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
T2 - 27th Conference on Computational Linguistics and Speech Processing, ROCLING 2015
Y2 - 1 October 2015 through 2 October 2015
ER -