Top stories identification from blog to news in TREC 2010 blog track

Yu Fan Lin, Jing Hau Wang, Liang Cheng Lai, Hung Yu Kao

研究成果: Conference article同行評審


In 2010 Blog Track, there are two tasks including Faceted Blog Distillation Task and Top Stories Identification Task. We mainly focus on the Top Stories Identification Task. In this task, there are two issues to solve. The first issue is ranking the important news stories on the specified day, named Story Ranking Task. The second issue is named News Blog Post Ranking Task. News Blog Post Ranking Task is ranking the blog posts that are relevant to the news story and diversifying the topics of blog posts. In Story Ranking Task, our team Ikm100 (NCKU CSIE IKMLAB) submitted three runs. In the first run, a news story is scored by its number of discussion posts. In the second run, our idea is that if the news story is discussed by more people and the supporting blog post is relatively important, the news story would be more important. In the last run, we use the "Relevant-Post Time-Entropy evaluation" to score the news story. In News Blog Post Ranking Task, we use the cosine similarity between the news story and the blog post, and also use importance of posts to extract the supporting blog posts of the news query.

期刊NIST Special Publication
出版狀態Published - 2010 十二月 1
事件19th Text REtrieval Conference, TREC 2010 - Gaithersburg, MD, United States
持續時間: 2010 十一月 162010 十一月 19

All Science Journal Classification (ASJC) codes

  • 工程 (全部)


深入研究「Top stories identification from blog to news in TREC 2010 blog track」主題。共同形成了獨特的指紋。