News story clustering from both what and how aspects: Using bag of word model and affinity propagation

Wei Ta Chu, Chao Chin Huang, Wen Fang Cheng

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

The 24-hour news TV channels repeat the same news stories again and again. In this paper we cluster hundreds of news stories broadcasted in a day into dozens of clusters according to topics, and thus facilitate efficient browsing and summarization. The proposed system automatically removes commercial breaks and detects anchorpersons, and then determines boundaries of news stories. Semantic concepts, the bag of visual word model and the bag of trajectory model are used to describe what and how objects present in news stories. After measuring similarity between stories by the earth mover's distance, the affinity propagation algorithm is utilized to cluster stories of the same topic together. The experimental results show that with the proposed methods sophisticated news stories can be effectively clustered.

Original languageEnglish
Title of host publicationMM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops - AIEMPro 2011 Workshop, AIEMPro'11
Pages7-12
Number of pages6
DOIs
Publication statusPublished - 2011 Dec 1
Event2011 ACM Multimedia Conference, MM'11 and Co-Located Workshops - 2011 ACM International Workshop on Automated Media Analysis and Production for Novel TV Services, AIEMPro'11 - Scottsdale, AZ, United States
Duration: 2011 Nov 282011 Dec 1

Publication series

NameMM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops - AIEMPro 2011 Workshop, AIEMPro'11

Conference

Conference2011 ACM Multimedia Conference, MM'11 and Co-Located Workshops - 2011 ACM International Workshop on Automated Media Analysis and Production for Novel TV Services, AIEMPro'11
Country/TerritoryUnited States
CityScottsdale, AZ
Period11-11-2811-12-01

All Science Journal Classification (ASJC) codes

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'News story clustering from both what and how aspects: Using bag of word model and affinity propagation'. Together they form a unique fingerprint.

Cite this