Using cross-media correlation for scene detection in travel videos

Wei Ta Chu, Che Cheng Lin, Jen Yu Yu

研究成果: Conference contribution

5 引文 斯高帕斯(Scopus)

摘要

Focusing on travel videos taken in uncontrolled environments and by amateur photographers, we exploit correlation between different modalities to facilitate effective travel video scene detection. Scenes in travel photos, i.e., content taken at the same scenic spot, can be easily determined by examining time information. For a travel video, we extract several keyframes for each video shot. Then, photos and keyframes are represented as a sequence of visual word histograms, respectively. Based on this representation, we transform scene detection into a sequence matching problem. After finding the best alignment between two sequences, we can determine scene boundaries in videos with the help of that in photos. We demonstrate that we averagely achieve a purity value of 0.95 if the proposed method is combined with conventional ones. We show that not only features of visual words aid in scene detection, but also cross-media correlation does.

原文English
主出版物標題CIVR 2009 - Proceedings of the ACM International Conference on Image and Video Retrieval
頁面132-138
頁數7
DOIs
出版狀態Published - 2009
事件ACM International Conference on Image and Video Retrieval, CIVR 2009 - Santorini Island, Greece
持續時間: 2009 七月 82009 七月 10

出版系列

名字CIVR 2009 - Proceedings of the ACM International Conference on Image and Video Retrieval

Conference

ConferenceACM International Conference on Image and Video Retrieval, CIVR 2009
國家Greece
城市Santorini Island
期間09-07-0809-07-10

All Science Journal Classification (ASJC) codes

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition

引用此