TY - GEN
T1 - Using cross-media correlation for scene detection in travel videos
AU - Chu, Wei Ta
AU - Lin, Che Cheng
AU - Yu, Jen Yu
N1 - Copyright:
Copyright 2010 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - Focusing on travel videos taken in uncontrolled environments and by amateur photographers, we exploit correlation between different modalities to facilitate effective travel video scene detection. Scenes in travel photos, i.e., content taken at the same scenic spot, can be easily determined by examining time information. For a travel video, we extract several keyframes for each video shot. Then, photos and keyframes are represented as a sequence of visual word histograms, respectively. Based on this representation, we transform scene detection into a sequence matching problem. After finding the best alignment between two sequences, we can determine scene boundaries in videos with the help of that in photos. We demonstrate that we averagely achieve a purity value of 0.95 if the proposed method is combined with conventional ones. We show that not only features of visual words aid in scene detection, but also cross-media correlation does.
AB - Focusing on travel videos taken in uncontrolled environments and by amateur photographers, we exploit correlation between different modalities to facilitate effective travel video scene detection. Scenes in travel photos, i.e., content taken at the same scenic spot, can be easily determined by examining time information. For a travel video, we extract several keyframes for each video shot. Then, photos and keyframes are represented as a sequence of visual word histograms, respectively. Based on this representation, we transform scene detection into a sequence matching problem. After finding the best alignment between two sequences, we can determine scene boundaries in videos with the help of that in photos. We demonstrate that we averagely achieve a purity value of 0.95 if the proposed method is combined with conventional ones. We show that not only features of visual words aid in scene detection, but also cross-media correlation does.
UR - http://www.scopus.com/inward/record.url?scp=74049139450&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=74049139450&partnerID=8YFLogxK
U2 - 10.1145/1646396.1646420
DO - 10.1145/1646396.1646420
M3 - Conference contribution
AN - SCOPUS:74049139450
SN - 9781605584805
T3 - CIVR 2009 - Proceedings of the ACM International Conference on Image and Video Retrieval
SP - 132
EP - 138
BT - CIVR 2009 - Proceedings of the ACM International Conference on Image and Video Retrieval
T2 - ACM International Conference on Image and Video Retrieval, CIVR 2009
Y2 - 8 July 2009 through 10 July 2009
ER -