TY - GEN
T1 - Semantic based background music recommendation for home videos
AU - Lin, Yin Tzu
AU - Tsai, Tsung Hung
AU - Hu, Min Chun
AU - Cheng, Wen Huang
AU - Wu, Ja Ling
PY - 2014
Y1 - 2014
N2 - In this paper, we propose a new background music recommendation scheme for home videos and two new features describing the short-term motion/tempo distribution in visual/aural content. Unlike previous researches that merely matched the visual and aural contents through a perceptual way, we incorporate the textual semantics and content semantics while determining the matching degree of a video and a song. The key idea is that the recommended music should contain semantics that relate to the ones in the input video and that the rhythm of the music and the visual motion of the video should be harmonious enough. As a result, a few user-given tags and automatically annotated tags are used to compute their relation to the lyrics of the songs for selecting candidate musics. Then, we use the proposed motion-direction histogram (MDH) and pitch tempo pattern (PTP) to do the second-run selection. The user preference to the music genre is also taken into account as a filtering mechanism at the beginning. The primitive user evaluation shows that the proposed scheme is promising.
AB - In this paper, we propose a new background music recommendation scheme for home videos and two new features describing the short-term motion/tempo distribution in visual/aural content. Unlike previous researches that merely matched the visual and aural contents through a perceptual way, we incorporate the textual semantics and content semantics while determining the matching degree of a video and a song. The key idea is that the recommended music should contain semantics that relate to the ones in the input video and that the rhythm of the music and the visual motion of the video should be harmonious enough. As a result, a few user-given tags and automatically annotated tags are used to compute their relation to the lyrics of the songs for selecting candidate musics. Then, we use the proposed motion-direction histogram (MDH) and pitch tempo pattern (PTP) to do the second-run selection. The user preference to the music genre is also taken into account as a filtering mechanism at the beginning. The primitive user evaluation shows that the proposed scheme is promising.
UR - http://www.scopus.com/inward/record.url?scp=84893491676&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893491676&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-04117-9_26
DO - 10.1007/978-3-319-04117-9_26
M3 - Conference contribution
AN - SCOPUS:84893491676
SN - 9783319041162
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 283
EP - 290
BT - MultiMedia Modeling - 20th Anniversary International Conference, MMM 2014, Proceedings
T2 - 20th Anniversary International Conference on MultiMedia Modeling, MMM 2014
Y2 - 6 January 2014 through 10 January 2014
ER -