Generative and discriminative modeling toward semantic context detection in audio tracks

研究成果: Conference contribution

12 引文 斯高帕斯(Scopus)

摘要

Semantic-level content analysis is a crucial issue to achieve efficient content retrieval and management. We propose a hierarchical approach that models the statistical characteristics of several audio events over a time series to accomplish semantic context detection. Two stages, including audio event and semantic context modeling/testing, are devised to bridge the semantic gap between physical audio features and semantic concepts. For action movies we focused in this work, hidden Markov models (HMMs) are used to model four representative audio events, i.e. gunshot, explosion, car-braking, and engine sounds. At the semantic context level, generative (ergodic hidden Markov model) and discriminative (support vector machine, SVM) approaches are investigated to fuse the characteristics and correlations among various audio events, which provide cues for detecting gunplay and car-chasing scenes. The experimental results demonstrate the effectiveness of the proposed approaches and draw a sketch for semantic indexing and retrieval. Moreover, the differences between two fusion schemes are discussed to be the reference for future research.

原文English
主出版物標題Proceedings of the 11th International Multimedia Modelling Conference, MMM 2005
頁面38-45
頁數8
DOIs
出版狀態Published - 2005
事件11th International Multimedia Modelling Conference, MMM 2005 - Melbourne, VIC, Australia
持續時間: 2005 1月 122005 1月 14

出版系列

名字Proceedings of the 11th International Multimedia Modelling Conference, MMM 2005

Conference

Conference11th International Multimedia Modelling Conference, MMM 2005
國家/地區Australia
城市Melbourne, VIC
期間05-01-1205-01-14

All Science Journal Classification (ASJC) codes

  • 電腦繪圖與電腦輔助設計
  • 建模與模擬

指紋

深入研究「Generative and discriminative modeling toward semantic context detection in audio tracks」主題。共同形成了獨特的指紋。

引用此