Audio-video summarization of TV news using speech recognition and shot change detection

Chien Lin Huang, Chia Hsin Hsieh, Chung-Hsien Wu

Research output: Conference contribution

2 Citations (Scopus)

Abstract

This paper presents an approach to audio-video summarization of TV news that provides concise information about the content while preserving the essential message of the original. In this study, anchor speech and field report videos are considered separately. First, the speech signal is automatically transcribed, and a confidence measure considering syntactic and semantic relations is used to estimate the reliability of the recognized words. For video skimming, the RGB color histogram difference is adopted to segment video shots and to evaluate the smoothness of image concatenation. The extracted anchor speech and the field report image sequence of the TV news are then aggregated into a summarization output. The experimental results indicate that the proposed approach effectively extracts important speech segments and gives a concise video sequence.
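As a rough illustration of the shot segmentation step named in the abstract, the following minimal sketch flags a shot change wherever the RGB color histogram difference between consecutive frames exceeds a threshold. The bin count, the L1 distance, the normalization, the threshold value, and all function names are assumptions made for this example; the abstract does not specify them, so this should not be read as the authors' implementation.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each value in 0..255
Frame = Sequence[Pixel]       # a frame flattened to a sequence of pixels


def rgb_histogram(frame: Frame, bins: int = 8) -> List[float]:
    """Return a normalized histogram with `bins` buckets per RGB channel."""
    hist = [0.0] * (3 * bins)
    width = 256 // bins  # assumed uniform bucket width
    for pixel in frame:
        for channel, value in enumerate(pixel):
            bucket = min(value // width, bins - 1)
            hist[channel * bins + bucket] += 1.0
    total = float(len(frame)) or 1.0  # avoid division by zero on empty frames
    return [count / total for count in hist]


def histogram_difference(a: Frame, b: Frame, bins: int = 8) -> float:
    """L1 distance between the two frames' histograms, scaled to [0, 1]."""
    ha, hb = rgb_histogram(a, bins), rgb_histogram(b, bins)
    # Each channel contributes at most 2 to the L1 distance, so divide by 6.
    return sum(abs(x - y) for x, y in zip(ha, hb)) / 6.0


def detect_shot_changes(
    frames: Sequence[Frame], threshold: float = 0.5, bins: int = 8
) -> List[int]:
    """Return each index i at which frame i is assumed to start a new shot."""
    return [
        i
        for i in range(1, len(frames))
        if histogram_difference(frames[i - 1], frames[i], bins) > threshold
    ]


# Toy input: two red frames, two blue frames, then one red frame.
red = [(255, 0, 0)] * 16
blue = [(0, 0, 255)] * 16
print(detect_shot_changes([red, red, blue, blue, red]))  # [2, 4]
```

The same difference score could plausibly serve the second purpose mentioned in the abstract: a low value between the last frame of one selected shot and the first frame of the next would suggest a smooth concatenation.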

Original language: English
Title of host publication: 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
Pages: 73-76
Number of pages: 4
Publication status: Published - 2005
Event: 9th European Conference on Speech Communication and Technology - Lisbon, Portugal
Duration: 2005 Sep 4 to 2005 Sep 8

Other

Other: 9th European Conference on Speech Communication and Technology
Country: Portugal
City: Lisbon
Period: 05-09-04 to 05-09-08

All Science Journal Classification (ASJC) codes

  • Engineering(all)
