TY - GEN
T1 - Spoken document summarization using acoustic, prosodic and semantic information
AU - Huang, Chien Lin
AU - Hsieh, Chia Hsin
AU - Wu, Chung Hsien
PY - 2005
Y1 - 2005
N2 - This paper presents a spoken document summarization scheme using acoustic, prosodic and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used for stressed word selection. Latent semantic indexing (LSI) is adopted to identify significant words. In addition, word trigram and semantic dependency are measured to include the syntactic and semantic information for speech summarization. The dynamic programming (DP) algorithm is used to find the best summarization result according to the summarization score estimated from the above five measures. Finally, the summarized result is presented by the concatenation of the summarized speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
AB - This paper presents a spoken document summarization scheme using acoustic, prosodic and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used for stressed word selection. Latent semantic indexing (LSI) is adopted to identify significant words. In addition, word trigram and semantic dependency are measured to include the syntactic and semantic information for speech summarization. The dynamic programming (DP) algorithm is used to find the best summarization result according to the summarization score estimated from the above five measures. Finally, the summarized result is presented by the concatenation of the summarized speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
UR - http://www.scopus.com/inward/record.url?scp=33750536804&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33750536804&partnerID=8YFLogxK
U2 - 10.1109/ICME.2005.1521453
DO - 10.1109/ICME.2005.1521453
M3 - Conference contribution
AN - SCOPUS:33750536804
SN - 0780393325
SN - 9780780393325
T3 - IEEE International Conference on Multimedia and Expo, ICME 2005
SP - 434
EP - 437
BT - IEEE International Conference on Multimedia and Expo, ICME 2005
T2 - IEEE International Conference on Multimedia and Expo, ICME 2005
Y2 - 6 July 2005 through 8 July 2005
ER -