TY - GEN
T1 - Spoken document summarization using topic-related corpus and semantic dependency grammar
AU - Hsieh, Chia Hsin
AU - Huang, Chien Lin
AU - Wu, Chung Hsien
PY - 2004
Y1 - 2004
N2 - This study presents a spoken document summarization scheme using a topic-related corpus and semantic dependency grammars. The summarization score combines speech recognition confidence, word significance, word trigram, semantic dependency grammar (SDG), and probabilistic context-free grammar (PCFG) scores. In addition, a topic-related corpus consisting of keywords and articles is used to estimate the word significance score with latent semantic indexing (LSI). Semantic relations between words are determined by the SDG using HowNet and the Sinica Treebank. A dynamic programming algorithm is applied to determine the summarization ratio and to search for the best summary according to the summarization scores. Experimental results indicate that the proposed approach effectively extracts important words with semantic dependency and yields a promising speech summary.
AB - This study presents a spoken document summarization scheme using a topic-related corpus and semantic dependency grammars. The summarization score combines speech recognition confidence, word significance, word trigram, semantic dependency grammar (SDG), and probabilistic context-free grammar (PCFG) scores. In addition, a topic-related corpus consisting of keywords and articles is used to estimate the word significance score with latent semantic indexing (LSI). Semantic relations between words are determined by the SDG using HowNet and the Sinica Treebank. A dynamic programming algorithm is applied to determine the summarization ratio and to search for the best summary according to the summarization scores. Experimental results indicate that the proposed approach effectively extracts important words with semantic dependency and yields a promising speech summary.
UR - http://www.scopus.com/inward/record.url?scp=21444456599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=21444456599&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:21444456599
SN - 0780386787
T3 - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
SP - 333
EP - 336
BT - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
T2 - 2004 International Symposium on Chinese Spoken Language Processing
Y2 - 15 December 2004 through 18 December 2004
ER -