Abstract
This paper uses prosodic information, a distinctive and important feature of Mandarin speech, for utterance verification in Mandarin telephone speech. A two-stage strategy is adopted, with recognition followed by verification. For keyword recognition, 59 context-independent subsyllables (22 INITIALs and 37 FINALs of Mandarin speech) and one background/silence model are used as the basic recognition units. For utterance verification, 12 anti-subsyllable HMMs, 175 context-dependent prosodic HMMs, and five anti-prosodic HMMs are constructed. A keyword verification function combining phonetic-phase and prosodic-phase verification is investigated. On a test set of 2400 conversational speech utterances from 20 speakers (12 male and 8 female), the proposed verification method yielded a 17.8% false alarm rate at 8.5% false rejection. Furthermore, the method correctly rejected 90.4% of nonkeywords. Comparison with a baseline system without prosodic-phase verification shows that prosodic information benefits verification performance.
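The abstract does not give the form of the keyword verification function, so the following is only a minimal sketch of one common way such a two-phase check could be combined: a weighted sum of log-likelihood ratios against anti-models, compared with a threshold. The class `KeywordHypothesis`, the functions `verification_score`, `accept`, and `error_rates`, the weight `prosodic_weight`, and the threshold are all hypothetical names and values chosen for illustration, and the error-rate definitions are the conventional ones, which may differ from those used in the paper.

```python
from dataclasses import dataclass


@dataclass
class KeywordHypothesis:
    """Scores for one recognised keyword (illustrative, duration-normalised log-likelihoods)."""
    phonetic_ll: float       # score under the keyword's subsyllable HMMs
    anti_phonetic_ll: float  # score under the corresponding anti-subsyllable HMMs
    prosodic_ll: float       # score under the prosodic HMMs
    anti_prosodic_ll: float  # score under the anti-prosodic HMMs


def verification_score(h: KeywordHypothesis, prosodic_weight: float = 0.3) -> float:
    """Weighted sum of the phonetic-phase and prosodic-phase log-likelihood ratios.

    The linear combination and the default weight are assumptions,
    not the function reported in the paper.
    """
    phonetic_llr = h.phonetic_ll - h.anti_phonetic_ll
    prosodic_llr = h.prosodic_ll - h.anti_prosodic_ll
    return (1.0 - prosodic_weight) * phonetic_llr + prosodic_weight * prosodic_llr


def accept(h: KeywordHypothesis, threshold: float = 0.0, prosodic_weight: float = 0.3) -> bool:
    """Accept the keyword hypothesis when the combined score reaches the threshold."""
    return verification_score(h, prosodic_weight) >= threshold


def error_rates(keyword_scores, nonkeyword_scores, threshold):
    """Return (false_rejection_rate, false_alarm_rate) at a given threshold.

    False rejection: true keywords scored below the threshold.
    False alarm: nonkeywords scored at or above the threshold.
    """
    false_rejection = sum(s < threshold for s in keyword_scores) / len(keyword_scores)
    false_alarm = sum(s >= threshold for s in nonkeyword_scores) / len(nonkeyword_scores)
    return false_rejection, false_alarm
```

Sweeping `threshold` over a held-out set trades false rejections against false alarms, which is how an operating point such as the one quoted in the abstract would typically be chosen.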
| Original language | English |
|---|---|
| Pages (from-to) | 697-700 |
| Number of pages | 4 |
| Journal | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
| Volume | 2 |
| Publication status | Published - 1 Jan 1999 |
| Event | Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-99) - Phoenix, AZ, USA. Duration: 15 Mar 1999 → 19 Mar 1999 |
All Science Journal Classification (ASJC) codes
- Software
- Signal Processing
- Electrical and Electronic Engineering
Title: Utterance verification using prosodic information for Mandarin telephone speech keyword spotting