Spoken document retrieval using multilevel knowledge and semantic verification

Chien Lin Huang, Chung Hsien Wu

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)


This study presents a novel approach to spoken document retrieval based on multilevel knowledge indexing and semantic verification. Multilevel knowledge indexing considers three information sources, namely transcription data, keywords extracted from spoken documents, and hypernyms of the extracted keywords. A semantic network with forward-backward propagation is presented for semantic verification of the retrieved documents. In the forward step for semantic verification, a bag of keywords is chosen based on word significance measures. Semantic relations are estimated and adopted for verification in the backward procedure. The verification score is then utilized to weight and rerank the retrieved documents to obtain the final results. Experiments are performed on 40 h of anchor speech extracted from 198 h of collected broadcast news. Experimental results indicate that multilevel knowledge indexing and semantic verification achieve better retrieval results than other indexing schemes.

Original languageEnglish
Article number4317560
Pages (from-to)2551-2560
Number of pages10
JournalIEEE Transactions on Audio, Speech and Language Processing
Issue number8
Publication statusPublished - 2007 Sep

All Science Journal Classification (ASJC) codes

  • Acoustics and Ultrasonics
  • Electrical and Electronic Engineering


Dive into the research topics of 'Spoken document retrieval using multilevel knowledge and semantic verification'. Together they form a unique fingerprint.

Cite this