TY - JOUR
T1 - Ontology-based speech act identification in a bilingual dialog system using partial pattern trees
AU - Yeh, Jui Feng
AU - Wu, Chung Hsien
AU - Chen, Ming Jun
PY - 2008/3
Y1 - 2008/3
N2 - This article presents a bilingual ontology-based dialog system with multiple services. An ontology-alignment algorithm is proposed to integrate ontologies of different languages for cross-language applications. A domain-specific ontology is further extracted from the bilingual ontology using an island-driven algorithm and a domain corpus. This study extracts the semantic words/concepts using latent semantic analysis (LSA). Based on the extracted semantic words and the domain ontology, a partial pattern tree is constructed to model the speech act of a spoken utterance. The partial pattern tree is used to deal with the ill-formed sentence problem in a spoken dialog system. Concept expansion based on domain ontology is also adopted to improve system performance. For performance evaluation, a medical dialog system with multiple services, including registration information, clinic information, and FAQ information, is implemented. Four performance measures were used separately for evaluation. The speech act identification rate was 86.2%. A task success rate of 77% was obtained. The contextual appropriateness of the system response was 78.5%. Finally, the rate for correct FAQ retrieval was 82%, an improvement of 15% over the keyword-based vector-space model. The results show the proposed ontology-based speech-act identification is effective for dialog management.
AB - This article presents a bilingual ontology-based dialog system with multiple services. An ontology-alignment algorithm is proposed to integrate ontologies of different languages for cross-language applications. A domain-specific ontology is further extracted from the bilingual ontology using an island-driven algorithm and a domain corpus. This study extracts the semantic words/concepts using latent semantic analysis (LSA). Based on the extracted semantic words and the domain ontology, a partial pattern tree is constructed to model the speech act of a spoken utterance. The partial pattern tree is used to deal with the ill-formed sentence problem in a spoken dialog system. Concept expansion based on domain ontology is also adopted to improve system performance. For performance evaluation, a medical dialog system with multiple services, including registration information, clinic information, and FAQ information, is implemented. Four performance measures were used separately for evaluation. The speech act identification rate was 86.2%. A task success rate of 77% was obtained. The contextual appropriateness of the system response was 78.5%. Finally, the rate for correct FAQ retrieval was 82%, an improvement of 15% over the keyword-based vector-space model. The results show the proposed ontology-based speech-act identification is effective for dialog management.
UR - http://www.scopus.com/inward/record.url?scp=41449086451&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=41449086451&partnerID=8YFLogxK
U2 - 10.1002/asi.20700
DO - 10.1002/asi.20700
M3 - Article
AN - SCOPUS:41449086451
SN - 1532-2882
VL - 59
SP - 684
EP - 694
JO - Journal of the American Society for Information Science and Technology
JF - Journal of the American Society for Information Science and Technology
IS - 5
ER -