Using Grammatical and Semantic Correction Model to Improve Chinese-to-Taiwanese Machine Translation Fluency

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Currently, there are three major issues to tackle in Chinese-to-Taiwanese machine translation: multi-pronunciation Taiwanese words, unknown words, and Chinese-to-Taiwanese grammatical and semantic transformation. Recent studies have mostly focused on the issues of multi-pronunciation Taiwanese words and unknown words, while very few research papers focus on grammatical and semantic transformation. However, there exist grammatical rules exclusive to Taiwanese that, if not translated properly, would cause the result to feel unnatural to native speakers and potentially twist the original meaning of the sentence, even with the right words and pronunciations. Therefore, this study collects and organizes a few common Taiwanese sentence structures and grammar rules, then creates a grammar and semantic correction model for Chinese-to-Taiwanese machine translation, which would detect and correct grammatical and semantic discrepancies between the two languages, thus improving translation fluency.

Original languageEnglish
Title of host publicationROCLING 2022 - Proceedings of the 34th Conference on Computational Linguistics and Speech Processing
EditorsYung-Chun Chang, Yi-Chin Huang, Jheng-Long Wu, Ming-Hsiang Su, Hen-Hsen Huang, Yi-Fen Liu, Lung-Hao Lee, Chin-Hung Chou, Yuan-Fu Liao
PublisherThe Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
Pages75-83
Number of pages9
ISBN (Electronic)9789869576956
Publication statusPublished - 2022
Event34th Conference on Computational Linguistics and Speech Processing, ROCLING 2022 - Taipei, Taiwan
Duration: 2022 Nov 212022 Nov 22

Publication series

NameROCLING 2022 - Proceedings of the 34th Conference on Computational Linguistics and Speech Processing

Conference

Conference34th Conference on Computational Linguistics and Speech Processing, ROCLING 2022
Country/TerritoryTaiwan
CityTaipei
Period22-11-2122-11-22

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Speech and Hearing

Fingerprint

Dive into the research topics of 'Using Grammatical and Semantic Correction Model to Improve Chinese-to-Taiwanese Machine Translation Fluency'. Together they form a unique fingerprint.

Cite this