Improved U-Net Based on Dual Attention Mechanism for Glottis Segmentation and Dysphagia Auxiliary Diagnosis

Shih Hsiung Lee, Jui Chung Ni, Yen Cheng Shen, Hsuan Chih Ku, Chu Sing Yang, Ko Wei Huang, Chun Hao Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In today’s aging society, the proportion of elderly population is increasing year by year, and providing comprehensive care for the elderly has become an important issue. Among many aging diseases, dysphagia is a health threat that we often overlook, which if not detected and treated in time, can lead to aspiration pneumonia. Currently, the main detection method is usually through imaging of the throat and judgment by doctors. However, inexperienced doctors may make misjudgments. In order to avoid such situations, this study hopes to assist doctors in their diagnosis through effective image semantic segmentation technology. In the field of medical image semantic segmentation, the U-Net architecture has been proven to be a successful image segmentation architecture. The encoder-decoder technology in U-Net can effectively extract features and restore the original image. However, U-Net may lose important features during the downsampling process of feature extraction. Therefore, this study added a dual attention mechanism in the encoder, which effectively captures important features through position attention and channel attention in the image. In addition to the dual attention mechanism, this study added ResNet blocks in each encoder and decoder block to preserve feature information between downsampling and upsampling. Finally, this paper proves the effectiveness of these mechanisms through experiments and obtains good results.

Original languageEnglish
Title of host publicationRecent Challenges in Intelligent Information and Database Systems - 15th Asian Conference, ACIIDS 2023, Proceedings
EditorsNgoc Thanh Nguyen, Siridech Boonsang, Kitsuchart Pasupa, Hamido Fujita, Bogumiła Hnatkowska, Tzung-Pei Hong, Ali Selamat
PublisherSpringer Science and Business Media Deutschland GmbH
Pages234-243
Number of pages10
ISBN (Print)9783031424298
DOIs
Publication statusPublished - 2023
Event15th International scientific conferences on research and applications in the field of intelligent information and database systems, ACIIDS 2023 - Phuket, Thailand
Duration: 2023 Jul 242023 Jul 26

Publication series

NameCommunications in Computer and Information Science
Volume1863 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference15th International scientific conferences on research and applications in the field of intelligent information and database systems, ACIIDS 2023
Country/TerritoryThailand
CityPhuket
Period23-07-2423-07-26

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Improved U-Net Based on Dual Attention Mechanism for Glottis Segmentation and Dysphagia Auxiliary Diagnosis'. Together they form a unique fingerprint.

Cite this