TY - GEN
T1 - Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild
AU - Hsu, Chih Chung
AU - Tsai, Chi Han
AU - Chen, Guan Lin
AU - Ma, Sin Di
AU - Tai, Shen Chieh
N1 - Funding Information:
Acknowledgement. This study was supported in part by the National Science and Technology Council, Taiwan, under Grants 110-2222-E-006 -012, 111-2221-E-006 -210, 111-2221-E-001-002, 111-2634-F-007-002. We thank to National Center for High-performance Computing (NCHC) for providing computational and storage resources.
Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Computed tomography (CT) imaging could be convenient for diagnosing various diseases. However, the CT images could be diverse since their resolution and number of slices are determined by the machine and its settings. Conventional deep learning models are hard to tickle such diverse data since the essential requirement of the deep neural network is the consistent shape of the input data in each dimension. A way to overcome this issue is based on the slice-level classifier and aggregating the predictions for each slice to make the final result. However, it lacks slice-wise feature learning, leading to suppressed performance. This paper proposes an effective spatial-slice feature learning (SSFL) to tickle this issue for COVID-19 symptom classification. First, the semantic feature embedding of each slice for a CT scan is extracted by a conventional 2D convolutional neural network (CNN) and followed by using the visual Transformer-based sub-network to deal with feature learning between slices, leading to joint feature representation. Then, an essential slices set algorithm is proposed to automatically select a subset of the CT scan, which could effectively remove the uncertain slices as well as improve the performance of our SSFL. Comprehensive experiments reveal that the proposed SSFL method shows not only excellent performance but also achieves stable detection results.
AB - Computed tomography (CT) imaging could be convenient for diagnosing various diseases. However, the CT images could be diverse since their resolution and number of slices are determined by the machine and its settings. Conventional deep learning models are hard to tickle such diverse data since the essential requirement of the deep neural network is the consistent shape of the input data in each dimension. A way to overcome this issue is based on the slice-level classifier and aggregating the predictions for each slice to make the final result. However, it lacks slice-wise feature learning, leading to suppressed performance. This paper proposes an effective spatial-slice feature learning (SSFL) to tickle this issue for COVID-19 symptom classification. First, the semantic feature embedding of each slice for a CT scan is extracted by a conventional 2D convolutional neural network (CNN) and followed by using the visual Transformer-based sub-network to deal with feature learning between slices, leading to joint feature representation. Then, an essential slices set algorithm is proposed to automatically select a subset of the CT scan, which could effectively remove the uncertain slices as well as improve the performance of our SSFL. Comprehensive experiments reveal that the proposed SSFL method shows not only excellent performance but also achieves stable detection results.
UR - http://www.scopus.com/inward/record.url?scp=85150688864&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85150688864&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-25082-8_42
DO - 10.1007/978-3-031-25082-8_42
M3 - Conference contribution
AN - SCOPUS:85150688864
SN - 9783031250811
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 621
EP - 634
BT - Computer Vision – ECCV 2022 Workshops, Proceedings
A2 - Karlinsky, Leonid
A2 - Michaeli, Tomer
A2 - Nishino, Ko
PB - Springer Science and Business Media Deutschland GmbH
T2 - 17th European Conference on Computer Vision, ECCV 2022
Y2 - 23 October 2022 through 27 October 2022
ER -