TY - GEN
T1 - Augmented-Training-Aware Bisenet for Real-Time Semantic Segmentation
AU - Hsu, Chih Chung
AU - Lee, Cheih
AU - Tai, Shen Chieh
AU - Jiang, Yun Zhong
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Semantic segmentation techniques have become an attractive research field for autonomous driving. However, it is well-known that the computational complexity of the conventional semantic segmentation is relatively high compared to other computer vision applications. Fast inference of the semantic segmentation for autonomous driving is highly desired. A lightweight convolutional neural network, the Bilateral segmentation network (BiSeNet), is adopted in this paper. However, the performance of the conventional BiSeNet is not so reliable that the model quantization could lead to an even worse result. Therefore, we proposed an augmented training strategy to significantly improve the semantic segmentation task's performance. First, heavy data augmentation, including CutOut, deformable distortion, and step-wise hard example mining, is used in the training phase to boost the performance of the feature representation learning. Second, the L1 and L2 norm regularization are also used in the model training to prevent the possible overfitting issue. Then, the post-quantization is performed on the TensorFlow-Lite model to significantly reduce the model size and computational complexity. The comprehensive experiments verified that the proposed method is effective and efficient for autonomous driving applications over other state-of-the-art methods.
AB - Semantic segmentation techniques have become an attractive research field for autonomous driving. However, it is well-known that the computational complexity of the conventional semantic segmentation is relatively high compared to other computer vision applications. Fast inference of the semantic segmentation for autonomous driving is highly desired. A lightweight convolutional neural network, the Bilateral segmentation network (BiSeNet), is adopted in this paper. However, the performance of the conventional BiSeNet is not so reliable that the model quantization could lead to an even worse result. Therefore, we proposed an augmented training strategy to significantly improve the semantic segmentation task's performance. First, heavy data augmentation, including CutOut, deformable distortion, and step-wise hard example mining, is used in the training phase to boost the performance of the feature representation learning. Second, the L1 and L2 norm regularization are also used in the model training to prevent the possible overfitting issue. Then, the post-quantization is performed on the TensorFlow-Lite model to significantly reduce the model size and computational complexity. The comprehensive experiments verified that the proposed method is effective and efficient for autonomous driving applications over other state-of-the-art methods.
UR - http://www.scopus.com/inward/record.url?scp=85138082122&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138082122&partnerID=8YFLogxK
U2 - 10.1109/ICMEW56448.2022.9859497
DO - 10.1109/ICMEW56448.2022.9859497
M3 - Conference contribution
AN - SCOPUS:85138082122
T3 - ICMEW 2022 - IEEE International Conference on Multimedia and Expo Workshops 2022, Proceedings
BT - ICMEW 2022 - IEEE International Conference on Multimedia and Expo Workshops 2022, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2022
Y2 - 18 July 2022 through 22 July 2022
ER -