Swift Concurrent Semantic Segmentation and Object Detection on Edge Devices

Chih Chung Hsu, Yun Zhong Jiang, Wei Hao Huang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We propose a real-time network optimized for joint semantic segmentation and object detection on edge devices. Our architecture builds on the latest YOLO series network and incorporates lightweight segmentation sub-networks for multi-task learning. Specifically, we leverage layers two to four of the YOLO network, which contain substantial semantic information at varying resolutions, to segment objects of diverse sizes. We introduce the Parallel Aggregation Pyramid Pooling Module (PAPPM) to efficiently generate buffered semantic segmentation feature maps by utilizing single-point addition and residual learning. This approach reduces computational complexity and memory usage without compromising accuracy. We also propose a novel Progressively Iterative Learning (PIL) approach to learn the weights for the backbone, neck, and multi-task heads, respectively, without catastrophic forgetting. Our approach achieves state-of-the-art performance on benchmark datasets, demonstrating the effectiveness of our proposed techniques.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages40-45
Number of pages6
ISBN (Electronic)9798350313154
DOIs
Publication statusPublished - 2023
Event2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023 - Brisbane, Australia
Duration: 2023 Jul 102023 Jul 14

Publication series

NameProceedings - 2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023

Conference

Conference2023 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2023
Country/TerritoryAustralia
CityBrisbane
Period23-07-1023-07-14

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Media Technology
  • Control and Optimization

Fingerprint

Dive into the research topics of 'Swift Concurrent Semantic Segmentation and Object Detection on Edge Devices'. Together they form a unique fingerprint.

Cite this