Frequency disentangled residual network

Satya Rajendra Singh, Roshan Reddy Yedla, Shiv Ram Dubey, Rakesh Kumar Sanodiya, Wei Ta Chu

研究成果: Article同行評審

摘要

Residual networks (ResNets) have been utilized for various computer vision and image processing applications. The residual connection improves the training of the network with better gradient flow. A residual block consists of a few convolutional layers having trainable parameters, which leads to overfitting. Moreover, the present residual networks are not able to utilize the high- and low-frequency information suitably, which also challenges the generalization capability of the network. In this paper, a frequency-disentangled residual network (FDResNet) is proposed to tackle these issues. Specifically, FDResNet includes separate connections in the residual block for low- and high-frequency components, respectively. Basically, the proposed model disentangles the low- and high-frequency components to increase the generalization ability. Moreover, the computation of low- and high-frequency components using fixed filters further avoids the overfitting. The proposed model is tested on benchmark CIFAR-10/100, Caltech, and TinyImageNet datasets for image classification. The performance of the proposed model is also tested in the image retrieval framework. It is noticed that the proposed model outperforms its counterpart residual model. The effect of kernel size and standard deviation is also evaluated. The impact of the frequency disentangling is also analyzed using a saliency map.

原文English
文章編號9
期刊Multimedia Systems
30
發行號1
DOIs
出版狀態Published - 2024 2月

All Science Journal Classification (ASJC) codes

  • 軟體
  • 資訊系統
  • 媒體技術
  • 硬體和架構
  • 電腦網路與通信

指紋

深入研究「Frequency disentangled residual network」主題。共同形成了獨特的指紋。

引用此