TY - JOUR
T1 - Fuzzy Optimization Feature Fusion for Enhanced Fine-Grained Visual Classification in Sustainable Manufacturing using Vision Transformer
AU - Lai, Chin Feng
AU - Lai, Yi Wei
AU - Chen, Shih Yeh
AU - Lee, Chi Hsuan
AU - Chen, Mu Yen
N1 - Publisher Copyright:
© 1993-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Fine-grained visual classification (FGVC) in sustainable manufacturing faces challenges due to the diverse, complex, and highly similar objects in manufacturing environments. Traditional convolutional neural networks (CNNs) often require extensive annotations and high computational costs, limiting their effectiveness. This study introduces a Fuzzy Optimization Feature Fusion Model (FOFFM) based on Vision Transformer, designed to enhance FGVC accuracy and efficiency. FOFFM addresses challenges such as information loss during image-to-token mapping and high category similarity by optimizing classification token capabilities and leveraging Contrastive Loss. By enhancing resource efficiency and reducing redundant computations, FOFFM contributes to lower energy consumption and operational costs, directly supporting sustainable manufacturing practices. Experimental results on the NABirds dataset demonstrate FOFFM's competitive performance with a streamlined, resource-efficient end-to-end training process. Unlike other methods, such as TransFG, FOFFM reduces computational complexity while maintaining robust accuracy, making it highly suitable for practical applications in sustainable manufacturing, particularly in optimizing resource utilization and minimizing environmental impact. This work provides valuable insights for manufacturing data analysis and contributes to advancing FGVC in sustainable manufacturing contexts.
AB - Fine-grained visual classification (FGVC) in sustainable manufacturing faces challenges due to the diverse, complex, and highly similar objects in manufacturing environments. Traditional convolutional neural networks (CNNs) often require extensive annotations and high computational costs, limiting their effectiveness. This study introduces a Fuzzy Optimization Feature Fusion Model (FOFFM) based on Vision Transformer, designed to enhance FGVC accuracy and efficiency. FOFFM addresses challenges such as information loss during image-to-token mapping and high category similarity by optimizing classification token capabilities and leveraging Contrastive Loss. By enhancing resource efficiency and reducing redundant computations, FOFFM contributes to lower energy consumption and operational costs, directly supporting sustainable manufacturing practices. Experimental results on the NABirds dataset demonstrate FOFFM's competitive performance with a streamlined, resource-efficient end-to-end training process. Unlike other methods, such as TransFG, FOFFM reduces computational complexity while maintaining robust accuracy, making it highly suitable for practical applications in sustainable manufacturing, particularly in optimizing resource utilization and minimizing environmental impact. This work provides valuable insights for manufacturing data analysis and contributes to advancing FGVC in sustainable manufacturing contexts.
UR - https://www.scopus.com/pages/publications/105001519305
UR - https://www.scopus.com/pages/publications/105001519305#tab=citedBy
U2 - 10.1109/TFUZZ.2025.3555523
DO - 10.1109/TFUZZ.2025.3555523
M3 - Article
AN - SCOPUS:105001519305
SN - 1063-6706
JO - IEEE Transactions on Fuzzy Systems
JF - IEEE Transactions on Fuzzy Systems
ER -