跳至主導覽 跳至搜尋 跳過主要內容

A Mamba-Inspired Linear Vision Transformer with Spatial and Channel Reconstruction MLP for Roasted Coffee Bean Recognition

研究成果: Conference contribution

摘要

With rising consumer demands for coffee flavor and quality, quality control in coffee production processes has become increasingly important. Therefore, reducing labor costs during production while maintaining the consistency of coffee bean quality presents a significant challenge. To address this issue, this study proposes a Vision Transformer (ViT) architecture that incorporates a Spatial and Channel reconstruction Attention (SCA) block to assess the roasting levels of coffee beans. According to the experiment results, the proposed method achieves an F1-score of 99.79% on the roasted coffee bean database. Moreover, the method contains only 0.46 million parameters, making it suitable for deployment on lowcost embedded platforms.

原文English
主出版物標題IS3C 2025 - International Symposium on Computer, Consumer and Control
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798331587000
DOIs
出版狀態Published - 2025
事件7th International Symposium on Computer, Consumer and Control, IS3C 2025 - Taichung, Taiwan
持續時間: 2025 6月 272025 6月 30

出版系列

名字IS3C 2025 - International Symposium on Computer, Consumer and Control

Conference

Conference7th International Symposium on Computer, Consumer and Control, IS3C 2025
國家/地區Taiwan
城市Taichung
期間25-06-2725-06-30

All Science Journal Classification (ASJC) codes

  • 人工智慧
  • 電腦科學應用
  • 電腦視覺和模式識別
  • 人機介面
  • 控制和優化

引用此