TY - JOUR
T1 - Using Machine learning to predict blood glucose level based on Photoplethysmography
AU - Jian, Shi En
AU - Lo, Yu Lung
AU - Chuang, Yun Tzu
AU - Kuo, Shu Han
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/9/1
Y1 - 2025/9/1
N2 - Photoplethysmography (PPG) is a non-invasive technique used to monitor the tiny changes in blood vessels caused by the heartbeat. This study obtained PPG signals from a commercial PPG module, followed by Signal Quality Index (SQI) detection to assess the signal quality and further signal processing, such as filtering and baseline shift. As a result, a total of 36 features were extracted from PPG signals in both time and frequency domains, and a correlation matrix was used to inspect the correlation among features. Machine learning models, including Random Forest (RF) and eXtreme Gradient Boosting (XGBoost), were employed to train and predict blood glucose levels. Of the 36 features, 9 were selected for use as input data for red and infrared light signals. Four datasets, namely “Red”, “Infrared”, “Composite” (a combination of “Red” and “Infrared”), and “Modified Composite” (a refined version of the “Composite” dataset that reduces feature collinearity and enhances prediction accuracy), were investigated. The RF model trained with the “Modified Composite” dataset yielded the best prediction, with a Mean Absolute Relative Difference (MARD) of 5.15% and an R-value of 0.93.
AB - Photoplethysmography (PPG) is a non-invasive technique used to monitor the tiny changes in blood vessels caused by the heartbeat. This study obtained PPG signals from a commercial PPG module, followed by Signal Quality Index (SQI) detection to assess the signal quality and further signal processing, such as filtering and baseline shift. As a result, a total of 36 features were extracted from PPG signals in both time and frequency domains, and a correlation matrix was used to inspect the correlation among features. Machine learning models, including Random Forest (RF) and eXtreme Gradient Boosting (XGBoost), were employed to train and predict blood glucose levels. Of the 36 features, 9 were selected for use as input data for red and infrared light signals. Four datasets, namely “Red”, “Infrared”, “Composite” (a combination of “Red” and “Infrared”), and “Modified Composite” (a refined version of the “Composite” dataset that reduces feature collinearity and enhances prediction accuracy), were investigated. The RF model trained with the “Modified Composite” dataset yielded the best prediction, with a Mean Absolute Relative Difference (MARD) of 5.15% and an R-value of 0.93.
UR - https://www.scopus.com/pages/publications/105001866611
UR - https://www.scopus.com/pages/publications/105001866611#tab=citedBy
U2 - 10.1016/j.measurement.2025.117421
DO - 10.1016/j.measurement.2025.117421
M3 - Article
AN - SCOPUS:105001866611
SN - 0263-2241
VL - 253
JO - Measurement: Journal of the International Measurement Confederation
JF - Measurement: Journal of the International Measurement Confederation
M1 - 117421
ER -