Our study’s main purpose is to emphasise the significance of medical knowledge of pathophysiology before machine learning. We investigated whether combining domain knowledge with machine learning results might increase accuracy and minimise the number of bio-features used to detect obstructive sleep apnea (OSA). The present study analysed data on 36 self-reported symptoms and 24 clinical features obtained from 3,495 patients receiving polysomnography at a regional hospital and a medical centre. The area under the receiver operating characteristic (AUC) curve was used to evaluate patients with and without moderate or severe OSA using three prediction models on the basis of various estimation methods: the multiple logistic regression (MLR), support vector machine (SVM), and neural network (NN) methods. Odds ratios stratified by gender and age were also measured to account for clinicians’ common sense. We discovered that adding the self-reported snoring item improved the AUC by 0.01–0.10 and helped us to rapidly achieve the optimum level. The performance of four items (gender, age, body mass index [BMI], and snoring) was comparable with that of adding two or more items (neck and waist circumference) for predicting moderate to severe OSA (Apnea–Hypopnea Index ≥15 events/hr) in all three prediction models, demonstrating the medical knowledge value of pathophysiology. The four-item test sample AUCs were 0.83, 0.84, and 0.83 for MLR, SVM, and NN, respectively. Participants with regular snoring and a BMI of ≥25 kg/m2 had a greater chance of moderate to severe OSA according to the stratified adjusted odds ratios. Combining domain knowledge into machine learning could increase efficiency and enable primary care physicians to refer for an OSA diagnosis earlier.
All Science Journal Classification (ASJC) codes
- Cognitive Neuroscience
- Behavioral Neuroscience