Rheumatoid arthritis (RA) is a chronic autoimmune rheumatic disease that can cause painful swelling in the joint lining, morning stiffness, and joint deformation/destruction. These symptoms decrease both quality of life and life expectancy. However, if RA can be diagnosed in the early stages, it can be controlled with pharmacotherapy. Although many studies have examined the possibility of early assessment and diagnosis, few have considered the relationship between significant risk factors and the early assessment of RA. In this paper, we present a novel framework for early RA assessment that utilizes data preprocessing, risk pattern mining, validation, and analysis. Under our proposed framework, two risk patterns can be discovered. Type I refers to well-known risk patterns that have been identified by existing studies, whereas Type II denotes unknown relationship risk patterns that have rarely or never been reported in the literature. These Type II patterns are very valuable in supporting novel hypotheses in clinical trials of RA, and constitute the main contribution of this work. To ensure the robustness of our experimental evaluation, we use a nationwide clinical database containing information on 1,314 RA-diagnosed patients over a 12-year fol-low-up period (1997-2008) and 965,279 non-RA patients. Our proposed framework is employed on this large-scale population-based dataset, and is shown to effectively discover rich RA risk patterns. These patterns may assist physicians in patient assessment, and enhance opportunities for early detection of RA. The proposed framework is broadly applicable to the mining of risk patterns for major disease assessments. This enables the identification of early risk patterns that are significantly associated with a target disease.
All Science Journal Classification (ASJC) codes