Building robust models for small data containing nominal inputs and continuous outputs based on possibility distributions

Der Chiang Li, Qi Shi Shi, Hung Yu Chen

研究成果: Article

摘要

Learning with small data is challenging for most algorithms in regard to building statistically robust models. In previous studies, virtual sample generation (VSG) approaches have been verified as effective in terms of meeting this challenge. However, most VSG methods were developed for numerical inputs. Therefore, to address situations where data has nominal inputs and continuous outputs, a systemic VSG procedure is proposed to generate samples based on fuzzy techniques to further enhance modelling capability. Based on the concept of the data preprocess in the M5′ model tree, we reveal a useful procedure by which to extract the fuzzy relations between nominal inputs and continuous outputs. Further, with the idea of nonparametric operations, we employ trend similarity to present the fuzzy relations between inputs and outputs. Then, these relations are represented by possibility distributions, and sample candidates are created based on these distributions. Finally, the candidates filtered using α-cut are regarded as qualified virtual samples. In the experiments, we demonstrate the effectiveness of our approach through a comparison with two other VSG approaches using five public datasets and two prediction models. Moreover, three parameters used in our approaches are discussed. However, determining how to find the most fit parameters requires further study in the future.

原文English
頁(從 - 到)2805-2822
頁數18
期刊International Journal of Machine Learning and Cybernetics
10
發行號10
DOIs
出版狀態Published - 2019 十月 1

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

指紋 深入研究「Building robust models for small data containing nominal inputs and continuous outputs based on possibility distributions」主題。共同形成了獨特的指紋。

  • 引用此