PROBLEM TO BE SOLVED: To provide a method for screening samples for building a prediction model and a computer program product thereof. SOLUTION: When a set of new sample data is added to a dynamic moving window, a clustering step is performed with respect to all of the sets of sample data within the window for grouping the sets of sample data with similar properties as one group. Then, the number of sets of sample data in each group is inspected. If the number of the sets of sample data in the largest group is greater than a predetermined threshold, it means that there are too many sets of sample data with similar properties in the largest group, and the oldest sample data in the largest group can be deleted. If the number of the sets of sample data in the largest group is smaller than or equal to a predetermined threshold, it means that the sample data in the largest group are quite unique, and should be kept for building or refreshing the prediction model.
|Original language||Chinese (Traditional)|
|Publication status||Published - 1800|