Countermeasure of Polluting Health-Related Dataset for Data Mining

I-Hsien Liu, Jung Shian Li, Yen Chu Peng, Meng Huan Lee, Chuan Gang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Nowadays, machine learning is widely used in a variety of applications, but it still faces many security challenges. Among them, the security of dataset is particularly important, because the data set is the key factor in achieving high correctness for machine learning. Recently, it becomes more difficult for an attacker to directly modify or attack the machine learning models because these models are setup usually in a well-known and well-designed format. However, the attackers can easily manipulate the dataset in various ways. Therefore, we develop countermeasures of polluting o a health-related dataset for data mining, which is robust Data Washing, an algorithm based on denoising autoencoder. It effectively alleviates damages to datasets caused by poisoning attack. We implement several DNN models for different datasets. The proposed Our robust Data Washing algorithm efficiently recovers the poisoning dataset and detect several attacks with a high accuracy rate.

Original languageEnglish
Title of host publicationProceedings of the 2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2022
EditorsTeen-Hang Meen
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages152-155
Number of pages4
ISBN (Electronic)9781728195797
DOIs
Publication statusPublished - 2022
Event4th IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2022 - Tainan, Taiwan
Duration: 2022 May 272022 May 29

Publication series

NameProceedings of the 2022 IEEE 4th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2022

Conference

Conference4th IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2022
Country/TerritoryTaiwan
CityTainan
Period22-05-2722-05-29

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Information Systems and Management
  • Renewable Energy, Sustainability and the Environment
  • Biomedical Engineering
  • Health Informatics
  • Health(social science)

Fingerprint

Dive into the research topics of 'Countermeasure of Polluting Health-Related Dataset for Data Mining'. Together they form a unique fingerprint.

Cite this