ANTI-Disinformation: An Adversarial Attack and Defense Network Towards Improved Robustness for Disinformation Detection on Social Media

Kuan Chun Chen, Chih Yao Chen, Cheng Te Li

研究成果: Conference contribution

1 引文 斯高帕斯(Scopus)

摘要

The prevalence of disinformation, which includes malformation (e.g., cyberbullying) and misinformation (e.g., fake news) in online platforms has raised significant concerns, prompting the need for robust detection methods to mitigate its detrimental impact. While the field of text classification has witnessed notable advancements in recent years, existing approaches often overlook the evolving nature of disinformation, wherein perpetrators employ perturbations to toxic content to evade detection or censorship. To address this challenge, we present a novel framework, Adversarial Network Towards Improved robustness for Disinformation detection (ANTI-Disinformation), which leverages reinforcement learning techniques as adversarial attacks. Additionally, we propose a defense model to enhance model's robustness against such attacks. To evaluate the effectiveness of our approach, we conduct extensive experiments on well-known disinformation datasets collected from multiple social media platforms. The results demonstrate our approach can effectively produce degradation in existing models' performance the most, showcasing the effectiveness of our framework and the vulnerability of existing detection systems. The results also exhibit that the proposed defense methods can consistently outperform existing typical methods in constructing robust detection models.

原文English
主出版物標題Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023
編輯Jingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal
發行者Institute of Electrical and Electronics Engineers Inc.
頁面5476-5484
頁數9
ISBN(電子)9798350324457
DOIs
出版狀態Published - 2023
事件2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy
持續時間: 2023 12月 152023 12月 18

出版系列

名字Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023

Conference

Conference2023 IEEE International Conference on Big Data, BigData 2023
國家/地區Italy
城市Sorrento
期間23-12-1523-12-18

All Science Journal Classification (ASJC) codes

  • 人工智慧
  • 電腦網路與通信
  • 電腦科學應用
  • 資訊系統
  • 資訊系統與管理
  • 安全、風險、可靠性和品質

指紋

深入研究「ANTI-Disinformation: An Adversarial Attack and Defense Network Towards Improved Robustness for Disinformation Detection on Social Media」主題。共同形成了獨特的指紋。

引用此