Pseudo Triplet Networks for Classification Tasks with Cross-Source Feature Incompleteness

Cayon Liow, Cheng Te Li, Chun Pai Yang, Shou De Lin

研究成果: Conference contribution

摘要

Cross-source feature incompleteness - a scenario where certain features are only available in one data source but missing in another - is a common and significant challenge in machine learning. It typically arises in situations where the training data and testing data are collected from different sources with distinct feature sets. Addressing this challenge has the potential to greatly improve the utility of valuable datasets that might otherwise be considered incomplete and enhance model performance. This paper introduces the novel Pseudo Triplet Network (PTN) to address cross-source feature incompleteness. PTN fuses two Siamese network architectures - Triplet Networks and Pseudo Networks. By segregating data into instance, positive, and negative subsets, PTN facilitates effectively contrastive learning through a hybrid loss function design. The model was rigorously evaluated on six benchmark datasets from the UCI Repository, in comparison with five other methods for managing missing data, under a range of feature overlap and missing data scenarios. The PTN consistently exhibited superior performance, displaying resilience in high missing ratio situations and maintaining robust stability across various data scenarios.

原文English
主出版物標題CIKM 2023 - Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
發行者Association for Computing Machinery
頁面4079-4083
頁數5
ISBN(電子)9798400701245
DOIs
出版狀態Published - 2023 10月 21
事件32nd ACM International Conference on Information and Knowledge Management, CIKM 2023 - Birmingham, United Kingdom
持續時間: 2023 10月 212023 10月 25

出版系列

名字International Conference on Information and Knowledge Management, Proceedings

Conference

Conference32nd ACM International Conference on Information and Knowledge Management, CIKM 2023
國家/地區United Kingdom
城市Birmingham
期間23-10-2123-10-25

All Science Journal Classification (ASJC) codes

  • 一般商業,管理和會計
  • 一般決策科學

指紋

深入研究「Pseudo Triplet Networks for Classification Tasks with Cross-Source Feature Incompleteness」主題。共同形成了獨特的指紋。

引用此