TY - GEN
T1 - The Node-Similarity Distribution of Complex Networks and Its Applications in Link Prediction (Extended Abstract)
AU - Pu, Cunlai
AU - Li, Jie
AU - Wang, Jian
AU - Quek, Tony Q.S.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Node-similarity distributions not only characterize different types of complex networks, but also offer insights in the structural predictability of complex networks, and even facilitate prediction tasks in complex networks. By means of the generating function, we propose a framework to calculate the common neighbor based similarity (CNS) distributions, offering theoretical results of similarity distributions of various complex networks. Furthermore, we apply node-similarity distributions to link prediction, a key task in network analysis. Specifically, by deriving analytical solutions for two metrics: i) precision and ii) area under the receiver operating characteristic curve (AUC), we give theoretical evaluation of link prediction. Also, by analyzing i) the expected prediction accuracy of similarity scores and ii) optimal prediction priority of unconnected node pairs, we optimize link prediction with similarity distributions. Simulation results confirm our findings and also validate the proposed methods for evaluating and optimizing link prediction.
AB - Node-similarity distributions not only characterize different types of complex networks, but also offer insights in the structural predictability of complex networks, and even facilitate prediction tasks in complex networks. By means of the generating function, we propose a framework to calculate the common neighbor based similarity (CNS) distributions, offering theoretical results of similarity distributions of various complex networks. Furthermore, we apply node-similarity distributions to link prediction, a key task in network analysis. Specifically, by deriving analytical solutions for two metrics: i) precision and ii) area under the receiver operating characteristic curve (AUC), we give theoretical evaluation of link prediction. Also, by analyzing i) the expected prediction accuracy of similarity scores and ii) optimal prediction priority of unconnected node pairs, we optimize link prediction with similarity distributions. Simulation results confirm our findings and also validate the proposed methods for evaluating and optimizing link prediction.
UR - http://www.scopus.com/inward/record.url?scp=85167664237&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85167664237&partnerID=8YFLogxK
U2 - 10.1109/ICDE55515.2023.00376
DO - 10.1109/ICDE55515.2023.00376
M3 - Conference contribution
AN - SCOPUS:85167664237
T3 - Proceedings - International Conference on Data Engineering
SP - 3899
EP - 3900
BT - Proceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
PB - IEEE Computer Society
T2 - 39th IEEE International Conference on Data Engineering, ICDE 2023
Y2 - 3 April 2023 through 7 April 2023
ER -