TY - GEN
T1 - Tackling the achilles heel of social networks
T2 - 24th International Conference on World Wide Web, WWW 2015
AU - Yan, Rui
AU - Yen, Ian E.H.
AU - Li, Cheng Te
AU - Zhao, Shiqi
AU - Hu, Xiaohua
PY - 2015/5/18
Y1 - 2015/5/18
N2 - Online social networks nowadays enjoy their worldwide prosperity, as they have revolutionized the way for people to discover, to share, and to distribute information. With millions of registered users and the proliferation of user-generated contents, the social networks be-come "giants", likely eligible to carry on any research tasks. How-ever, the giants do have their Achilles Heel: extreme data sparsity. Compared with the massive data over the whole collection, individ-ual posting documents, (e.g., a microblog less than 140 characters), seem to be too sparse to make a difference under various research scenarios, while actually they are different. In this paper we pro-pose to tackle the Achilles Heel of social networks by smoothing the language model via influence propagation. We formulate a so-cialized factor graph model, which utilizes both the textual corre-lations between document pairs and the socialized augmentation networks behind the documents, such as user relationships and so-cial interactions. These factors are modeled as attributes and de-pendencies among documents and their corresponding users. An efficient algorithm is designed to learn the proposed factor graph model. Finally we propagate term counts to smooth documents based on the estimated influence. Experimental results on Twitter and Weibo datasets validate the effectiveness of the proposed mod-el. By leveraging the smoothed language model with social factors, our approach obtains significant improvement over several alterna-tive methods on both intrinsic and extrinsic evaluations measured in terms of perplexity, nDCG and MAP results.
AB - Online social networks nowadays enjoy their worldwide prosperity, as they have revolutionized the way for people to discover, to share, and to distribute information. With millions of registered users and the proliferation of user-generated contents, the social networks be-come "giants", likely eligible to carry on any research tasks. How-ever, the giants do have their Achilles Heel: extreme data sparsity. Compared with the massive data over the whole collection, individ-ual posting documents, (e.g., a microblog less than 140 characters), seem to be too sparse to make a difference under various research scenarios, while actually they are different. In this paper we pro-pose to tackle the Achilles Heel of social networks by smoothing the language model via influence propagation. We formulate a so-cialized factor graph model, which utilizes both the textual corre-lations between document pairs and the socialized augmentation networks behind the documents, such as user relationships and so-cial interactions. These factors are modeled as attributes and de-pendencies among documents and their corresponding users. An efficient algorithm is designed to learn the proposed factor graph model. Finally we propagate term counts to smooth documents based on the estimated influence. Experimental results on Twitter and Weibo datasets validate the effectiveness of the proposed mod-el. By leveraging the smoothed language model with social factors, our approach obtains significant improvement over several alterna-tive methods on both intrinsic and extrinsic evaluations measured in terms of perplexity, nDCG and MAP results.
UR - http://www.scopus.com/inward/record.url?scp=84944137957&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84944137957&partnerID=8YFLogxK
U2 - 10.1145/2736277.2741673
DO - 10.1145/2736277.2741673
M3 - Conference contribution
AN - SCOPUS:84944137957
T3 - WWW 2015 - Proceedings of the 24th International Conference on World Wide Web
SP - 1318
EP - 1328
BT - WWW 2015 - Proceedings of the 24th International Conference on World Wide Web
PB - Association for Computing Machinery, Inc
Y2 - 18 May 2015 through 22 May 2015
ER -