TY - GEN
T1 - Popularity prediction of social media based on multi-modal feature mining
AU - Hsu, Chih Chung
AU - Lee, Jun Yi
AU - Kang, Li Wei
AU - Zhang, Zhong Xuan
AU - Lee, Chia Yen
AU - Wu, Shao Min
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery. ACM ISBN 978-1-4503-6889-6/19/10...$15.00
PY - 2019/10/15
Y1 - 2019/10/15
N2 - Popularity prediction of social media becomes a more attractive issue in recent years. It consists of multi-type data sources such as image, meta-data, and text information. In order to effectively predict the popularity of a specified post in the social network, fusing multi-feature from heterogeneous data is required. In this paper, a popularity prediction framework for social media based on multi-modal feature mining is presented. First, we discover image semantic features by extracting their image descriptions generated by image captioning. Second, an effective text-based feature engineering is used to construct an effective word-to-vector model. The trained word-to-vector model is used to encode the text information and the semantic image features. Finally, an ensemble regression approach is proposed to aggregate these encoded features and learn the final regressor. Extensive experiments show that the proposed method significantly outperforms other state-of-the-art regression models. We also show that the multi-modal approach could effectively improve the performance in the social media prediction challenge.
AB - Popularity prediction of social media becomes a more attractive issue in recent years. It consists of multi-type data sources such as image, meta-data, and text information. In order to effectively predict the popularity of a specified post in the social network, fusing multi-feature from heterogeneous data is required. In this paper, a popularity prediction framework for social media based on multi-modal feature mining is presented. First, we discover image semantic features by extracting their image descriptions generated by image captioning. Second, an effective text-based feature engineering is used to construct an effective word-to-vector model. The trained word-to-vector model is used to encode the text information and the semantic image features. Finally, an ensemble regression approach is proposed to aggregate these encoded features and learn the final regressor. Extensive experiments show that the proposed method significantly outperforms other state-of-the-art regression models. We also show that the multi-modal approach could effectively improve the performance in the social media prediction challenge.
UR - http://www.scopus.com/inward/record.url?scp=85074816167&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074816167&partnerID=8YFLogxK
U2 - 10.1145/3343031.3356064
DO - 10.1145/3343031.3356064
M3 - Conference contribution
AN - SCOPUS:85074816167
T3 - MM 2019 - Proceedings of the 27th ACM International Conference on Multimedia
SP - 2687
EP - 2691
BT - MM 2019 - Proceedings of the 27th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 27th ACM International Conference on Multimedia, MM 2019
Y2 - 21 October 2019 through 25 October 2019
ER -