Personalized time-sync comment generation based on a multimodal transformer

Hei Chia Wang, Martinus Maslim, Wei Ting Hong

研究成果: Article同行評審

摘要

Online video entertainment has attracted large audiences and sustained viewing in various fields. With more than 4.5 billion Internet users worldwide, online video entertainment continues to be the most popular activity for users. Time synchronization comments (TSCs) are a new type of text information in videos. Unlike traditional online video-sharing platforms, where users can only leave comments in the comments section, TSCs can "fly through" the screen at each video playback time. However, the current research on TSC generation does not address the problem of personalization but only focuses on the relationship between images and TSC modalities. Therefore, we propose a multimodal transformer, personalized time-sync comment generation (PTSCG), to generate personalized TSCs. The generated TSCs are more suitable for different users. According to the experimental results, the F − 1 score evaluated for PTSCG after comparing the generated TSC with the original TSCs reached 0.58, which is better than those of other existing models, showing the effectiveness of the method proposed in this study.

原文English
文章編號105
期刊Multimedia Systems
30
發行號2
DOIs
出版狀態Published - 2024 4月

All Science Journal Classification (ASJC) codes

  • 軟體
  • 資訊系統
  • 媒體技術
  • 硬體和架構
  • 電腦網路與通信

指紋

深入研究「Personalized time-sync comment generation based on a multimodal transformer」主題。共同形成了獨特的指紋。

引用此