TY - JOUR
T1 - Automatic meeting summarization and topic detection system
AU - Huang, Tai Chia
AU - Hsieh, Chia Hsuan
AU - Wang, Hei Chia
N1 - Funding Information:
The research was based on the work supported by the Taiwan Ministry of Science and Technology under Grant No. MOST 103-2410-H-006-055-MY3.
Publisher Copyright:
© 2018, Emerald Publishing Limited.
PY - 2018/8/20
Y1 - 2018/8/20
N2 - Purpose: Producing meeting documents requires an instantaneous recorder during meetings, which costs extra human resources and takes time to amend the file. However, a high-quality meeting document can enable users to recall the meeting content efficiently. The paper aims to discuss these issues. Design/methodology/approach: An application based on this framework is developed to help the users find topics and obtain summarizations of meeting contents without extra effort. This app uses the Bluemix speech recognizer to obtain speech transcripts. It then combines latent Dirichlet allocation and a TextTiling algorithm with the speech script of meetings to detect boundaries between different topics and evaluate the topics in each segment. TextTeaser, an open API based on a feature-based approach, is then used to summarize the speech transcripts. Findings: The results indicate that the summaries generated by the machine are 85 percent similar to the records written by humankind. Originality/value: To reduce the human effort in generating meeting reports, this paper presents a framework to record and analyze meeting contents automatically by voice recognition, topic detection, and extractive summarization.
AB - Purpose: Producing meeting documents requires an instantaneous recorder during meetings, which costs extra human resources and takes time to amend the file. However, a high-quality meeting document can enable users to recall the meeting content efficiently. The paper aims to discuss these issues. Design/methodology/approach: An application based on this framework is developed to help the users find topics and obtain summarizations of meeting contents without extra effort. This app uses the Bluemix speech recognizer to obtain speech transcripts. It then combines latent Dirichlet allocation and a TextTiling algorithm with the speech script of meetings to detect boundaries between different topics and evaluate the topics in each segment. TextTeaser, an open API based on a feature-based approach, is then used to summarize the speech transcripts. Findings: The results indicate that the summaries generated by the machine are 85 percent similar to the records written by humankind. Originality/value: To reduce the human effort in generating meeting reports, this paper presents a framework to record and analyze meeting contents automatically by voice recognition, topic detection, and extractive summarization.
UR - http://www.scopus.com/inward/record.url?scp=85046016842&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046016842&partnerID=8YFLogxK
U2 - 10.1108/DTA-09-2017-0062
DO - 10.1108/DTA-09-2017-0062
M3 - Article
AN - SCOPUS:85046016842
SN - 2514-9288
VL - 52
SP - 351
EP - 365
JO - Data Technologies and Applications
JF - Data Technologies and Applications
IS - 3
ER -