Score-informed pitch-wise alignment using score-driven non-negative matrix factorization

Tien Ming Wang, Pei Yin Tsai, Alvin W.Y. Su

研究成果: Conference contribution

2 引文 斯高帕斯(Scopus)

摘要

This paper aims at the task of score alignment that aligns the audio recording and its corresponding score properly. Conventional methods are mostly hard to deal with the issue: the asynchrony in the recording of simultaneous notes in the score. In this paper, we propose a note-based score alignment by means of the pitch-by-time format called piano-roll feature. One of the manners is that we develop an approach of converting audio spectrogram to piano-roll-like feature. The score-driven non-negative matrix factorization is then involved in the conversion. Secondly, the pitch-wise alignment is proposed, considering each pitch sequence (i.e., the row of pianoroll) separately. In the results, about 88% of notes match their onsets deviated from ground truth for less than 50 msbased on MIDI-Aligned Piano Sounds(MAPS) database.

原文English
主出版物標題ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
頁面206-211
頁數6
DOIs
出版狀態Published - 2012 12月 1
事件2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012 - Shanghai, China
持續時間: 2012 7月 162012 7月 18

出版系列

名字ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings

Other

Other2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
國家/地區China
城市Shanghai
期間12-07-1612-07-18

All Science Journal Classification (ASJC) codes

  • 語言與語言學
  • 電腦視覺和模式識別

指紋

深入研究「Score-informed pitch-wise alignment using score-driven non-negative matrix factorization」主題。共同形成了獨特的指紋。

引用此