TY - GEN
T1 - Score-informed pitch-wise alignment using score-driven non-negative matrix factorization
AU - Wang, Tien Ming
AU - Tsai, Pei Yin
AU - Su, Alvin W.Y.
PY - 2012/12/1
Y1 - 2012/12/1
N2 - This paper addresses score alignment, the task of aligning an audio recording with its corresponding score. Most conventional methods have difficulty handling asynchrony in the recording among notes that are simultaneous in the score. We propose a note-based score alignment method built on a pitch-by-time representation called the piano-roll feature. First, we develop an approach for converting the audio spectrogram into a piano-roll-like feature, using score-driven non-negative matrix factorization in the conversion. Second, we propose pitch-wise alignment, which treats each pitch sequence (i.e., each row of the piano roll) separately. In experiments on the MIDI-Aligned Piano Sounds (MAPS) database, about 88% of notes have onsets that deviate from the ground truth by less than 50 ms.
AB - This paper addresses score alignment, the task of aligning an audio recording with its corresponding score. Most conventional methods have difficulty handling asynchrony in the recording among notes that are simultaneous in the score. We propose a note-based score alignment method built on a pitch-by-time representation called the piano-roll feature. First, we develop an approach for converting the audio spectrogram into a piano-roll-like feature, using score-driven non-negative matrix factorization in the conversion. Second, we propose pitch-wise alignment, which treats each pitch sequence (i.e., each row of the piano roll) separately. In experiments on the MIDI-Aligned Piano Sounds (MAPS) database, about 88% of notes have onsets that deviate from the ground truth by less than 50 ms.
UR - http://www.scopus.com/inward/record.url?scp=84872133273&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872133273&partnerID=8YFLogxK
U2 - 10.1109/ICALIP.2012.6376613
DO - 10.1109/ICALIP.2012.6376613
M3 - Conference contribution
AN - SCOPUS:84872133273
SN - 9781467301718
T3 - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
SP - 206
EP - 211
BT - ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
T2 - 2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
Y2 - 16 July 2012 through 18 July 2012
ER -