TY - JOUR
T1 - Note-based alignment using score-driven non-negative matrix factorisation for audio recordings
AU - Wang, Tien Ming
AU - Tsai, Pei Yin
AU - Su, Alvin W.Y.
PY - 2014
Y1 - 2014
N2 - This study addresses the task of score alignment, which aligns an audio recording with its corresponding score. Conventional methods have difficulty performing this task because notes that are simultaneous in the score are often played asynchronously in the recording. A note-based score alignment is proposed that is based on a pitch-by-time feature, called the piano-roll feature, together with an approach for converting the audio spectrogram to this piano-roll-like feature. Score-driven non-negative matrix factorisation is adopted for the transformation. Furthermore, this study proposes pitch-wise alignment, which considers each pitch sequence (i.e. each row of the piano roll) separately. Results based on the MIDI-Aligned Piano Sounds database show that approximately 88% of notes have onsets that deviate from the ground truth by less than 50 ms. Further results are presented for the SCREAM Music Annotation Project database, a manual annotation project of commercial CD recordings.
AB - This study addresses the task of score alignment, which aligns an audio recording with its corresponding score. Conventional methods have difficulty performing this task because notes that are simultaneous in the score are often played asynchronously in the recording. A note-based score alignment is proposed that is based on a pitch-by-time feature, called the piano-roll feature, together with an approach for converting the audio spectrogram to this piano-roll-like feature. Score-driven non-negative matrix factorisation is adopted for the transformation. Furthermore, this study proposes pitch-wise alignment, which considers each pitch sequence (i.e. each row of the piano roll) separately. Results based on the MIDI-Aligned Piano Sounds database show that approximately 88% of notes have onsets that deviate from the ground truth by less than 50 ms. Further results are presented for the SCREAM Music Annotation Project database, a manual annotation project of commercial CD recordings.
UR - http://www.scopus.com/inward/record.url?scp=84892951364&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84892951364&partnerID=8YFLogxK
U2 - 10.1049/iet-spr.2012.0157
DO - 10.1049/iet-spr.2012.0157
M3 - Article
AN - SCOPUS:84892951364
SN - 1751-9675
VL - 8
SP - 1
EP - 9
JO - IET Signal Processing
JF - IET Signal Processing
IS - 1
ER -