Score-informed pitch-wise alignment using score-driven non-negative matrix factorization

Tien Ming Wang, Pei Yin Tsai, Alvin W.Y. Su

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper aims at the task of score alignment that aligns the audio recording and its corresponding score properly. Conventional methods are mostly hard to deal with the issue: the asynchrony in the recording of simultaneous notes in the score. In this paper, we propose a note-based score alignment by means of the pitch-by-time format called piano-roll feature. One of the manners is that we develop an approach of converting audio spectrogram to piano-roll-like feature. The score-driven non-negative matrix factorization is then involved in the conversion. Secondly, the pitch-wise alignment is proposed, considering each pitch sequence (i.e., the row of pianoroll) separately. In the results, about 88% of notes match their onsets deviated from ground truth for less than 50 msbased on MIDI-Aligned Piano Sounds(MAPS) database.

Original languageEnglish
Title of host publicationICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
Pages206-211
Number of pages6
DOIs
Publication statusPublished - 2012 Dec 1
Event2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012 - Shanghai, China
Duration: 2012 Jul 162012 Jul 18

Publication series

NameICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings

Other

Other2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
CountryChina
CityShanghai
Period12-07-1612-07-18

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Computer Vision and Pattern Recognition

Cite this

Wang, T. M., Tsai, P. Y., & Su, A. W. Y. (2012). Score-informed pitch-wise alignment using score-driven non-negative matrix factorization. In ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings (pp. 206-211). [6376613] (ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings). https://doi.org/10.1109/ICALIP.2012.6376613