Accurate audio-to-score alignment for expressive violin recordings

Jia Ling Syue, Li Su, Yi Ju Lin, Pei Ching Li, Yen Kuang Lu, Yu Lin Wang, Alvin W.Y. Su

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

An audio-to-score alignment system that adapts to various playing styles and techniques and annotates onsets and offsets with high accuracy is a key step toward advanced research on automatic music expression analysis. Technical barriers include the processing of overlapped notes, repeated note sequences, and silence. Most of these characteristics vary with expression. In this paper, the audio-to-score alignment problem of expressive violin performance is addressed. We propose a two-stage alignment system composed of the dynamic time warping (DTW) algorithm, simulation of overlapped sustained notes, a background noise model, silence detection, and a refinement process to better capture onsets. More importantly, we use nonnegative matrix factorization (NMF) to synthesize the reference signal in order to deal with the highly diverse timbre of real-world performances. We use a dataset of annotated expressive violin recordings in which each piece is played according to various expressive musical terms. The optimal choice of the basic parameters considered in conventional alignment systems, such as features, distance functions in DTW, synthesis methods for the reference signal, and energy ratios, is analyzed. Different settings for different expressions are compared and discussed. Results show that the proposed methods notably improve on the conventional DTW-based alignment method.
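As a point of reference for the conventional DTW-based baseline mentioned in the abstract, the following is a minimal sketch of plain dynamic time warping between two feature sequences. It is not the authors' system: the function name `dtw_path`, the Euclidean frame distance, and the three-way step pattern are assumptions made for illustration, and the paper itself compares several features and distance functions and adds further stages (NMF-based reference synthesis, silence detection, refinement) that are not shown here.

```python
import numpy as np

def dtw_path(ref, perf):
    """Plain DTW between two feature sequences (illustrative sketch).

    ref  : array of shape (n, d), features of the score-derived reference
    perf : array of shape (m, d), features of the recorded performance
    Returns the total alignment cost and the warping path as a list of
    (ref_index, perf_index) pairs.
    Assumptions: Euclidean frame distance, steps (1,1), (1,0), (0,1).
    """
    ref = np.asarray(ref, dtype=float)
    perf = np.asarray(perf, dtype=float)
    n, m = len(ref), len(perf)

    # Pairwise frame-to-frame distance matrix, shape (n, m).
    dist = np.linalg.norm(ref[:, None, :] - perf[None, :, :], axis=2)

    # Accumulated cost with an extra border row/column of infinity.
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1])
            acc[i, j] = dist[i - 1, j - 1] + best_prev

    # Backtrack from the end to the start along the cheapest predecessors.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [
            (acc[i - 1, j - 1], i - 1, j - 1),
            (acc[i - 1, j], i - 1, j),
            (acc[i, j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
        path.append((i - 1, j - 1))
    path.reverse()
    return acc[n, m], path
```

In an alignment setting, `ref` would hold frame-wise features of a signal synthesized from the score and `perf` the same features computed on the recording; note onsets in the score are then mapped to recording frames by following the returned path.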

Original language: English
Title of host publication: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017
Editors: Sally Jo Cunningham, Zhiyao Duan, Xiao Hu, Douglas Turnbull
Publisher: International Society for Music Information Retrieval
Pages: 250-256
Number of pages: 7
ISBN (Electronic): 9789811151798
Publication status: Published - 2017 Jan 1
Event: 18th International Society for Music Information Retrieval Conference, ISMIR 2017 - Suzhou, China
Duration: 2017 Oct 23 → 2017 Oct 27

Publication series

Name: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

Conference

Conference: 18th International Society for Music Information Retrieval Conference, ISMIR 2017
Country: China
City: Suzhou
Period: 17-10-23 → 17-10-27

All Science Journal Classification (ASJC) codes

  • Music
  • Information Systems

Cite this

Syue, J. L., Su, L., Lin, Y. J., Li, P. C., Lu, Y. K., Wang, Y. L., & Su, A. W. Y. (2017). Accurate audio-to-score alignment for expressive violin recordings. In S. J. Cunningham, Z. Duan, X. Hu, & D. Turnbull (Eds.), Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017 (pp. 250-256). (Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017). International Society for Music Information Retrieval.