Key points are not available for this paper at this time.
Dynamic time warping (DTW) has proven to be an extremely effective method for both aligning and matching recordings of music to corresponding MIDI transcriptions. However, its performance is heavily affected by factors such as the representation used for the audio and MIDI data and its adjustable parameters. We therefore investigate automatically optimizing the design of DTW-based alignment and matching systems. Our approach uses Bayesian optimization to tune system design and parameters over a synthetically-created dataset of audio and MIDI pairs. We then perform an exhaustive search over DTW score normalization techniques to find the optimal method for reporting a reliable alignment confidence score, as required in matching tasks. This results in a DTW-based system which is conceptually simple and highly accurate at both alignment and matching. We verified that this system achieves high performance in a large-scale qualitative evaluation of real-world alignments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Colin Raffel
Vector Institute
Daniel P. W. Ellis
European Space Agency
Columbia University
Building similarity graph...
Analyzing shared references across papers
Loading...
Raffel et al. (Tue,) studied this question.
synapsesocial.com/papers/6a14adba4084ab02a46bee40 — DOI: https://doi.org/10.1109/icassp.2016.7471641