Joint multi-pitch detection and score transcription for polyphonic piano music
Abstract
Research on automatic music transcription has largely focused on multi-pitch detection; there is limited discussion on how to obtain a machine- or human-readable score transcription. In this paper, we propose a method for joint multi-pitch detection and score transcription for polyphonic piano music. The outputs of our system include both a piano-roll representation (a descriptive transcription) and symbolic musical notation (a prescriptive transcription). Unlike traditional methods that first produce a MIDI transcription and then convert it into a musical score, we use a multitask model combining a Convolutional Recurrent Neural Network and a Sequence-to-sequence model with an attention mechanism. We propose a Reshaped score representation that outperforms a LilyPond representation in terms of both prediction accuracy and time/memory usage, and we compare different input audio spectrogram representations. We also create a new synthesized dataset for score transcription research. Experimental results show that the joint model outperforms a single-task model in score transcription.
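To make the described architecture concrete, the following is a minimal PyTorch sketch of a multitask model with a shared convolutional-recurrent encoder, a frame-level piano-roll head, and an attention-based sequence decoder for score tokens. All module names, layer sizes, and the token vocabulary are hypothetical illustrations under these assumptions, not the paper's implementation.

import torch
import torch.nn as nn

class JointTranscriptionSketch(nn.Module):
    """Hypothetical sketch: shared CRNN encoder feeding (1) a frame-level
    multi-pitch head (descriptive piano-roll) and (2) an attention-based
    decoder over score tokens (prescriptive transcription)."""

    def __init__(self, n_bins=229, n_pitches=88, vocab_size=128, hidden=256):
        super().__init__()
        # Convolutional front end over the input spectrogram
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((1, 2)),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((1, 2)),
        )
        conv_out = 32 * (n_bins // 4)
        # Recurrent encoder shared by both tasks
        self.encoder = nn.GRU(conv_out, hidden, batch_first=True, bidirectional=True)
        # Task 1: frame-level piano-roll prediction
        self.pitch_head = nn.Linear(2 * hidden, n_pitches)
        # Task 2: attention-based decoder producing score tokens
        self.embed = nn.Embedding(vocab_size, hidden)
        self.q_proj = nn.Linear(hidden, 2 * hidden)
        self.attn = nn.MultiheadAttention(2 * hidden, num_heads=4, batch_first=True)
        self.dec_rnn = nn.GRU(hidden + 2 * hidden, 2 * hidden, batch_first=True)
        self.token_head = nn.Linear(2 * hidden, vocab_size)

    def forward(self, spec, tokens):
        # spec: (batch, frames, n_bins); tokens: (batch, seq) previous score tokens
        x = self.conv(spec.unsqueeze(1))                 # (B, C, T, F')
        b, c, t, f = x.shape
        x = x.permute(0, 2, 1, 3).reshape(b, t, c * f)   # flatten channels x freq
        enc, _ = self.encoder(x)                         # (B, T, 2*hidden)
        pianoroll = torch.sigmoid(self.pitch_head(enc))  # frame-level pitch activations
        emb = self.embed(tokens)                         # (B, S, hidden)
        context, _ = self.attn(self.q_proj(emb), enc, enc)  # attend over encoder frames
        dec_out, _ = self.dec_rnn(torch.cat([emb, context], dim=-1))
        token_logits = self.token_head(dec_out)          # score-token predictions
        return pianoroll, token_logits

# Example usage with random data (shapes only; not trained weights)
model = JointTranscriptionSketch()
spec = torch.randn(2, 100, 229)            # two clips, 100 frames, 229 frequency bins
tokens = torch.randint(0, 128, (2, 50))    # teacher-forced score-token prefixes
roll, logits = model(spec, tokens)

In such a setup, a joint loss would typically sum a binary cross-entropy term on the piano-roll output and a cross-entropy term on the score tokens, which is what allows the two tasks to share the encoder.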