Show simple item record

dc.contributor.authorLiu, L
dc.contributor.authorMorfi, G-V
dc.contributor.authorBenetos, E
dc.contributor.authorIEEE International Conference on Acoustics, Speech and Signal Processing
dc.date.accessioned2021-02-19T16:00:07Z
dc.date.available2021-01-30
dc.date.available2021-02-19T16:00:07Z
dc.date.issued2021-06-06
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/70432
dc.description.abstractResearch on automatic music transcription has largely focused on multi-pitch detection; there is limited discussion on how to obtain a machine- or human-readable score transcription. In this paper, we propose a method for joint multi-pitch detection and score transcription for polyphonic piano music. The outputs of our system include both a piano-roll representation (a descriptive transcription) and a symbolic musical notation (a prescriptive transcription). Unlike traditional methods that further convert MIDI transcriptions into musical scores, we use a multitask model combined with a Convolutional Recurrent Neural Network and Sequence-to-sequence models with attention mechanisms. We propose a Reshaped score representation that outperforms a LilyPond representation in terms of both prediction accuracy and time/memory resources, and compare different input audio spectrograms. We also create a new synthesized dataset for score transcription research. Experimental results show that the joint model outperforms a single-task model in score transcription.en_US
dc.format.extent? - ? (5)
dc.publisherIEEEen_US
dc.subjectautomatic music transcriptionen_US
dc.subjectsequence-to-sequence modelsen_US
dc.subjectscore transcriptionen_US
dc.titleJoint multi-pitch detection and score transcription for polyphonic piano musicen_US
dc.typeConference Proceedingen_US
dc.rights.holder© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
pubs.author-urlhttps://cheriell.github.io/en_US
pubs.notesNot knownen_US
pubs.publication-statusAccepteden_US
pubs.publisher-urlhttps://2021.ieeeicassp.org/en_US
dcterms.dateAccepted2021-01-30
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US
qmul.funderUKRI Centre for Doctoral Training in Artificial Intelligence and Music::Engineering and Physical Sciences Research Councilen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record