dc.contributor.author | Lins, F | |
dc.contributor.author | Johann, M | |
dc.contributor.author | BENETOS, E | |
dc.contributor.author | Schramm, R | |
dc.contributor.author | IEEE International Conference on Acoustics, Speech, and Signal Processing | |
dc.date.accessioned | 2019-03-26T10:24:50Z | |
dc.date.available | 2019-02-01 | |
dc.date.available | 2019-03-26T10:24:50Z | |
dc.date.issued | 2019-05-12 | |
dc.identifier.uri | https://qmro.qmul.ac.uk/xmlui/handle/123456789/56489 | |
dc.description.abstract | This paper presents a method for automatic transcription of the diatonic Harmonica instrument. It estimates the multi-pitch activations through a spectrogram factorisation framework. This framework is based on Probabilistic Latent Component Analysis (PLCA) and uses a fixed 4-dimensional dictionary with spectral templates extracted from Harmonica's instrument timbre. Methods based on spectrogram factorisation may suffer from local-optima issues in the presence of harmonic overlap or considerable timbre variability. To alleviate this issue, we propose a set of harmonic constraints that are inherent to the Harmonica instrument note layout or are caused by specific diatonic Harmonica playing techniques. These constraints help to guide the factorisation process until convergence into meaningful multi-pitch activations is achieved. This work also builds a new audio dataset containing solo recordings of diatonic Harmonica excerpts and the respective multi-pitch annotations. We compare our proposed approach against multiple baseline techniques for automatic music transcription on this dataset and report the results based on frame-based F-measure statistics. | en_US |
dc.format.extent | ? - ? (5) | |
dc.publisher | IEEE | en_US |
dc.title | Automatic Transcription of Diatonic Harmonica Recordings | en_US |
dc.type | Conference Proceeding | en_US |
dc.rights.holder | © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
pubs.notes | No embargo | en_US |
pubs.notes | IEEE conference, allows uploading postprints at institutional repositories. | en_US |
pubs.publication-status | Accepted | en_US |
pubs.publisher-url | https://2019.ieeeicassp.org/ | en_US |
dcterms.dateAccepted | 2019-02-01 | |
rioxxterms.funder | Default funder | en_US |
rioxxterms.identifier.project | Default project | en_US |
qmul.funder | A Machine Learning Framework for Audio Analysis and Retrieval::Royal Academy of Engineering | en_US |