dc.contributor.advisor: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.contributor.author: O'Hanlon, K
dc.contributor.author: Benetos, E
dc.contributor.author: Dixon, S
dc.contributor.author: IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
dc.date.accessioned: 2021-10-01T13:29:34Z
dc.date.available: 2021-08-15
dc.date.available: 2021-10-01T13:29:34Z
dc.date.issued: 2021-10-25
dc.identifier.uri: https://qmro.qmul.ac.uk/xmlui/handle/123456789/74336
dc.description.abstract: Deep Learning (DL) has recently been applied successfully to the task of Cover Song Identification (CSI). Meanwhile, neural networks that consider music signal data structure in their design have been developed. In this paper, we propose a Pitch Class Key-Invariant Network, PiCKINet, for CSI. Like some other CSI networks, PiCKINet inputs a Constant-Q Transform (CQT) pitch feature. Unlike other such networks, large multi-octave kernels produce a latent representation with pitch class dimensions that are maintained throughout PiCKINet by key-invariant convolutions. PiCKINet is seen to be more effective, and efficient, than other CQT-based networks. We also propose an extended variant, PiCKINet+, that employs a centre loss penalty, squeeze and excite units, and octave swapping data augmentation. PiCKINet+ shows an improvement of ~17% MAP relative to the well-known CQTNet when tested on a set of ~16K tracks. [en_US]
dc.format.extent: ? - ? (6)
dc.publisher: IEEE [en_US]
dc.title: Detecting cover songs with pitch class key-invariant networks [en_US]
dc.type: Conference Proceeding [en_US]
dc.rights.holder: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
pubs.notes: Not known [en_US]
pubs.publication-status: Accepted [en_US]
pubs.publisher-url: https://2021.ieeemlsp.org/ [en_US]
dcterms.dateAccepted: 2021-08-15
rioxxterms.funder: Default funder [en_US]
rioxxterms.identifier.project: Default project [en_US]
qmul.funder: Development of next generation music recognition algorithm for content monitoring::Innovate UK [en_US]

