Reliable Local Explanations for Machine Listening

MISHRA, S; Benetos, E; Sturm, B; Dixon, S; International Joint Conference on Neural Networks (IJCNN)

dc.contributor.author	MISHRA, S
dc.contributor.author	Benetos, E
dc.contributor.author	Sturm, B
dc.contributor.author	Dixon, S
dc.contributor.author	International Joint Conference on Neural Networks (IJCNN)
dc.date.accessioned	2020-06-01T11:08:29Z
dc.date.available	2020-03-20
dc.date.available	2020-06-01T11:08:29Z
dc.date.issued	2020-07-19
dc.identifier.uri	https://qmro.qmul.ac.uk/xmlui/handle/123456789/64505
dc.description.abstract	One way to analyse the behaviour of machine learning models is through local explanations that highlight input features that maximally influence model predictions. Sensitivity analysis, which involves analysing the effect of input perturbations on model predictions, is one of the methods to generate local explanations. Meaningful input perturbations are essential for generating reliable explanations, but there exists limited work on what such perturbations are and how to perform them. This work investigates these questions in the context of machine listening models that analyse audio. Specifically, we use a state-of-the-art deep singing voice detection (SVD) model to analyse whether explanations from SoundLIME (a local explanation method) are sensitive to how the method perturbs model inputs. The results demonstrate that SoundLIME explanations are sensitive to the content in the occluded input regions. We further propose and demonstrate a novel method for quantitatively identifying suitable content type(s) for reliably occluding inputs of machine listening models. The results for the SVD model suggest that the average magnitude of input mel-spectrogram bins is the most suitable content type for temporal explanations.	en_US
dc.format.extent	? - ? (8)
dc.publisher	IEEE	en_US
dc.title	Reliable Local Explanations for Machine Listening	en_US
dc.type	Conference Proceeding	en_US
dc.rights.holder	© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
pubs.notes	Not known	en_US
pubs.publication-status	Accepted	en_US
pubs.publisher-url	https://wcci2020.org/	en_US
dcterms.dateAccepted	2020-03-20
rioxxterms.funder	Default funder	en_US
rioxxterms.identifier.project	Default project	en_US
qmul.funder	A Machine Learning Framework for Audio Analysis and Retrieval::Royal Academy of Engineering	en_US

Files in this item

Name:: Benetos Reliable Local Explanations ...
Size:: 1.162Mb
Format:: application/
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

Electronic Engineering and Computer Science [3475]

Show simple item record