A generative model for natural sounds based on latent force modelling

Wilkinson, WJ; Reiss, JD; Stowell, D; LVA-ICA

dc.contributor.author	Wilkinson, WJ	en_US
dc.contributor.author	Reiss, JD	en_US
dc.contributor.author	Stowell, D	en_US
dc.contributor.author	LVA-ICA	en_US
dc.date.accessioned	2018-07-25T13:24:35Z
dc.date.available	2018-03-19	en_US
dc.date.issued	2018-06-06	en_US
dc.date.submitted	2018-07-16T15:51:19.468Z
dc.identifier.isbn	9783319937632	en_US
dc.identifier.issn	0302-9743	en_US
dc.identifier.uri	http://qmro.qmul.ac.uk/xmlui/handle/123456789/42563
dc.description.abstract	Generative models based on subband amplitude envelopes of natural sounds have resulted in convincing synthesis, showing subband amplitude modulation to be a crucial component of auditory perception. Probabilistic latent variable analysis can be particularly insightful, but existing approaches don’t incorporate prior knowledge about the physical behaviour of amplitude envelopes, such as exponential decay or feedback. We use latent force modelling, a probabilistic learning paradigm that encodes physical knowledge into Gaussian process regression, to model correlation across spectral subband envelopes. We augment the standard latent force model approach by explicitly modelling dependencies across multiple time steps. Incorporating this prior knowledge strengthens the interpretation of the latent functions as the source that generated the signal. We examine this interpretation via an experiment showing that sounds generated by sampling from our probabilistic model are perceived to be more realistic than those generated by comparative models based on nonnegative matrix factorisation, even in cases where our model is outperformed from a reconstruction error perspective.	en_US
dc.format.extent	259 - 269	en_US
dc.rights	This is a pre-copyedited, author-produced version of an article accepted for publication in International Conference on Latent Variable Analysis and Signal Separation following peer review. The version of record is available at https://link.springer.com/chapter/10.1007%2F978-3-319-93764-9_25
dc.title	A generative model for natural sounds based on latent force modelling	en_US
dc.type	Conference Proceeding
dc.rights.holder	© Springer International Publishing AG, part of Springer Nature 2018
dc.identifier.doi	10.1007/978-3-319-93764-9_25	en_US
pubs.notes	Not known	en_US
pubs.publication-status	Published	en_US
pubs.volume	10891 LNCS	en_US
dcterms.dateAccepted	2018-03-19	en_US
qmul.funder	Structured machine listening for soundscapes with multiple birds::EPSRC	en_US

Files in this item

Name: Wilkinson A Generative Model ...
Size: 1.441Mb
Format: application/
Description: Accepted Version

This item appears in the following Collection(s)

Organismal Biology [244]
