dc.contributor.author | Wilkinson, WJ | en_US |
dc.contributor.author | Reiss, JD | en_US |
dc.contributor.author | Stowell, D | en_US |
dc.contributor.author | LVA-ICA | en_US |
dc.date.accessioned | 2018-07-25T13:24:35Z | |
dc.date.available | 2018-03-19 | en_US |
dc.date.issued | 2018-06-06 | en_US |
dc.date.submitted | 2018-07-16T15:51:19.468Z | |
dc.identifier.isbn | 9783319937632 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://qmro.qmul.ac.uk/xmlui/handle/123456789/42563 | |
dc.description.abstract | Generative models based on subband amplitude envelopes of natural sounds have resulted in convincing synthesis, showing subband amplitude modulation to be a crucial component of auditory perception. Probabilistic latent variable analysis can be particularly insightful, but existing approaches don’t incorporate prior knowledge about the physical behaviour of amplitude envelopes, such as exponential decay or feedback. We use latent force modelling, a probabilistic learning paradigm that encodes physical knowledge into Gaussian process regression, to model correlation across spectral subband envelopes. We augment the standard latent force model approach by explicitly modelling dependencies across multiple time steps. Incorporating this prior knowledge strengthens the interpretation of the latent functions as the source that generated the signal. We examine this interpretation via an experiment showing that sounds generated by sampling from our probabilistic model are perceived to be more realistic than those generated by comparative models based on nonnegative matrix factorisation, even in cases where our model is outperformed from a reconstruction error perspective. | en_US |
dc.format.extent | 259 - 269 | en_US |
dc.rights | This is a pre-copyedited, author-produced version of an article accepted for publication in International Conference on Latent Variable Analysis and Signal Separation following peer review. The version of record is available https://link.springer.com/chapter/10.1007%2F978-3-319-93764-9_25 | |
dc.title | A generative model for natural sounds based on latent force modelling | en_US |
dc.type | Conference Proceeding | |
dc.rights.holder | © Springer International Publishing AG, part of Springer Nature 2018 | |
dc.identifier.doi | 10.1007/978-3-319-93764-9_25 | en_US |
pubs.notes | Not known | en_US |
pubs.publication-status | Published | en_US |
pubs.volume | 10891 LNCS | en_US |
dcterms.dateAccepted | 2018-03-19 | en_US |
qmul.funder | Structured machine listening for soundscapes with multiple birds::EPSRC | en_US |