Show simple item record

dc.contributor.authorWilkinson, WJen_US
dc.contributor.authorAndersen, MRen_US
dc.contributor.authorReiss, JDen_US
dc.contributor.authorStowell, Den_US
dc.contributor.authorSolin, Aen_US
dc.date.accessioned2019-05-16T09:48:43Z
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/57579
dc.descriptionAccepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019en_US
dc.descriptionAccepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019en_US
dc.descriptionAccepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019en_US
dc.description.abstractA typical audio signal processing pipeline includes multiple disjoint analysis stages, including calculation of a time-frequency representation followed by spectrogram-based feature analysis. We show how time-frequency analysis and nonnegative matrix factorisation can be jointly formulated as a spectral mixture Gaussian process model with nonstationary priors over the amplitude variance parameters. Further, we formulate this nonlinear model's state space representation, making it amenable to infinite-horizon Gaussian process regression with approximate inference via expectation propagation, which scales linearly in the number of time steps and quadratically in the state dimensionality. By doing so, we are able to process audio signals with hundreds of thousands of data points. We demonstrate, on various tasks with empirical data, how this inference scheme outperforms more standard techniques that rely on extended Kalman filtering.en_US
dc.subjectstat.MLen_US
dc.subjectstat.MLen_US
dc.subjectcs.LGen_US
dc.subjectcs.SDen_US
dc.subjecteess.ASen_US
dc.subjecteess.SPen_US
dc.titleEnd-to-End Probabilistic Inference for Nonstationary Audio Analysisen_US
dc.typeConference Proceeding
dc.rights.holder© The Author(s) 2019
pubs.author-urlhttp://arxiv.org/abs/1901.11436v5en_US
pubs.notesNot knownen_US
rioxxterms.funderDefault funderen_US
rioxxterms.identifier.projectDefault projecten_US
qmul.funderStructured machine listening for soundscapes with multiple birds::EPSRCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record