Gaussian Process Modelling for Audio Signals

Wilkinson, W

dc.contributor.author	Wilkinson, W
dc.date.accessioned	2019-11-13T10:56:23Z
dc.date.available	2019-11-13T10:56:23Z
dc.date.issued	04/10/2019
dc.identifier.citation	Wilkinson, W. 2019. Gaussian Process Modelling for Audio Signals. Queen Mary University of London	en_US
dc.identifier.uri	https://qmro.qmul.ac.uk/xmlui/handle/123456789/61329
dc.description	PhD	en_US
dc.description.abstract	Audio signals are characterised and perceived based on how their spectral make-up changes with time. Uncovering the behaviour of latent spectral components is at the heart of many real-world applications involving sound, but is a highly ill-posed task given the infi nite number of ways any signal can be decomposed. This motivates the use of prior knowledge and a probabilistic modelling paradigm that can characterise uncertainty. This thesis studies the application of Gaussian processes to audio, which offer a principled non-parametric way to specify probability distributions over functions whilst also encoding prior knowledge. Along the way we consider what prior knowledge we have about sound, the way it behaves, and the way it is perceived, and write down these assumptions in the form of probabilistic models. We show how Bayesian time-frequency analysis can be reformulated as a spectral mixture Gaussian process, and utilise modern day inference methods to carry out joint time-frequency analysis and nonnegative matrix factorisation. Our reformulation results in increased modelling flexibility, allowing more sophisticated prior knowledge to be encoded, which improves performance on a missing data synthesis task. We demonstrate the generality of this paradigm by showing how the joint model can additionally be applied to both denoising and source separation tasks without modi cation. We propose a hybrid statistical-physical model for audio spectrograms based on observations about the way amplitude envelopes decay over time, as well as a nonlinear model based on deep Gaussian processes. We examine the benefi ts of these methods, all of which are generative in the sense that novel signals can be sampled from the underlying models, allowing us to consider the extent to which they encode the important perceptual characteristics of sound.	en_US
dc.language.iso	en	en_US
dc.publisher	Queen Mary University of London
dc.subject	Women’s Health Research	en_US
dc.subject	pregnancy	en_US
dc.subject	Diet	en_US
dc.subject	physical activity	en_US
dc.subject	gestational weight gain	en_US
dc.subject	Intervention outcomes	en_US
dc.title	Gaussian Process Modelling for Audio Signals	en_US
dc.type	Thesis	en_US
dc.rights.holder	The copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the author

Files in this item

Name:: WILKINSON_W_J_PhD_Final_041019.pdf
Size:: 2.087Mb
Format:: application/
Description:: WILKINSON_W_J_PhD_Final_041019.pdf

View/Open

This item appears in the following Collection(s)

Theses [4189]
Theses Awarded by Queen Mary University of London

Show simple item record