Show simple item record

dc.contributor.authorWilkinson, W
dc.date.accessioned2019-11-13T10:56:23Z
dc.date.available2019-11-13T10:56:23Z
dc.date.issued04/10/2019
dc.identifier.citationWilkinson, W. 2019. Gaussian Process Modelling for Audio Signals. Queen Mary University of Londonen_US
dc.identifier.urihttps://qmro.qmul.ac.uk/xmlui/handle/123456789/61329
dc.descriptionPhDen_US
dc.description.abstractAudio signals are characterised and perceived based on how their spectral make-up changes with time. Uncovering the behaviour of latent spectral components is at the heart of many real-world applications involving sound, but is a highly ill-posed task given the infi nite number of ways any signal can be decomposed. This motivates the use of prior knowledge and a probabilistic modelling paradigm that can characterise uncertainty. This thesis studies the application of Gaussian processes to audio, which offer a principled non-parametric way to specify probability distributions over functions whilst also encoding prior knowledge. Along the way we consider what prior knowledge we have about sound, the way it behaves, and the way it is perceived, and write down these assumptions in the form of probabilistic models. We show how Bayesian time-frequency analysis can be reformulated as a spectral mixture Gaussian process, and utilise modern day inference methods to carry out joint time-frequency analysis and nonnegative matrix factorisation. Our reformulation results in increased modelling flexibility, allowing more sophisticated prior knowledge to be encoded, which improves performance on a missing data synthesis task. We demonstrate the generality of this paradigm by showing how the joint model can additionally be applied to both denoising and source separation tasks without modi cation. We propose a hybrid statistical-physical model for audio spectrograms based on observations about the way amplitude envelopes decay over time, as well as a nonlinear model based on deep Gaussian processes. We examine the benefi ts of these methods, all of which are generative in the sense that novel signals can be sampled from the underlying models, allowing us to consider the extent to which they encode the important perceptual characteristics of sound.en_US
dc.language.isoenen_US
dc.publisherQueen Mary University of London
dc.subjectWomen’s Health Researchen_US
dc.subjectpregnancyen_US
dc.subjectDieten_US
dc.subjectphysical activityen_US
dc.subjectgestational weight gainen_US
dc.subjectIntervention outcomesen_US
dc.titleGaussian Process Modelling for Audio Signalsen_US
dc.typeThesisen_US
dc.rights.holderThe copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the author


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Theses [4189]
    Theses Awarded by Queen Mary University of London

Show simple item record