Automatic Music Transcription using Structure and Sparsity

O'Hanlon, Ken

dc.contributor.author	O'Hanlon, Ken
dc.date.accessioned	2015-09-22T12:41:19Z
dc.date.available	2015-09-22T12:41:19Z
dc.date.issued	02/05/2014
dc.identifier.citation	O'Hanlon., K.. 2014. Automatic Music Transcription using Structure and Sparsity. Queen Mary University of London	en_US
dc.identifier.uri	http://qmro.qmul.ac.uk/xmlui/handle/123456789/8818
dc.description	Phd	en_US
dc.description.abstract	Automatic Music Transcription seeks a machine understanding of a musical signal in terms of pitch-time activations. One popular approach to this problem is the use of spectrogram decompositions, whereby a signal matrix is decomposed over a dictionary of spectral templates, each representing a note. Typically the decomposition is performed using gradient descent based methods, performed using multiplicative updates based on Non-negative Matrix Factorisation (NMF). The final representation may be expected to be sparse, as the musical signal itself is considered to consist of few active notes. In this thesis some concepts that are familiar in the sparse representations literature are introduced to the AMT problem. Structured sparsity assumes that certain atoms tend to be active together. In the context of AMT this affords the use of subspace modelling of notes, and non-negative group sparse algorithms are proposed in order to exploit the greater modelling capability introduced. Stepwise methods are often used for decomposing sparse signals and their use for AMT has previously been limited. Some new approaches to AMT are proposed by incorporation of stepwise optimal approaches with promising results seen. Dictionary coherence is used to provide recovery conditions for sparse algorithms. While such guarantees are not possible in the context of AMT, it is found that coherence is a useful parameter to consider, affording improved performance in spectrogram decompositions.	en_US
dc.language.iso	en	en_US
dc.publisher	Queen Mary University of London
dc.subject	Electronic Engineering	en_US
dc.subject	Terahertz frequency domain	en_US
dc.subject	Electromagnetic fields	en_US
dc.subject	Bio-molecules	en_US
dc.subject	Molecular dynamics	en_US
dc.title	Automatic Music Transcription using Structure and Sparsity	en_US
dc.type	Thesis	en_US
dc.rights.holder	The copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the author

Files in this item

Name:: O'Hanlon, Ken 020514.pdf
Size:: 1.149Mb
Format:: application/

View/Open

This item appears in the following Collection(s)

Theses [4222]
Theses Awarded by Queen Mary University of London

Show simple item record