Show simple item record

dc.contributor.authorCHENG, Ten_US
dc.contributor.editorDixon, Sen_US
dc.contributor.editorMauch, Men_US
dc.date.accessioned2017-01-06T12:45:26Z
dc.date.issued2016-11en_US
dc.date.submitted2017-01-06T12:05:11.130Z
dc.identifier.urihttp://qmro.qmul.ac.uk/xmlui/handle/123456789/18421
dc.descriptionThis work was supported by a joint Queen Mary/China Scholarship Council Scholarship.
dc.descriptionThis work was supported by a joint Queen Mary/China Scholarship Council Scholarship.en_US
dc.descriptionThis work was supported by a joint Queen Mary/China Scholarship Council Scholarship.en_US
dc.descriptionThis work was supported by a joint Queen Mary/China Scholarship Council Scholarship.en_US
dc.descriptionThis work was supported by a joint Queen Mary/China Scholarship Council Scholarship.en_US
dc.description.abstractIn this thesis we exploit piano acoustics to automatically transcribe piano recordings into a symbolic representation: the pitch and timing of each detected note. To do so we use approaches based on non-negative matrix factorisation (NMF). To motivate the main contributions of this thesis, we provide two preparatory studies: a study of using a deterministic annealing EM algorithm in a matrix factorisation-based system, and a study of decay patterns of partials in real-word piano tones. Based on these studies, we propose two generative NMF-based models which explicitly model different piano acoustical features. The first is an attack/decay model, that takes into account the time-varying timbre and decaying energy of piano sounds. The system divides a piano note into percussive attack and harmonic decay stages, and separately models the two parts using two sets of templates and amplitude envelopes. The two parts are coupled by the note activations. We simplify the decay envelope by an exponentially decaying function. The proposed method improves the performance of supervised piano transcription. The second model aims at using the spectral width of partials as an independent indicator of the duration of piano notes. Each partial is represented by a Gaussian function, with the spectral width indicated by the standard deviation. The spectral width is large in the attack part, but gradually decreases to a stable value and remains constant in the decay part. The model provides a new aspect to understand the time-varying timbre of piano notes, but furtherinvestigation is needed to use it effectively to improve piano transcription. We demonstrate the utility of the proposed systems in piano music transcription and analysis. Results show that explicitly modelling piano acoustical features, especially temporal features, can improve the transcription performance.en_US
dc.description.sponsorshipQueen Mary/China Scholarship Council Scholarship.en_US
dc.language.isoenen_US
dc.subjectpiano acousticsen_US
dc.subjectnon-negative matrix factorisationen_US
dc.subjecttranscription performanceen_US
dc.titleExploiting Piano Acoustics in Automatic Transcriptionen_US
dc.rights.holderThe copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the author
pubs.notesNo embargoen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Theses [4209]
    Theses Awarded by Queen Mary University of London

Show simple item record