Multi-pitch detection and voice assignment for a cappella recordings of multiple singers

Schramm, R; McLeod, A; Steedman, M; Benetos, E; 18th International Society for Music Information Retrieval Conference (ISMIR 2017)

dc.contributor.author	Schramm, R	en_US
dc.contributor.author	McLeod, A	en_US
dc.contributor.author	Steedman, M	en_US
dc.contributor.author	Benetos, E	en_US
dc.contributor.author	18th International Society for Music Information Retrieval Conference (ISMIR 2017)	en_US
dc.date.accessioned	2017-08-17T10:13:45Z
dc.date.available	2017-06-23	en_US
dc.date.issued	2017-10-23	en_US
dc.date.submitted	2017-07-16T10:05:55.440Z
dc.identifier.uri	http://qmro.qmul.ac.uk/xmlui/handle/123456789/25292
dc.description.abstract	This paper presents a multi-pitch detection and voice assignment method applied to audio recordings containing a cappella performances with multiple singers. A novel approach combining an acoustic model for multi-pitch detection and a music language model for voice separation and assignment is proposed. The acoustic model is a spectrogram factorization process based on Probabilistic Latent Component Analysis (PLCA), driven by a 6-dimensional dictionary with pre-learned spectral templates. The voice separation component is based on hidden Markov models that use musicological assumptions. By integrating the models, the system can detect multiple concurrent pitches in vocal music and assign each detected pitch to a specific voice corresponding to a voice type such as soprano, alto, tenor or bass (SATB). This work focuses on four-part compositions, and evaluations on recordings of Bach Chorales and Barbershop quartets show that our integrated approach achieves an F-measure of over 70% for frame-based multi-pitch detection and over 45% for four-voice assignment.	en_US
dc.format.extent	552 - 559 (8)	en_US
dc.publisher	ISMIR	en_US
dc.rights	18th International Society for Music Information Retrieval Conference (ISMIR 2017
dc.title	Multi-pitch detection and voice assignment for a cappella recordings of multiple singers	en_US
dc.type	Conference Proceeding
dc.rights.holder	© The Author(s) 2017
pubs.notes	No embargo	en_US
pubs.notes	ISMIR has CC-BY proceedings, there is no embargo	en_US
pubs.publication-status	Accepted	en_US
pubs.publisher-url	https://ismir2017.smcnus.org/	en_US
dcterms.dateAccepted	2017-06-23	en_US
qmul.funder	A Machine Learning Framework for Audio Analysis and Retrieval::Royal Academy of Engineering	en_US
qmul.funder	A Machine Learning Framework for Audio Analysis and Retrieval::Royal Academy of Engineering	en_US

Files in this item

Name:: Benetos Multi-pitch detection ...
Size:: 316.7Kb
Format:: application/
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

Electronic Engineering and Computer Science [3475]

Show simple item record