Multi-pitch detection and voice assignment for a cappella recordings of multiple singers

Schramm, R; McLeod, A; Steedman, M; Benetos, E; 18th International Society for Music Information Retrieval Conference (ISMIR 2017)

View/Open

Accepted version (316.7Kb)

Pagination

552 - 559 (8)

Publisher

ISMIR

Publisher URL

https://ismir2017.smcnus.org/

Metadata

Show full item record

Abstract

This paper presents a multi-pitch detection and voice assignment method applied to audio recordings containing a cappella performances with multiple singers. A novel approach combining an acoustic model for multi-pitch detection and a music language model for voice separation and assignment is proposed. The acoustic model is a spectrogram factorization process based on Probabilistic Latent Component Analysis (PLCA), driven by a 6-dimensional dictionary with pre-learned spectral templates. The voice separation component is based on hidden Markov models that use musicological assumptions. By integrating the models, the system can detect multiple concurrent pitches in vocal music and assign each detected pitch to a specific voice corresponding to a voice type such as soprano, alto, tenor or bass (SATB). This work focuses on four-part compositions, and evaluations on recordings of Bach Chorales and Barbershop quartets show that our integrated approach achieves an F-measure of over 70% for frame-based multi-pitch detection and over 45% for four-voice assignment.

Authors

Schramm, R; McLeod, A; Steedman, M; Benetos, E; 18th International Society for Music Information Retrieval Conference (ISMIR 2017)

URI

http://qmro.qmul.ac.uk/xmlui/handle/123456789/25292

Collections

Electronic Engineering and Computer Science [3495]

Licence information

18th International Society for Music Information Retrieval Conference (ISMIR 2017