Agreement among human and annotated transcriptions of global songs

Ozaki, Y; McBride, J; Benetos, E; Pfordresher, PQ; Six, J; T. Tierney, A; Proutskova, P; Sakai, E; Kondo, H; Fukatsu, H; Fujii, S; Savage, PE; 22nd International Society for Music Information Retrieval Conference (ISMIR)

View/Open

Accepted version (6.453Mb)

Publisher

International Society for Music Information Retrieval

Publisher URL

https://ismir2021.ismir.net/

Metadata

Show full item record

Abstract

Cross-cultural musical analysis requires standardized symbolic representation of sounds such as score notation. However, transcription into notation is usually conducted manually by ear, which is time-consuming and subjective. Our aim is to evaluate the reliability of existing methods for transcribing songs from diverse societies. We had 3 experts independently transcribe a sample of 32 excerpts of traditional monophonic songs from around the world (half a cappella, half with instrumental accompaniment). 16 songs also had pre-existing transcriptions created by 3 different experts. We compared these human transcriptions against one another and against 10 automatic music transcription algorithms. We found that human transcriptions can be sufficiently reliable (~90% agreement, κ ~.7), but current automated methods are not (<60% agreement, κ <.4). No automated method clearly outperformed others, in contrast to our predictions. These results suggest that improving automated methods for cross-cultural music transcription is critical for diversifying MIR.

Authors

Ozaki, Y; McBride, J; Benetos, E; Pfordresher, PQ; Six, J; T. Tierney, A; Proutskova, P; Sakai, E; Kondo, H; Fukatsu, H

URI

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73595

Collections

Electronic Engineering and Computer Science [3387]