Agreement among human and automated estimates of similarity in a global music sample

Daikoku, H; Ding, S; Benetos, E; Wood, ALC; Shimizono, T; Sanne, US; Fujii, S; Savage, PE

10th International Workshop on Folk Music Analysis (FMA 2022)

Abstract

While music information retrieval (MIR) has made substantial progress in automatic analysis of audio similarity for Western music, it remains unclear whether these algorithms can be meaningfully applied to cross-cultural analyses of more diverse musics. Here we collect perceptual ratings from 62 Japanese participants using a global sample of 30 traditional songs, and compare these ratings against both pre-existing expert annotations and audio similarity algorithms. We find that different methods of perceptual ratings all produced similar, moderate levels of inter-rater agreement comparable to previous studies, but that agreement between human and automated methods is always low regardless of the specific methods used to calculate musical similarity. Our findings suggest that the MIR methods tested are unable to measure cross-cultural music similarity in perceptually meaningful ways.
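As a rough illustration of the kind of comparison the abstract describes, the sketch below measures agreement between two pairwise similarity matrices over the same set of songs, for example one built from averaged listener ratings and one from an audio similarity algorithm. It is a minimal sketch under stated assumptions, not the authors' procedure: the use of a rank (Spearman-style) correlation, the function names, and the toy 3-song matrices are all hypothetical and are not taken from the paper.

```python
import numpy as np


def upper_triangle(matrix):
    """Return the off-diagonal upper-triangle entries of a square matrix.

    Each entry corresponds to one unordered pair of songs.
    """
    m = np.asarray(matrix, dtype=float)
    rows, cols = np.triu_indices(m.shape[0], k=1)
    return m[rows, cols]


def average_ranks(values):
    """Rank values from 1..n, giving tied values their mean rank."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="mergesort")
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(1, len(values) + 1)
    for v in np.unique(values):
        tied = values == v
        ranks[tied] = ranks[tied].mean()
    return ranks


def rank_agreement(sim_a, sim_b):
    """Rank correlation between two similarity matrices over song pairs.

    Assumption: both matrices are symmetric and index the same songs
    in the same order.
    """
    a = average_ranks(upper_triangle(sim_a))
    b = average_ranks(upper_triangle(sim_b))
    return float(np.corrcoef(a, b)[0, 1])


# Hypothetical toy data: 3 songs, similarity in [0, 1].
human = [[1.0, 0.8, 0.2],
         [0.8, 1.0, 0.4],
         [0.2, 0.4, 1.0]]
algorithm = [[1.0, 0.3, 0.9],
             [0.3, 1.0, 0.5],
             [0.9, 0.5, 1.0]]

agreement = rank_agreement(human, algorithm)
```

In this toy case the algorithm orders the three song pairs in exactly the reverse order to the human ratings, so `agreement` comes out at -1. Because the entries of a pairwise similarity matrix are not independent of one another, a significance test on such a correlation would typically need a permutation-based approach rather than a standard p-value.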

URI

https://qmro.qmul.ac.uk/xmlui/handle/123456789/78967

Collections

Electronic Engineering and Computer Science

Licence information

This item is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.