Browsing School of Electronic Engineering and Computer Science by Author "Tovstogan, P"
Now showing items 1-1 of 1
-
The Song Describer dataset: a corpus of audio captions for music-and-language evaluation
Manco, I; Weck, B; Doh, S; Won, M; Zhang, Y; Bodganov, D; Wu, Y; Chen, K; Tovstogan, P; Benetos, E (NeurIPS Machine Learning for Audio Workshop, 2023-12-16)We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models. The dataset consists of 1.1k human-written natural ...