Browsing School of Electronic Engineering and Computer Science by Author "Tovstogan, P"

Now showing items 1-1 of 1

The Song Describer dataset: a corpus of audio captions for music-and-language evaluation

Manco, I; Weck, B; Doh, S; Won, M; Zhang, Y; Bodganov, D; Wu, Y; Chen, K; Tovstogan, P; Benetos, E (NeurIPS Machine Learning for Audio Workshop, 2023-12-16)

We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models. The dataset consists of 1.1k human-written natural ...