Now showing items 1-1 of 1

    • The Song Describer dataset: a corpus of audio captions for music-and-language evaluation 

      Manco, I; Weck, B; Doh, S; Won, M; Zhang, Y; Bodganov, D; Wu, Y; Chen, K; Tovstogan, P; Benetos, E (NeurIPS Machine Learning for Audio Workshop, 2023-12-16)
      We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models. The dataset consists of 1.1k human-written natural ...