An extensible cluster-graph taxonomy for open set sound scene analysis

BEAR, H; BENETOS, E; Workshop on Detection and Classification of Acoustic Scenes and Events

View/Open

Accepted version (419.0Kb)

Publisher URL

http://dcase.community/workshop2018/

Metadata

Show full item record

Abstract

We present a new extensible and divisible taxonomy for open set sound scene analysis. This new model allows complex scene analysis with tangible descriptors and perception labels. Its novel structure is a cluster graph such that each cluster (or subset) can stand alone for targeted analyses such as office sound event detection, whilst maintaining integrity over the whole graph (superset) of labels. The key design benefit is its extensibility as new labels are needed during new data capture. Furthermore, datasets which use the same taxonomy are easily augmented, saving future data collection effort. We balance the details needed for complex scene analysis with avoiding 'the taxonomy of everything' with our framework to ensure no duplicity in the superset of labels and demonstrate this with DCASE challenge classifications.

Authors

BEAR, H; BENETOS, E; Workshop on Detection and Classification of Acoustic Scenes and Events

URI

http://qmro.qmul.ac.uk/xmlui/handle/123456789/45944

Collections

Centre for Digital Music (C4DM) [210]

Licence information

This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.