dc.contributor.author | Alex, A | |
dc.contributor.author | Wang, L | |
dc.contributor.author | Gastaldo, P | |
dc.contributor.author | Cavallaro, A | |
dc.contributor.author | IEEE 23rd International Workshop on Multimedia Signal Processing | |
dc.date.accessioned | 2021-10-08T09:59:45Z | |
dc.date.available | 2021-07-24 | |
dc.date.available | 2021-10-08T09:59:45Z | |
dc.date.issued | 2021 | |
dc.identifier.uri | https://qmro.qmul.ac.uk/xmlui/handle/123456789/74424 | |
dc.description.abstract | Deep learning has advanced the state of the art in single-channel speech separation. However, separation models may overfit the training data, and generalization across datasets remains an open problem in noisy real-world conditions. In this paper we address the generalization problem using Mixup as a data augmentation approach. Mixup creates new training examples from linear combinations of samples during mini-batch training. We propose four variations of Mixup and assess how they improve the generalization of a speech separation model, DPRNN, with cross-corpus evaluation on the LibriMix, TIMIT and VCTK datasets. DPRNN allows efficient modelling of longer input sequences by splitting the learnt representation of the input mixture segment into small chunks and performing intra-chunk and inter-chunk operations iteratively. We show that training DPRNN with the proposed Data-only Mixup variation improves performance on an unseen dataset in noisy conditions when compared to baseline models augmented with SpecAugment, while achieving comparable performance on the source dataset. | en_US |
dc.publisher | IEEE | en_US |
dc.title | Mixup Augmentation for Generalizable Speech Separation | en_US |
dc.type | Conference Proceeding | en_US |
dc.rights.holder | © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
pubs.notes | Not known | en_US |
pubs.publication-status | Accepted | en_US |
dcterms.dateAccepted | 2021-07-24 | |
rioxxterms.funder | Default funder | en_US |
rioxxterms.identifier.project | Default project | en_US |
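
The abstract describes Mixup as creating new training examples from linear combinations of samples during mini-batch training. The sketch below illustrates that general idea only. It is not the paper's code and does not reproduce any of the four proposed variations (including Data-only Mixup), whose details are not given in this record. The helper name `mixup_batch`, the `alpha` value, and the use of a Beta(alpha, alpha) mixing coefficient are assumptions taken from the commonly used Mixup formulation.

```python
import random


def mixup_batch(inputs, targets, alpha=0.4, rng=None):
    """Generic Mixup on one mini-batch (hypothetical helper, not the paper's method).

    inputs and targets are lists of equal-length float sequences.
    Returns the mixed inputs, mixed targets, the coefficient lam, and the pairing.
    """
    rng = rng or random.Random()
    # Assumption: mixing coefficient drawn from Beta(alpha, alpha).
    lam = rng.betavariate(alpha, alpha)
    # Pair each example with another example from the same mini-batch.
    perm = list(range(len(inputs)))
    rng.shuffle(perm)

    def combine(rows):
        # Linear combination: lam * row_i + (1 - lam) * row_j
        return [
            [lam * a + (1.0 - lam) * b for a, b in zip(rows[i], rows[j])]
            for i, j in enumerate(perm)
        ]

    return combine(inputs), combine(targets), lam, perm


# Toy mini-batch of three two-sample "signals" with matching targets.
batch_x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
batch_y = [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5]]
mixed_x, mixed_y, lam, perm = mixup_batch(
    batch_x, batch_y, alpha=0.4, rng=random.Random(0)
)
```

Mixing both inputs and targets with the same coefficient keeps each mixed example consistent with its mixed target. Which signals are combined, and whether the targets are mixed at all, is exactly where variations of the technique can differ.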