Mixup Augmentation for Generalizable Speech Separation

Alex, A; Wang, L; Gastaldo, P; Cavallaro, A. IEEE 23rd International Workshop on Multimedia Signal Processing

dc.contributor.author	Alex, A
dc.contributor.author	Wang, L
dc.contributor.author	Gastaldo, P
dc.contributor.author	Cavallaro, A
dc.contributor.author	IEEE 23rd International Workshop on Multimedia Signal Processing
dc.date.accessioned	2021-10-08T09:59:45Z
dc.date.available	2021-07-24
dc.date.available	2021-10-08T09:59:45Z
dc.date.issued	2021
dc.identifier.uri	https://qmro.qmul.ac.uk/xmlui/handle/123456789/74424
dc.description.abstract	Deep learning has advanced the state of the art of single-channel speech separation. However, separation models may overfit the training data, and generalization across datasets is still an open problem in real-world conditions with noise. In this paper we address the generalization problem using Mixup as a data augmentation approach. Mixup creates new training examples from linear combinations of samples during mini-batch training. We propose four variations of Mixup and assess the improved generalization of a speech separation model, DPRNN, with cross-corpus evaluation on the LibriMix, TIMIT and VCTK datasets. DPRNN allows efficient modelling of longer input sequences by splitting the learnt representation of an input mixture segment into small chunks and performing intra- and inter-chunk operations iteratively. We show that training DPRNN with the proposed Data-only Mixup augmentation variation improves performance on an unseen dataset in noisy conditions when compared to the baseline SpecAugment-augmented models, while having comparable performance on the source dataset.	en_US
dc.publisher	IEEE	en_US
dc.title	Mixup Augmentation for Generalizable Speech Separation	en_US
dc.type	Conference Proceeding	en_US
dc.rights.holder	© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
pubs.notes	Not known	en_US
pubs.publication-status	Accepted	en_US
dcterms.dateAccepted	2021-07-24
rioxxterms.funder	Default funder	en_US
rioxxterms.identifier.project	Default project	en_US
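
Illustrative note on the abstract: the statement that Mixup "creates new training examples from linear combinations of samples during mini-batch training" can be sketched as below. This is a minimal, generic Mixup sketch in NumPy, not a reproduction of any of the four variations proposed in the paper. The function name `mixup_batch`, the array shapes, and the default `alpha=0.4` are assumptions made for illustration only.

```python
import numpy as np


def mixup_batch(mixtures, sources, alpha=0.4, rng=None):
    """Generic Mixup on one mini-batch (hypothetical helper, not the paper's code).

    mixtures: array of shape (batch, time), the input speech mixtures.
    sources:  array of shape (batch, n_src, time), the target sources.
    alpha:    Beta distribution parameter (assumed value, not from the paper).
    """
    rng = np.random.default_rng() if rng is None else rng
    # Draw a single mixing coefficient for the whole batch.
    lam = rng.beta(alpha, alpha)
    # Pair each example with another example from the same batch.
    perm = rng.permutation(mixtures.shape[0])
    # Apply the same linear combination to inputs and targets.
    mixed_inputs = lam * mixtures + (1.0 - lam) * mixtures[perm]
    mixed_targets = lam * sources + (1.0 - lam) * sources[perm]
    return mixed_inputs, mixed_targets, lam


# Usage with synthetic data: 4 examples, 2 sources each, 16000 samples.
rng = np.random.default_rng(0)
sources = rng.standard_normal((4, 2, 16000))
mixtures = sources.sum(axis=1)  # noise-free mixtures for this toy example
mixed_inputs, mixed_targets, lam = mixup_batch(mixtures, sources, rng=rng)
```

Because the same coefficient is applied to inputs and targets, the mixed targets still sum to the mixed input in this noise-free toy case. Whether the targets are mixed at all is a design choice that differs between Mixup variants.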

Files in this item

Name: Wang Mixup Augmentation for 2021 ...
Size: 3.469Mb
Format: application/
Description: Accepted version


This item appears in the following Collection(s)

Electronic Engineering and Computer Science [3387]
