SpecMix : A Mixed Sample Data Augmentation Method for Training with Time-Frequency Domain Features
Sample (material)
Training set
DOI:
10.21437/interspeech.2021-103
Publication Date:
2021-08-27T05:59:39Z
AUTHORS (3)
ABSTRACT
A mixed sample data augmentation strategy is proposed to enhance the performance of models on audio scene classification, sound event and speech enhancement tasks.While there have been several methods shown be effective in improving image classification performance, their efficacy toward time-frequency domain features not assured.We propose a novel approach named "Specmix" specifically designed for dealing with features.The method consists mixing two different samples by applying masks preserving spectral correlation each sample.Our experiments acoustic tasks show that Specmix improves various neural network architectures maximum 2.7%.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (0)
CITATIONS (19)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....