NFDI4DS | UHH-SEMS - Publication Details

Audio Contrastive based Fine-tuning

FOS: Computer and information sciences; Sound (cs.SD); Computer Science - Computation and Language; Artificial Intelligence (cs.AI); Computer Science - Artificial Intelligence; Audio and Speech Processing (eess.AS); FOS: Electrical engineering, electronic engineering, information engineering; Computation and Language (cs.CL); Computer Science - Sound; Electrical Engineering and Systems Science - Audio and Speech Processing

DOI: 10.48550/arxiv.2309.11895

Publication Date: 2023-01-01

AUTHORS (6)

Wang, Yang

Liang, Qibin

Xiao, Chenghao

Li, Yizhi

Al Moubayed, Noura

Lin, Chenghua

ABSTRACT

Audio classification plays a crucial role in speech and sound processing tasks with a wide range of applications. A remaining challenge is striking the right balance between fitting the model to the training data without overfitting and enabling it to generalise well to a new domain. Leveraging the transferability of contrastive learning, we introduce Audio Contrastive-based Fine-tuning (AudioConFit), an efficient approach characterised by robust generalisability. Empirical experiments on a variety of audio classification tasks demonstrate the effectiveness and robustness of our approach, which achieves state-of-the-art results in various settings.

Under review
