Published on 28 September 2024
Codecfake dataset - training set (part 3 of 3)
Description
This dataset is the training set (part 3 of 3) of the Codecfake dataset, corresponding to the manuscript "The Codecfake Dataset and Countermeasures for Universal Deepfake Audio Detection".

Abstract

With the proliferation of Audio Language Model (ALM) based deepfake audio, there is an urgent need for effective detection methods. Unlike traditional deepfake audio generation, which often involves multi-step processes culminating in vocoder usage, ALMs directly use neural codec methods to decode discrete codes into audio. Moreover, driven by large-scale data, ALMs exhibit remarkable robustness and versatility, posing a significant challenge to current audio deepfake detection (ADD) models. To effectively detect ALM-based deepfake audio, we focus on the core mechanism of ALM-based audio generation: the conversion from neural codec to waveform. We first construct the Codecfake dataset, an open-source, large-scale dataset that includes two languages, millions of audio samples, and various test conditions, tailored for ALM-based audio detection. Additionally, to achieve universal detection of deepfake audio and to tackle the domain ascent bias issue of the original SAM, we propose the CSAM strategy to learn a domain-balanced and generalized minima. Experimental results demonstrate that co-training on the Codecfake dataset and a vocoded dataset with the CSAM strategy yields the lowest average Equal Error Rate (EER) of 0.616% across all test conditions compared to baseline models.

Codecfake Dataset

Due to platform restrictions on the size of Zenodo repositories, we have divided the Codecfake dataset into several subsets, as shown in the table below:

| Codecfake dataset | Description | Link |
| --- | --- | --- |
| training set (part 1 of 3) & label | train_split.zip & train_split.z01 - train_split.z05 | https://zenodo.org/records/13838106 |
| training set (part 2 of 3) | train_split.z06 - train_split.z10 | https://zenodo.org/records/13841652 |
| training set (part 3 of 3) | train_split.z11 - train_split.z16 | https://zenodo.org/records/13853860 |
| development set | dev_split.zip & dev_split.z01 - dev_split.z02 | https://zenodo.org/records/13841216 |
| test set (part 1 of 2) | Codec test: C1.zip - C6.zip & ALM test: A1.zip - A3.zip | https://zenodo.org/records/13838823 |
| test set (part 2 of 2) | Codec unseen test: C7.zip | https://zenodo.org/records/11125029 |

Countermeasure

The source code of the countermeasure and the pre-trained model are available on GitHub: https://github.com/xieyuankun/Codecfake.

The Codecfake dataset and pre-trained model are licensed under the CC BY-NC-ND 4.0 license.
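Because the training and development archives are spread over several Zenodo records, it can help to confirm that every part has been downloaded into one directory before attempting extraction. The following is a minimal sketch, not part of the dataset's own tooling. It assumes the parts follow the usual split-ZIP naming (base.z01 ... base.zNN plus base.zip) implied by the file names listed above. The helper names `expected_parts` and `missing_parts` and the `LAST_VOLUME` mapping are hypothetical and were introduced here for illustration. The source does not say which extraction tool to use, so extraction itself is left out.

```python
from pathlib import Path

# Highest volume index per archive, read off the file names listed above.
# This mapping and the helper names are hypothetical, for illustration only.
LAST_VOLUME = {"train_split": 16, "dev_split": 2}


def expected_parts(base: str, last_volume: int) -> list[str]:
    """List the file names of a split ZIP: base.z01 ... base.zNN, then base.zip."""
    volumes = [f"{base}.z{i:02d}" for i in range(1, last_volume + 1)]
    return volumes + [f"{base}.zip"]


def missing_parts(directory: Path, base: str, last_volume: int) -> list[str]:
    """Return the expected part names that are not present in `directory`."""
    return [
        name
        for name in expected_parts(base, last_volume)
        if not (directory / name).is_file()
    ]


if __name__ == "__main__":
    # Assumes all downloaded parts were placed in the current directory.
    for base, last in LAST_VOLUME.items():
        missing = missing_parts(Path("."), base, last)
        if missing:
            print(f"{base}: missing {len(missing)} part(s): {missing}")
        else:
            print(f"{base}: all parts present")
```

Run from the download directory, the script reports which parts, if any, still need to be fetched from the records listed above.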
Publication Details
Subfield: Signal Processing
Field: Computer Science
Domain: Physical Sciences
Confidence Score: 45%
Source: Scholar Data Model