CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Interspeech 2024

TL;DR: We show that better detection of deepfake speech from codec-based TTS systems can be achieved by training models on speech re-synthesized with neural audio codecs. We also release the CodecFake dataset for this purpose.

Dataset Download

We provide the CodecFake dataset in two forms:

Huggingface Datasets

from datasets import load_dataset
a = load_dataset("rogertseng/CodecFake")

ZIP files

Train Fake Speech Detectors on CodecFake

See instructions under detection for more.

Dataset Creation Pipeline

TBA, see dataset_creation

Acknowledgement

CodecFake is created based on the VCTK dataset, licensed under CC-BY-4.0.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
dataset_creation @ 5bb2b98		dataset_creation @ 5bb2b98
detection @ 8e5f5f4		detection @ 8e5f5f4
.gitmodules		.gitmodules
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Dataset Download

Train Fake Speech Detectors on CodecFake

Dataset Creation Pipeline

Acknowledgement

About

Releases

Packages

roger-tseng/CodecFake

Folders and files

Latest commit

History

Repository files navigation

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Dataset Download

Train Fake Speech Detectors on CodecFake

Dataset Creation Pipeline

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages