Dual-Channel Deepfake Audio Detection: Leveraging Direct and Reverberant Waveforms

Deepfake content, including audio, video, images, and text, that is synthesized or modified using artificial intelligence is designed to convincingly mimic real content. As deepfake generation technology advances, detecting deepfake content presents significant challenges. While recent progress has been made...

Bibliographic Details
Main Authors:	Gunwoo Lee, Jungmin Lee, Minkyo Jung, Joseph Lee, Kihun Hong, Souhwan Jung, Yoseob Han
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Deepfake audio detection; dual-channel data; direct waveform; reverberant waveform
Online Access:	https://ieeexplore.ieee.org/document/10849546/

author	Gunwoo Lee Jungmin Lee Minkyo Jung Joseph Lee Kihun Hong Souhwan Jung Yoseob Han
collection	DOAJ
description	Deepfake content, including audio, video, images, and text, that is synthesized or modified using artificial intelligence is designed to convincingly mimic real content. As deepfake generation technology advances, detecting deepfake content presents significant challenges. While recent progress has been made in detection techniques, identifying deepfake audio remains particularly challenging. Previous approaches have attempted to capture deepfake features by combining video and audio content; however, these methods are ineffective when video and audio are mismatched due to occlusion. To address this, we propose a novel dual-channel deepfake audio detection model that leverages the direct and reverberant components extracted from raw audio signals, focusing exclusively on audio-based detection without reliance on video content. Across various datasets, including ASVspoof2019, FakeAVCeleb, and sport press conference datasets collected by our group, the proposed dual-channel model demonstrates significant improvements in quantitative metrics such as equal error rate and area under the curve. The implementation is available at https://github.com/gunwoo5034/Dual-Channel-Audio-Deepfake-Detection.
format	Article
id	doaj-art-657c7498ddb84b75ae49abf1828b682a
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-657c7498ddb84b75ae49abf1828b682a; 2025-01-31T00:01:39Z; eng; IEEE; IEEE Access; 2169-3536; 2025-01-01; vol. 13; pp. 18040-18052; 10.1109/ACCESS.2025.3532775; 10849546; Dual-Channel Deepfake Audio Detection: Leveraging Direct and Reverberant Waveforms; Gunwoo Lee [0] (https://orcid.org/0009-0008-4185-5626); Jungmin Lee [1]; Minkyo Jung [2]; Joseph Lee [3] (https://orcid.org/0009-0007-3817-0514); Kihun Hong [4] (https://orcid.org/0000-0002-5538-3630); Souhwan Jung [5] (https://orcid.org/0000-0003-2676-3412); Yoseob Han [6] (https://orcid.org/0000-0002-0382-7826); [0-2, 4-6] School of Electronic Engineering, Soongsil University, Seoul, Republic of Korea; [3] Department of Statistics and Actuarial Science, Soongsil University, Seoul, Republic of Korea; https://ieeexplore.ieee.org/document/10849546/; Deepfake audio detection; dual-channel data; direct waveform; reverberant waveform
title	Dual-Channel Deepfake Audio Detection: Leveraging Direct and Reverberant Waveforms
topic	Deepfake audio detection; dual-channel data; direct waveform; reverberant waveform
url	https://ieeexplore.ieee.org/document/10849546/
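
The description above reports results in terms of equal error rate (EER), the operating point at which the false acceptance rate and the false rejection rate are equal. As a minimal sketch of how that metric is commonly computed from detector scores, the Python below sweeps every observed score as a threshold. The function name `equal_error_rate`, the label convention (1 = bona fide, 0 = spoof), and the toy scores are assumptions made for illustration; they are not taken from the article or its repository, whose evaluation code may differ.

```python
def equal_error_rate(scores, labels):
    """Approximate the EER by sweeping each observed score as a threshold.

    Assumed convention (hypothetical, not from the article):
    label 1 = bona fide, label 0 = spoof, higher score = more likely bona fide.
    Returns (eer, threshold).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]  # bona fide scores
    neg = [s for s, y in zip(scores, labels) if y == 0]  # spoof scores
    if not pos or not neg:
        raise ValueError("need at least one bona fide and one spoof sample")

    best = None
    for t in sorted(set(scores)):
        # False acceptance: spoof samples scored at or above the threshold.
        far = sum(s >= t for s in neg) / len(neg)
        # False rejection: bona fide samples scored below the threshold.
        frr = sum(s < t for s in pos) / len(pos)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2.0, t)
    return best[1], best[2]


# Toy data, invented purely for illustration.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 0, 1, 0, 0, 0]
eer, threshold = equal_error_rate(scores, labels)
print(eer, threshold)  # 0.25 0.6
```

With only a finite set of scores the two error rates may never cross exactly, so this sketch reports the midpoint at the threshold where they are closest; library implementations often interpolate along the ROC curve instead.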

Similar Items