Localization and detection of deepfake videos based on self-blending method

Abstract Deepfake technology, which encompasses various video manipulation techniques implemented through deep learning algorithms-such as face swapping and expression alteration-has advanced to generate fake videos that are increasingly difficult for human observers to detect, posing significant th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Junfeng Xu, Xintao Liu, Weiguo Lin, Wenqing Shang, Yuefeng Wang
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-01-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-025-88523-1
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832571755753897984
author	Junfeng Xu Xintao Liu Weiguo Lin Wenqing Shang Yuefeng Wang
author_facet	Junfeng Xu Xintao Liu Weiguo Lin Wenqing Shang Yuefeng Wang
author_sort	Junfeng Xu
collection	DOAJ
description	Abstract Deepfake technology, which encompasses various video manipulation techniques implemented through deep learning algorithms-such as face swapping and expression alteration-has advanced to generate fake videos that are increasingly difficult for human observers to detect, posing significant threats to societal security. Existing methods for detecting deepfake videos aim to identify such manipulated content to effectively prevent the spread of misinformation. However, these methods often suffer from limited generalization capabilities, exhibiting poor performance when detecting fake videos outside of their training datasets. Moreover, research on the precise localization of manipulated regions within deepfake videos is limited, primarily due to the absence of datasets with fine-grained annotations that specify which regions have been manipulated.To address these challenges, this paper proposes a novel spatial-based training method that does not require fake samples to detect spatial manipulations in deepfake videos. By employing a technique that combines multi-part local displacement deformation and fusion, we generate more diverse deepfake feature data, enhancing the detection accuracy of specific manipulation methods while producing mixed-region labels to guide manipulation localization. We utilize the Swin-Unet model for manipulation localization detection, incorporating classification loss functions, local difference loss functions, and manipulation localization loss functions to effectively improve the precision of localization and detection.Experimental results demonstrate that the proposed spatial-based training method without fake samples effectively simulates the features present in real datasets. Our method achieves satisfactory detection accuracy on datasets such as FF++, Celeb-DF, and DFDC, while accurately localizing the manipulated regions. These findings indicate the effectiveness of the proposed self-blending method and model in deepfake video detection and manipulation localization.
format	Article
id	doaj-art-cf032ffbd8d54563ad386b4ea9b0584d
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-01-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-cf032ffbd8d54563ad386b4ea9b0584d2025-02-02T12:23:33ZengNature PortfolioScientific Reports2045-23222025-01-0115111510.1038/s41598-025-88523-1Localization and detection of deepfake videos based on self-blending methodJunfeng Xu0Xintao Liu1Weiguo Lin2Wenqing Shang3Yuefeng Wang4Communication University of China, School of Computer & Cyber SciencesCommunication University of China, School of Computer & Cyber SciencesCommunication University of China, School of Computer & Cyber SciencesCommunication University of China, School of Computer & Cyber SciencesCommunication University of China, School of Computer & Cyber SciencesAbstract Deepfake technology, which encompasses various video manipulation techniques implemented through deep learning algorithms-such as face swapping and expression alteration-has advanced to generate fake videos that are increasingly difficult for human observers to detect, posing significant threats to societal security. Existing methods for detecting deepfake videos aim to identify such manipulated content to effectively prevent the spread of misinformation. However, these methods often suffer from limited generalization capabilities, exhibiting poor performance when detecting fake videos outside of their training datasets. Moreover, research on the precise localization of manipulated regions within deepfake videos is limited, primarily due to the absence of datasets with fine-grained annotations that specify which regions have been manipulated.To address these challenges, this paper proposes a novel spatial-based training method that does not require fake samples to detect spatial manipulations in deepfake videos. By employing a technique that combines multi-part local displacement deformation and fusion, we generate more diverse deepfake feature data, enhancing the detection accuracy of specific manipulation methods while producing mixed-region labels to guide manipulation localization. We utilize the Swin-Unet model for manipulation localization detection, incorporating classification loss functions, local difference loss functions, and manipulation localization loss functions to effectively improve the precision of localization and detection.Experimental results demonstrate that the proposed spatial-based training method without fake samples effectively simulates the features present in real datasets. Our method achieves satisfactory detection accuracy on datasets such as FF++, Celeb-DF, and DFDC, while accurately localizing the manipulated regions. These findings indicate the effectiveness of the proposed self-blending method and model in deepfake video detection and manipulation localization.https://doi.org/10.1038/s41598-025-88523-1
spellingShingle	Junfeng Xu Xintao Liu Weiguo Lin Wenqing Shang Yuefeng Wang Localization and detection of deepfake videos based on self-blending method Scientific Reports
title	Localization and detection of deepfake videos based on self-blending method
title_full	Localization and detection of deepfake videos based on self-blending method
title_fullStr	Localization and detection of deepfake videos based on self-blending method
title_full_unstemmed	Localization and detection of deepfake videos based on self-blending method
title_short	Localization and detection of deepfake videos based on self-blending method
title_sort	localization and detection of deepfake videos based on self blending method
url	https://doi.org/10.1038/s41598-025-88523-1
work_keys_str_mv	AT junfengxu localizationanddetectionofdeepfakevideosbasedonselfblendingmethod AT xintaoliu localizationanddetectionofdeepfakevideosbasedonselfblendingmethod AT weiguolin localizationanddetectionofdeepfakevideosbasedonselfblendingmethod AT wenqingshang localizationanddetectionofdeepfakevideosbasedonselfblendingmethod AT yuefengwang localizationanddetectionofdeepfakevideosbasedonselfblendingmethod

Localization and detection of deepfake videos based on self-blending method

Similar Items