FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention

Image forgery is easily manufactured for illegal acts such as spreading misleading information, which can have unfortunate consequences for society. In this work, we propose a Discrete Cosine Transformation (DCT) based multi-task learning network named FBI-Net, for forgery localization. Our proposed...

Full description

Saved in:

Bibliographic Details
Main Authors:	A-Rom Gu, Ju-Hyeon Nam, Sang-Chul Lee
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Deep learning forgery detection frequency domain self-attention localization
Online Access:	https://ieeexplore.ieee.org/document/9793665/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841536179095207936
author	A-Rom Gu Ju-Hyeon Nam Sang-Chul Lee
author_facet	A-Rom Gu Ju-Hyeon Nam Sang-Chul Lee
author_sort	A-Rom Gu
collection	DOAJ
description	Image forgery is easily manufactured for illegal acts such as spreading misleading information, which can have unfortunate consequences for society. In this work, we propose a Discrete Cosine Transformation (DCT) based multi-task learning network named FBI-Net, for forgery localization. Our proposed network adopts a fully convolutional encoder-decoder architecture, consisting of three encoders sharing parameters, a bridge attention module, and two output streams in the decoder. The encoder takes three inputs: RGB images and high-/low- DCT-filtered images. High-frequency components help learn object characteristics that improve CNN accuracy; low-frequency components are essential frequency information to keep most of the energy found in the typical DCT. Subsequently, Dilated Frequency Self-Attention Module, DFSAM in the bridge layer, is incorporated into the network to recalibrate the fused features and enhance the representation. Finally, in the decoder stage, region and edge information of the label are learned through multi-task learning to provide more extensive supervision for forged region localization; the edge stream will give a deeper understanding of features between forged and authentic images and help learn how to predict exquisite representations in images. Simultaneously, the auxiliary features from the pre-trained segmentation model are fused to separate the segmented background and objects, drawing the segmentation result of the dense region obtained. Extensive experiments show that our proposed FBI-Net outperforms existing forgery localization methods on six benchmark splicing and copy-move image datasets, CASIA TIDE v 1.0, CASIA TIDE v 2.0, Carvalho, Columbia, Coverage, IMD2020, achieving the best performance in an average of IoU of 70.99% and F1-score of 76.98% which is 9.79%, 9.82% higher than the previous method, respectively.
format	Article
id	doaj-art-6842ad7f94ad426ca50bdcc7c7a1fc18
institution	Kabale University
issn	2169-3536
language	English
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-6842ad7f94ad426ca50bdcc7c7a1fc182025-01-15T00:01:15ZengIEEEIEEE Access2169-35362022-01-0110627516276210.1109/ACCESS.2022.31820249793665FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-AttentionA-Rom Gu0Ju-Hyeon Nam1Sang-Chul Lee2https://orcid.org/0000-0002-6973-2416Department of Computer Science and Engineering, Inha University, Incheon, South KoreaDepartment of Computer Science and Engineering, Inha University, Incheon, South KoreaDepartment of Computer Science and Engineering, Inha University, Incheon, South KoreaImage forgery is easily manufactured for illegal acts such as spreading misleading information, which can have unfortunate consequences for society. In this work, we propose a Discrete Cosine Transformation (DCT) based multi-task learning network named FBI-Net, for forgery localization. Our proposed network adopts a fully convolutional encoder-decoder architecture, consisting of three encoders sharing parameters, a bridge attention module, and two output streams in the decoder. The encoder takes three inputs: RGB images and high-/low- DCT-filtered images. High-frequency components help learn object characteristics that improve CNN accuracy; low-frequency components are essential frequency information to keep most of the energy found in the typical DCT. Subsequently, Dilated Frequency Self-Attention Module, DFSAM in the bridge layer, is incorporated into the network to recalibrate the fused features and enhance the representation. Finally, in the decoder stage, region and edge information of the label are learned through multi-task learning to provide more extensive supervision for forged region localization; the edge stream will give a deeper understanding of features between forged and authentic images and help learn how to predict exquisite representations in images. Simultaneously, the auxiliary features from the pre-trained segmentation model are fused to separate the segmented background and objects, drawing the segmentation result of the dense region obtained. Extensive experiments show that our proposed FBI-Net outperforms existing forgery localization methods on six benchmark splicing and copy-move image datasets, CASIA TIDE v 1.0, CASIA TIDE v 2.0, Carvalho, Columbia, Coverage, IMD2020, achieving the best performance in an average of IoU of 70.99% and F1-score of 76.98% which is 9.79%, 9.82% higher than the previous method, respectively.https://ieeexplore.ieee.org/document/9793665/Deep learningforgery detectionfrequency domainself-attentionlocalization
spellingShingle	A-Rom Gu Ju-Hyeon Nam Sang-Chul Lee FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention IEEE Access Deep learning forgery detection frequency domain self-attention localization
title	FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention
title_full	FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention
title_fullStr	FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention
title_full_unstemmed	FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention
title_short	FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention
title_sort	fbi net frequency based image forgery localization via multitask learning with self attention
topic	Deep learning forgery detection frequency domain self-attention localization
url	https://ieeexplore.ieee.org/document/9793665/
work_keys_str_mv	AT aromgu fbinetfrequencybasedimageforgerylocalizationviamultitasklearningwithselfattention AT juhyeonnam fbinetfrequencybasedimageforgerylocalizationviamultitasklearningwithselfattention AT sangchullee fbinetfrequencybasedimageforgerylocalizationviamultitasklearningwithselfattention

FBI-Net: Frequency-Based Image Forgery Localization via Multitask Learning With Self-Attention

Similar Items