An Image Manipulation Localization Method Based on Dual-Branch Hybrid Convolution
Abstract
In existing image manipulation localization methods, the receptive field of standard convolution is limited, and during feature transfer, it is easy to lose high-frequency information about traces of manipulation. In addition, during feature fusion, the use of fixed sampling kernels makes it difficult to focus on local changes in features, leading to limited localization accuracy. This paper proposes an image manipulation localization method based on dual-branch hybrid convolution. First, a dual-branch hybrid convolution module is designed to expand the receptive field of the model to enhance the feature extraction ability of contextual semantic information, while also enabling the model to focus more on the high-frequency detail features of manipulation traces while localizing the manipulated area. Second, a multi-scale content-aware feature fusion module is used to dynamically generate adaptive sampling kernels for each position in the feature map, enabling the model to focus more on the details of local features while locating the manipulated area. Experimental results on multiple datasets show that this method not only effectively improves the accuracy of image manipulation localization but also enhances the robustness of the model.
References
Jin X, Yu W, Shi W, 2024, Image Manipulation Localization via Dynamic Cross-Modality Fusion and Progressive Integration. Neurocomputing, 610: 128607.
Wei H, Yan C, Li H, 2024, Image Tampering Localization Based on Integrated Multiscale Attention. Journal of Computer-Aided Design & Computer Graphics, 36(08): 1237–1245.
Zeng Z, Tan P, 2025, Image Tampering Detection and Localization Model Based on Multi-Branch HRNet. Modern Electronic Technique, 48(03): 35–42.
Varlamova AA, Kuznetsov AV, 2017, Image Splicing Localization Based on CFA-Artifacts Analysis. Computer Optics, 41(6): 920–930.
Hussien NY, Mahmoud RO, Zayed HH, 2020, Deep Learning on Digital Image Splicing Detection Using CFA Artifacts. International Journal of Sociotechnology and Knowledge Development (IJSKD), 12(2): 31–44.
Vidyadharan DS, Thampi SM, 2018, Evaluating Color and Texture Features for Forgery Localization from Illuminant Maps. Multimedia Tools and Applications, 77: 21131–21161.
Niyishaka P, Bhagvati C, 2021, Image Splicing Detection Technique Based on Illumination-Reflectance Model and LBP. Multimedia Tools and Applications, 80(2): 2161–2175.
Zhe S, Peng S, 2020, Authentication of Splicing Manipulation by Exposing Inconsistency in Color Shift. Multimedia Tools and Applications, 79(11): 8235–8248.
Lyu S, Pan X, Zhang X, 2014, Exposing Region Splicing Forgeries with Blind Local Noise Estimation. International Journal of Computer Vision, 110: 202–221.
Dong J, Chen L, Tian J, et al., 2016, A Novel Image Splicing Detection Method Based on the Inconsistency of Image Noise, 2016 IEEE 11th Conference on Industrial Electronics and Applications (ICIEA), IEEE, 560–563.
Zhu N, Li Z, 2018, Blind Image Splicing Detection via Noise Level Function. Signal Processing: Image Communication, 68: 181–192.
Wang SL, Liew AWC, Li SH, et al., 2014, Detection of Shifted Double JPEG Compression by an Adaptive DCT Coefficient Model. EURASIP Journal on Advances in Signal Processing, 2014: 1–17.
Thai TH, Cogranne R, Retraint F, et al., 2016, JPEG Quantization Step Estimation and Its Applications to Digital Image Forensics. IEEE Transactions on Information Forensics and Security, 12(1): 123–133.
Iakovidou C, Zampoglou M, Papadopoulos S, et al., 2018, Content-Aware Detection of JPEG Grid Inconsistencies for Intuitive Image Forensics. Journal of Visual Communication and Image Representation, 54: 155–170.
Zhou P, Han X, Morariu VI, et al., 2018, Learning Rich Features for Image Manipulation Detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1053–1061.
Wu Y, Abd Almageed W, Natarajan P, 2019, Mantra-Net: Manipulation Tracing Network for Detection and Localization of Image Forgeries with Anomalous Features, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9543–9552.
Bayar B, Stamm MC, 2018, Constrained Convolutional Neural Networks: A New Approach Towards General Purpose Image Manipulation Detection. IEEE Transactions on Information Forensics and Security, 13(11): 2691–2706.
Hu X, Zhang Z, Jiang Z, et al., 2020, SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization, Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXI 16, Springer International Publishing, 312–328.
Chen X, Dong C, Ji J, et al., 2021, Image Manipulation Detection by Multi-View Multi-Scale Supervision, Proceedings of the IEEE/CVF International Conference on Computer Vision, 14185–14193.
Hao J, Zhang Z, Yang S, et al., 2021, Transforensics: Image Forgery Localization with Dense Self-Attention, Proceedings of the IEEE/CVF International Conference on Computer Vision, 15055–15064.
Wang J, Wu Z, Chen J, et al., 2022, Objectformer for Image Manipulation Detection and Localization, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2364–2373.
Ma X, Du B, Jiang Z, et al., 2023, IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer. arXiv. https://arxiv.org/abs/2307.14863
Zeng K, Cheng R, Tan W, et al., 2024, MGQFormer: Mask-Guided Query-Based Transformer for Image Manipulation Localization, Proceedings of the AAAI Conference on Artificial Intelligence, 38(7): 6944–6952.
Ronneberger O, Fischer P, Brox T, 2015, U-Net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18, Springer International Publishing, 234–241.
Bi X, Wei Y, Xiao B, et al., 2019, RRU-Net: The Ringed Residual U-Net for Image Splicing Forgery Detection, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.
Finder SE, Amoyal R, Treister E, et al., 2024, Wavelet Convolutions for Large Receptive Fields, European Conference on Computer Vision, Springer Nature Switzerland, Cham, 363–380.
Wang J, Chen K, Xu R, et al., 2019, Carafe: Content-Aware Reassembly of Features, Proceedings of the IEEE/CVF International Conference on Computer Vision, 3007–3016.
Dong J, Wang W, Tan T, 2013, Casia Image Tampering Detection Evaluation Database, 2013 IEEE China Summit and International Conference on Signal and Information Processing, IEEE, 422–426.
Hsu YF, Chang SF, 2006, Detecting Image Splicing Using Geometry Invariants and Camera Characteristics Consistency, 2006 IEEE International Conference on Multimedia and Expo, IEEE, 549–552.
Guan H, Kozak M, Robertson E, et al., 2019, MFC Datasets: Large-Scale Benchmark Datasets for Media Forensic Challenge Evaluation, 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), IEEE, 63–72.
Zhou P, Chen B C, Han X, et al., 2020, Generate, Segment, and Refine: Towards Generic Manipulation Segmentation, Proceedings of the AAAI Conference on Artificial Intelligence, 34(07): 13058–13065.
Zhuang P, Li H, Tan S, et al., 2021, Image Tampering Localization Using a Dense Fully Convolutional Network. IEEE Transactions on Information Forensics and Security, 16: 2986–2999.
Zhuo L, Tan S, Li B, et al., 2022, Self-Adversarial Training Incorporating Forgery Attention for Image Forgery Localization. IEEE Transactions on Information Forensics and Security, 17: 819–834.
 
							