SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization

doi:10.48550/arXiv.2009.00726

SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization

We present a novel framework, Spatial Pyramid Attention Network (SPAN) for detection and localization of multiple types of image manipulations. The proposed architecture efficiently and effectively models the relationship between image patches at multiple scales by constructing a pyramid of local self-attention blocks. The design includes a novel position projection to encode the spatial positions of the patches. SPAN is trained on a generic, synthetic dataset but can also be fine tuned for specific datasets; The proposed method shows significant gains in performance on standard datasets over previous state-of-the-art methods.

Publication:

arXiv e-prints

Pub Date:

September 2020

DOI:

10.48550/arXiv.2009.00726

arXiv:

arXiv:2009.00726

Bibcode:

2020arXiv200900726H

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
I.4.9

E-Print:

Accepted at ECCV 2020 (https://link.springer.com/chapter/10.1007%2F978-3-030-58589-1_19) Code Available at https://github.com/ZhiHanZ/IRIS0-SPAN/

NASA/ADS

SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization

Abstract