Swin‐fisheye: Object detection for fisheye images

Abstract Fisheye cameras have been widely used in autonomous navigation, visual surveillance, and automatic driving. Due to severe geometric distortion, fisheye images cannot be processed effectively by conventional methods. The existing object detection algorithms cannot better detect the small tar...

Full description

Saved in:
Bibliographic Details
Main Authors: Dawei Zhang, Tingting Yang, Bokai Zhao
Format: Article
Language:English
Published: Wiley 2024-11-01
Series:IET Image Processing
Subjects:
Online Access:https://doi.org/10.1049/ipr2.13216
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Fisheye cameras have been widely used in autonomous navigation, visual surveillance, and automatic driving. Due to severe geometric distortion, fisheye images cannot be processed effectively by conventional methods. The existing object detection algorithms cannot better detect the small targets or the objects with large distortion in the fisheye images. The size and scene of available fisheye datasets (such as WoodScape and VOC‐360) cannot satisfy the training of robust network models. Herein, the authors propose Swin‐Fisheye, an end‐to‐end object detection algorithm based on Swin Transformer. A feature pyramid module based on deformable convolution (DFPM) is designed to obtain richer contextual information from the multi‐scale feature maps. In addition, a projection transformation algorithm (PTA) is proposed, which can convert rectilinear images into fisheye images more accurately, and then create a fisheye image dataset (COCO‐Fish). The results of extensive experiments conducted on VOC‐360, WoodScape, and COCO‐Fish demonstrate that the proposed algorithm can achieve satisfactory results compared with state‐of‐the‐art methods.
ISSN:1751-9659
1751-9667