A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images

The widespread growth of drone technology is generating new security paradigms, especially with regard to the unauthorized activities of UAVs in restricted or sensitive areas, as well as illegal and illicit activities or attacks. Among the various UAV detection technologies, vision systems in differ...

Full description

Saved in:
Bibliographic Details
Main Authors: Gian Gutierrez, Juan P. Llerena, Luis Usero, Miguel A. Patricio
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/1/109
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841549442256207872
author Gian Gutierrez
Juan P. Llerena
Luis Usero
Miguel A. Patricio
author_facet Gian Gutierrez
Juan P. Llerena
Luis Usero
Miguel A. Patricio
author_sort Gian Gutierrez
collection DOAJ
description The widespread growth of drone technology is generating new security paradigms, especially with regard to the unauthorized activities of UAVs in restricted or sensitive areas, as well as illegal and illicit activities or attacks. Among the various UAV detection technologies, vision systems in different spectra are postulated as outstanding technologies due to their peculiarities compared to other technologies. However, drone detection in thermal imaging is a challenging task due to specific factors such as thermal noise, temperature variability, or cluttered environments. This study addresses these challenges through a comparative evaluation of contemporary neural network architectures—specifically, convolutional neural networks (CNNs) and transformer-based models—for UAV detection in infrared imagery. The research focuses on real-world conditions and examines the performance of YOLOv9, GELAN, DETR, and ViTDet in different scenarios of the Anti-UAV Challenge 2023 dataset. The results show that YOLOv9 stands out for its real-time detection speed, while GELAN provides the highest accuracy in varying conditions and DETR performs reliably in thermally complex environments. The study contributes to the advancement of state-of-the-art UAV detection techniques and highlights the need for the further development of specialized models for specific detection scenarios.
format Article
id doaj-art-537d13b051394755b0d06ff2150d8842
institution Kabale University
issn 2076-3417
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-537d13b051394755b0d06ff2150d88422025-01-10T13:14:28ZengMDPI AGApplied Sciences2076-34172024-12-0115110910.3390/app15010109A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal ImagesGian Gutierrez0Juan P. Llerena1Luis Usero2Miguel A. Patricio3Applied Artificial Intelligence Group, Computer Science and Engineering Department, Universidad Carlos III de Madrid (ROR Code 03ths8210), 28270 Colmenarejo, Madrid, SpainApplied Artificial Intelligence Group, Computer Science and Engineering Department, Universidad Carlos III de Madrid (ROR Code 03ths8210), 28270 Colmenarejo, Madrid, SpainCognitive Science Research Group, Universidad de Alcalá, 28805 Alcalá de Henares, Madrid, SpainApplied Artificial Intelligence Group, Computer Science and Engineering Department, Universidad Carlos III de Madrid (ROR Code 03ths8210), 28270 Colmenarejo, Madrid, SpainThe widespread growth of drone technology is generating new security paradigms, especially with regard to the unauthorized activities of UAVs in restricted or sensitive areas, as well as illegal and illicit activities or attacks. Among the various UAV detection technologies, vision systems in different spectra are postulated as outstanding technologies due to their peculiarities compared to other technologies. However, drone detection in thermal imaging is a challenging task due to specific factors such as thermal noise, temperature variability, or cluttered environments. This study addresses these challenges through a comparative evaluation of contemporary neural network architectures—specifically, convolutional neural networks (CNNs) and transformer-based models—for UAV detection in infrared imagery. The research focuses on real-world conditions and examines the performance of YOLOv9, GELAN, DETR, and ViTDet in different scenarios of the Anti-UAV Challenge 2023 dataset. The results show that YOLOv9 stands out for its real-time detection speed, while GELAN provides the highest accuracy in varying conditions and DETR performs reliably in thermally complex environments. The study contributes to the advancement of state-of-the-art UAV detection techniques and highlights the need for the further development of specialized models for specific detection scenarios.https://www.mdpi.com/2076-3417/15/1/109unmanned aerial vehicles (UAVs)convolutional neural networks (CNNs)transformers (TNNs)thermal images
spellingShingle Gian Gutierrez
Juan P. Llerena
Luis Usero
Miguel A. Patricio
A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
Applied Sciences
unmanned aerial vehicles (UAVs)
convolutional neural networks (CNNs)
transformers (TNNs)
thermal images
title A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
title_full A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
title_fullStr A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
title_full_unstemmed A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
title_short A Comparative Study of Convolutional Neural Network and Transformer Architectures for Drone Detection in Thermal Images
title_sort comparative study of convolutional neural network and transformer architectures for drone detection in thermal images
topic unmanned aerial vehicles (UAVs)
convolutional neural networks (CNNs)
transformers (TNNs)
thermal images
url https://www.mdpi.com/2076-3417/15/1/109
work_keys_str_mv AT giangutierrez acomparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT juanpllerena acomparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT luisusero acomparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT miguelapatricio acomparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT giangutierrez comparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT juanpllerena comparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT luisusero comparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages
AT miguelapatricio comparativestudyofconvolutionalneuralnetworkandtransformerarchitecturesfordronedetectioninthermalimages