Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer

Bird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing...

Full description

Saved in:
Bibliographic Details
Main Authors: Chaoyang Li, Zhipeng He, Kai Lu, Chaoyang Fang
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/1/291
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841548895507709952
author Chaoyang Li
Zhipeng He
Kai Lu
Chaoyang Fang
author_facet Chaoyang Li
Zhipeng He
Kai Lu
Chaoyang Fang
author_sort Chaoyang Li
collection DOAJ
description Bird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing local feature extraction modules while ignoring the importance of global information. However, this global information is essential for accurate bird species detection. To address this limitation, we propose BSD-Net, a bird species detection network. BSD-Net efficiently learns local and global information in pixels to accurately detect bird species. BSD-Net consists of two main components: a dual-branch feature mixer (DBFM) and a prediction balancing module (PBM). The dual-branch feature mixer extracts features from dichotomous feature segments using global attention and deep convolution, expanding the network’s receptive field and achieving a strong inductive bias, allowing the network to distinguish between similar local details. The prediction balance module balances the difference in feature space based on the pixel values of each category, thereby resolving category imbalances and improving the network’s detection accuracy. The experimental results using two public benchmarks and a self-constructed Poyang Lake Bird dataset demonstrate that BSD-Net outperforms existing methods, achieving 45.71% and 80.00% mAP50 with the CUB-200-2011 and Poyang Lake Bird datasets, respectively, and 66.03% AP with FBD-SV-2024, allowing for more accurate location and species information for bird detection tasks in video surveillance.
format Article
id doaj-art-0b67db09432a41dfb1faa155437050c8
institution Kabale University
issn 1424-8220
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-0b67db09432a41dfb1faa155437050c82025-01-10T13:21:29ZengMDPI AGSensors1424-82202025-01-0125129110.3390/s25010291Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature MixerChaoyang Li0Zhipeng He1Kai Lu2Chaoyang Fang3Jiangxi Protected Area Construction Center, Nanchang 330006, ChinaJiangxi Protected Area Construction Center, Nanchang 330006, ChinaJiangxi Protected Area Construction Center, Nanchang 330006, ChinaKey Laboratory of Poyang Lake Wetland and Watershed Research, Ministry of Education, Jiangxi Normal University, Nanchang 330022, ChinaBird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing local feature extraction modules while ignoring the importance of global information. However, this global information is essential for accurate bird species detection. To address this limitation, we propose BSD-Net, a bird species detection network. BSD-Net efficiently learns local and global information in pixels to accurately detect bird species. BSD-Net consists of two main components: a dual-branch feature mixer (DBFM) and a prediction balancing module (PBM). The dual-branch feature mixer extracts features from dichotomous feature segments using global attention and deep convolution, expanding the network’s receptive field and achieving a strong inductive bias, allowing the network to distinguish between similar local details. The prediction balance module balances the difference in feature space based on the pixel values of each category, thereby resolving category imbalances and improving the network’s detection accuracy. The experimental results using two public benchmarks and a self-constructed Poyang Lake Bird dataset demonstrate that BSD-Net outperforms existing methods, achieving 45.71% and 80.00% mAP50 with the CUB-200-2011 and Poyang Lake Bird datasets, respectively, and 66.03% AP with FBD-SV-2024, allowing for more accurate location and species information for bird detection tasks in video surveillance.https://www.mdpi.com/1424-8220/25/1/291bird detectionfeature extractionclass imbalancedeep learning
spellingShingle Chaoyang Li
Zhipeng He
Kai Lu
Chaoyang Fang
Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
Sensors
bird detection
feature extraction
class imbalance
deep learning
title Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
title_full Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
title_fullStr Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
title_full_unstemmed Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
title_short Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
title_sort bird species detection net bird species detection based on the extraction of local details and global information using a dual feature mixer
topic bird detection
feature extraction
class imbalance
deep learning
url https://www.mdpi.com/1424-8220/25/1/291
work_keys_str_mv AT chaoyangli birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer
AT zhipenghe birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer
AT kailu birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer
AT chaoyangfang birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer