Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer
Bird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/25/1/291 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841548895507709952 |
---|---|
author | Chaoyang Li Zhipeng He Kai Lu Chaoyang Fang |
author_facet | Chaoyang Li Zhipeng He Kai Lu Chaoyang Fang |
author_sort | Chaoyang Li |
collection | DOAJ |
description | Bird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing local feature extraction modules while ignoring the importance of global information. However, this global information is essential for accurate bird species detection. To address this limitation, we propose BSD-Net, a bird species detection network. BSD-Net efficiently learns local and global information in pixels to accurately detect bird species. BSD-Net consists of two main components: a dual-branch feature mixer (DBFM) and a prediction balancing module (PBM). The dual-branch feature mixer extracts features from dichotomous feature segments using global attention and deep convolution, expanding the network’s receptive field and achieving a strong inductive bias, allowing the network to distinguish between similar local details. The prediction balance module balances the difference in feature space based on the pixel values of each category, thereby resolving category imbalances and improving the network’s detection accuracy. The experimental results using two public benchmarks and a self-constructed Poyang Lake Bird dataset demonstrate that BSD-Net outperforms existing methods, achieving 45.71% and 80.00% mAP50 with the CUB-200-2011 and Poyang Lake Bird datasets, respectively, and 66.03% AP with FBD-SV-2024, allowing for more accurate location and species information for bird detection tasks in video surveillance. |
format | Article |
id | doaj-art-0b67db09432a41dfb1faa155437050c8 |
institution | Kabale University |
issn | 1424-8220 |
language | English |
publishDate | 2025-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj-art-0b67db09432a41dfb1faa155437050c82025-01-10T13:21:29ZengMDPI AGSensors1424-82202025-01-0125129110.3390/s25010291Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature MixerChaoyang Li0Zhipeng He1Kai Lu2Chaoyang Fang3Jiangxi Protected Area Construction Center, Nanchang 330006, ChinaJiangxi Protected Area Construction Center, Nanchang 330006, ChinaJiangxi Protected Area Construction Center, Nanchang 330006, ChinaKey Laboratory of Poyang Lake Wetland and Watershed Research, Ministry of Education, Jiangxi Normal University, Nanchang 330022, ChinaBird species detection is critical for applications such as the analysis of bird population dynamics and species diversity. However, this task remains challenging due to local structural similarities and class imbalances among bird species. Currently, most deep learning algorithms focus on designing local feature extraction modules while ignoring the importance of global information. However, this global information is essential for accurate bird species detection. To address this limitation, we propose BSD-Net, a bird species detection network. BSD-Net efficiently learns local and global information in pixels to accurately detect bird species. BSD-Net consists of two main components: a dual-branch feature mixer (DBFM) and a prediction balancing module (PBM). The dual-branch feature mixer extracts features from dichotomous feature segments using global attention and deep convolution, expanding the network’s receptive field and achieving a strong inductive bias, allowing the network to distinguish between similar local details. The prediction balance module balances the difference in feature space based on the pixel values of each category, thereby resolving category imbalances and improving the network’s detection accuracy. The experimental results using two public benchmarks and a self-constructed Poyang Lake Bird dataset demonstrate that BSD-Net outperforms existing methods, achieving 45.71% and 80.00% mAP50 with the CUB-200-2011 and Poyang Lake Bird datasets, respectively, and 66.03% AP with FBD-SV-2024, allowing for more accurate location and species information for bird detection tasks in video surveillance.https://www.mdpi.com/1424-8220/25/1/291bird detectionfeature extractionclass imbalancedeep learning |
spellingShingle | Chaoyang Li Zhipeng He Kai Lu Chaoyang Fang Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer Sensors bird detection feature extraction class imbalance deep learning |
title | Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer |
title_full | Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer |
title_fullStr | Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer |
title_full_unstemmed | Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer |
title_short | Bird Species Detection Net: Bird Species Detection Based on the Extraction of Local Details and Global Information Using a Dual-Feature Mixer |
title_sort | bird species detection net bird species detection based on the extraction of local details and global information using a dual feature mixer |
topic | bird detection feature extraction class imbalance deep learning |
url | https://www.mdpi.com/1424-8220/25/1/291 |
work_keys_str_mv | AT chaoyangli birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer AT zhipenghe birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer AT kailu birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer AT chaoyangfang birdspeciesdetectionnetbirdspeciesdetectionbasedontheextractionoflocaldetailsandglobalinformationusingadualfeaturemixer |