The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings

Passive acoustic monitoring has emerged as a useful technique for monitoring vocal species and contributing to biodiversity monitoring goals. However, finding target sounds for species without pre-existing recognisers still proves challenging. Here, we demonstrate how the embeddings from the large a...

Full description

Saved in:
Bibliographic Details
Main Authors: Slade Allen-Ankins, Sebastian Hoefer, Jacopo Bartholomew, Sheryn Brodie, Lin Schwarzkopf
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Ecology and Evolution
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fevo.2024.1409407/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841527046741688320
author Slade Allen-Ankins
Sebastian Hoefer
Jacopo Bartholomew
Sheryn Brodie
Lin Schwarzkopf
author_facet Slade Allen-Ankins
Sebastian Hoefer
Jacopo Bartholomew
Sheryn Brodie
Lin Schwarzkopf
author_sort Slade Allen-Ankins
collection DOAJ
description Passive acoustic monitoring has emerged as a useful technique for monitoring vocal species and contributing to biodiversity monitoring goals. However, finding target sounds for species without pre-existing recognisers still proves challenging. Here, we demonstrate how the embeddings from the large acoustic model BirdNET can be used to quickly and easily find new sound classes outside the original model’s training set. We outline the general workflow, and present three case studies covering a range of ecological use cases that we believe are common requirements in research and management: monitoring invasive species, generating species lists, and detecting threatened species. In all cases, a minimal amount of target class examples and validation effort was required to obtain results applicable to the desired application. The demonstrated success of this method across different datasets and different taxonomic groups suggests a wide applicability of BirdNET embeddings for finding novel sound classes. We anticipate this method will allow easy and rapid detection of sound classes for which no current recognisers exist, contributing to both monitoring and conservation goals.
format Article
id doaj-art-a833dfb1c72b4f4cbd06117d7d47edab
institution Kabale University
issn 2296-701X
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Ecology and Evolution
spelling doaj-art-a833dfb1c72b4f4cbd06117d7d47edab2025-01-16T06:10:52ZengFrontiers Media S.A.Frontiers in Ecology and Evolution2296-701X2025-01-011210.3389/fevo.2024.14094071409407The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordingsSlade Allen-AnkinsSebastian HoeferJacopo BartholomewSheryn BrodieLin SchwarzkopfPassive acoustic monitoring has emerged as a useful technique for monitoring vocal species and contributing to biodiversity monitoring goals. However, finding target sounds for species without pre-existing recognisers still proves challenging. Here, we demonstrate how the embeddings from the large acoustic model BirdNET can be used to quickly and easily find new sound classes outside the original model’s training set. We outline the general workflow, and present three case studies covering a range of ecological use cases that we believe are common requirements in research and management: monitoring invasive species, generating species lists, and detecting threatened species. In all cases, a minimal amount of target class examples and validation effort was required to obtain results applicable to the desired application. The demonstrated success of this method across different datasets and different taxonomic groups suggests a wide applicability of BirdNET embeddings for finding novel sound classes. We anticipate this method will allow easy and rapid detection of sound classes for which no current recognisers exist, contributing to both monitoring and conservation goals.https://www.frontiersin.org/articles/10.3389/fevo.2024.1409407/fullacoustic recognitionbioacousticsbiodiversityconservationdeep learningpassive acoustic monitoring
spellingShingle Slade Allen-Ankins
Sebastian Hoefer
Jacopo Bartholomew
Sheryn Brodie
Lin Schwarzkopf
The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
Frontiers in Ecology and Evolution
acoustic recognition
bioacoustics
biodiversity
conservation
deep learning
passive acoustic monitoring
title The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
title_full The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
title_fullStr The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
title_full_unstemmed The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
title_short The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
title_sort use of birdnet embeddings as a fast solution to find novel sound classes in audio recordings
topic acoustic recognition
bioacoustics
biodiversity
conservation
deep learning
passive acoustic monitoring
url https://www.frontiersin.org/articles/10.3389/fevo.2024.1409407/full
work_keys_str_mv AT sladeallenankins theuseofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT sebastianhoefer theuseofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT jacopobartholomew theuseofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT sherynbrodie theuseofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT linschwarzkopf theuseofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT sladeallenankins useofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT sebastianhoefer useofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT jacopobartholomew useofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT sherynbrodie useofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings
AT linschwarzkopf useofbirdnetembeddingsasafastsolutiontofindnovelsoundclassesinaudiorecordings