Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data

The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiv...

Full description

Saved in:
Bibliographic Details
Main Authors: Marco Proença Neto, Marcos De Sousa
Format: Article
Language:English
Published: Pensoft Publishers 2025-01-01
Series:Biodiversity Data Journal
Subjects:
Online Access:https://bdj.pensoft.net/article/138257/download/pdf/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841550116987600896
author Marco Proença Neto
Marcos De Sousa
author_facet Marco Proença Neto
Marcos De Sousa
author_sort Marco Proença Neto
collection DOAJ
description The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiversity portals.We present pytaxon, a Python software designed to resolve and correct taxonomic names in biodiversity data by leveraging the Global Names Verifier (GNV) API and employing fuzzy matching techniques to suggest corrections for discrepancies and nomenclatural inconsistencies. The pytaxon offers both a Command Line Interface (CLI) and a Graphical User Interface (GUI), ensuring accessibility to users with different levels of computing expertise. Tests on spreadsheets derived from datasets published in the Global Biodiversity Information Facility (GBIF) demonstrated its effectiveness in identifying and resolving taxonomic errors. By mitigating the propagation of inaccuracies from researchers' datasets to global biodiversity databases, pytaxon supports more reliable conservation decisions and robust scientific investigations. Its contributions enhance data integrity and promote informed biodiversity management in a rapidly evolving global environment.
format Article
id doaj-art-84a3810b20c14526b5b1d5e4a3dcb540
institution Kabale University
issn 1314-2828
language English
publishDate 2025-01-01
publisher Pensoft Publishers
record_format Article
series Biodiversity Data Journal
spelling doaj-art-84a3810b20c14526b5b1d5e4a3dcb5402025-01-10T08:30:30ZengPensoft PublishersBiodiversity Data Journal1314-28282025-01-011311410.3897/BDJ.13.e138257138257Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity dataMarco Proença Neto0Marcos De Sousa1Centro Universitário do Estado do ParáCentro Universitário do Estado do ParáThe standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiversity portals.We present pytaxon, a Python software designed to resolve and correct taxonomic names in biodiversity data by leveraging the Global Names Verifier (GNV) API and employing fuzzy matching techniques to suggest corrections for discrepancies and nomenclatural inconsistencies. The pytaxon offers both a Command Line Interface (CLI) and a Graphical User Interface (GUI), ensuring accessibility to users with different levels of computing expertise. Tests on spreadsheets derived from datasets published in the Global Biodiversity Information Facility (GBIF) demonstrated its effectiveness in identifying and resolving taxonomic errors. By mitigating the propagation of inaccuracies from researchers' datasets to global biodiversity databases, pytaxon supports more reliable conservation decisions and robust scientific investigations. Its contributions enhance data integrity and promote informed biodiversity management in a rapidly evolving global environment.https://bdj.pensoft.net/article/138257/download/pdf/biodiversity informaticstaxonomyscientific nam
spellingShingle Marco Proença Neto
Marcos De Sousa
Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
Biodiversity Data Journal
biodiversity informatics
taxonomy
scientific nam
title Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
title_full Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
title_fullStr Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
title_full_unstemmed Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
title_short Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
title_sort pytaxon a python software for resolving and correcting taxonomic names in biodiversity data
topic biodiversity informatics
taxonomy
scientific nam
url https://bdj.pensoft.net/article/138257/download/pdf/
work_keys_str_mv AT marcoproencaneto pytaxonapythonsoftwareforresolvingandcorrectingtaxonomicnamesinbiodiversitydata
AT marcosdesousa pytaxonapythonsoftwareforresolvingandcorrectingtaxonomicnamesinbiodiversitydata