Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiv...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Pensoft Publishers
2025-01-01
|
Series: | Biodiversity Data Journal |
Subjects: | |
Online Access: | https://bdj.pensoft.net/article/138257/download/pdf/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841550116987600896 |
---|---|
author | Marco Proença Neto Marcos De Sousa |
author_facet | Marco Proença Neto Marcos De Sousa |
author_sort | Marco Proença Neto |
collection | DOAJ |
description | The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiversity portals.We present pytaxon, a Python software designed to resolve and correct taxonomic names in biodiversity data by leveraging the Global Names Verifier (GNV) API and employing fuzzy matching techniques to suggest corrections for discrepancies and nomenclatural inconsistencies. The pytaxon offers both a Command Line Interface (CLI) and a Graphical User Interface (GUI), ensuring accessibility to users with different levels of computing expertise. Tests on spreadsheets derived from datasets published in the Global Biodiversity Information Facility (GBIF) demonstrated its effectiveness in identifying and resolving taxonomic errors. By mitigating the propagation of inaccuracies from researchers' datasets to global biodiversity databases, pytaxon supports more reliable conservation decisions and robust scientific investigations. Its contributions enhance data integrity and promote informed biodiversity management in a rapidly evolving global environment. |
format | Article |
id | doaj-art-84a3810b20c14526b5b1d5e4a3dcb540 |
institution | Kabale University |
issn | 1314-2828 |
language | English |
publishDate | 2025-01-01 |
publisher | Pensoft Publishers |
record_format | Article |
series | Biodiversity Data Journal |
spelling | doaj-art-84a3810b20c14526b5b1d5e4a3dcb5402025-01-10T08:30:30ZengPensoft PublishersBiodiversity Data Journal1314-28282025-01-011311410.3897/BDJ.13.e138257138257Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity dataMarco Proença Neto0Marcos De Sousa1Centro Universitário do Estado do ParáCentro Universitário do Estado do ParáThe standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiversity portals.We present pytaxon, a Python software designed to resolve and correct taxonomic names in biodiversity data by leveraging the Global Names Verifier (GNV) API and employing fuzzy matching techniques to suggest corrections for discrepancies and nomenclatural inconsistencies. The pytaxon offers both a Command Line Interface (CLI) and a Graphical User Interface (GUI), ensuring accessibility to users with different levels of computing expertise. Tests on spreadsheets derived from datasets published in the Global Biodiversity Information Facility (GBIF) demonstrated its effectiveness in identifying and resolving taxonomic errors. By mitigating the propagation of inaccuracies from researchers' datasets to global biodiversity databases, pytaxon supports more reliable conservation decisions and robust scientific investigations. Its contributions enhance data integrity and promote informed biodiversity management in a rapidly evolving global environment.https://bdj.pensoft.net/article/138257/download/pdf/biodiversity informaticstaxonomyscientific nam |
spellingShingle | Marco Proença Neto Marcos De Sousa Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data Biodiversity Data Journal biodiversity informatics taxonomy scientific nam |
title | Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data |
title_full | Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data |
title_fullStr | Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data |
title_full_unstemmed | Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data |
title_short | Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data |
title_sort | pytaxon a python software for resolving and correcting taxonomic names in biodiversity data |
topic | biodiversity informatics taxonomy scientific nam |
url | https://bdj.pensoft.net/article/138257/download/pdf/ |
work_keys_str_mv | AT marcoproencaneto pytaxonapythonsoftwareforresolvingandcorrectingtaxonomicnamesinbiodiversitydata AT marcosdesousa pytaxonapythonsoftwareforresolvingandcorrectingtaxonomicnamesinbiodiversitydata |