Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data
The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiv...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Pensoft Publishers
2025-01-01
|
Series: | Biodiversity Data Journal |
Subjects: | |
Online Access: | https://bdj.pensoft.net/article/138257/download/pdf/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The standardisation and correction of taxonomic names in large biodiversity databases remain persistent challenges for researchers, as errors in species names can compromise ecological analyses, land-use planning and conservation efforts, particularly when inaccurate data are shared on global biodiversity portals.We present pytaxon, a Python software designed to resolve and correct taxonomic names in biodiversity data by leveraging the Global Names Verifier (GNV) API and employing fuzzy matching techniques to suggest corrections for discrepancies and nomenclatural inconsistencies. The pytaxon offers both a Command Line Interface (CLI) and a Graphical User Interface (GUI), ensuring accessibility to users with different levels of computing expertise. Tests on spreadsheets derived from datasets published in the Global Biodiversity Information Facility (GBIF) demonstrated its effectiveness in identifying and resolving taxonomic errors. By mitigating the propagation of inaccuracies from researchers' datasets to global biodiversity databases, pytaxon supports more reliable conservation decisions and robust scientific investigations. Its contributions enhance data integrity and promote informed biodiversity management in a rapidly evolving global environment. |
---|---|
ISSN: | 1314-2828 |