A multi-species benchmark for training and validating mass spectrometry proteomics machine learning models

Abstract Training machine learning models for tasks such as de novo sequencing or spectral clustering requires large collections of confidently identified spectra. Here we describe a dataset of 2.8 million high-confidence peptide-spectrum matches derived from nine different species. The dataset is b...

Full description

Saved in:
Bibliographic Details
Main Authors: Bo Wen, William Stafford Noble
Format: Article
Language:English
Published: Nature Portfolio 2024-11-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-024-04068-4
Tags: Add Tag
No Tags, Be the first to tag this record!