STAR Drums: A Dataset for Automatic Drum Transcription

Current state‑of‑the‑art automatic drum transcription (ADT) algorithms make use of neural networks. To train such models, large amounts of annotated data are needed. We introduce the Separate–Tracks–Annotate–Resynthesize Drums (STAR Drums) dataset, derived from full audio recordings that include mix...

Full description

Saved in:

Bibliographic Details
Main Authors:	Philipp Weber, Christian Uhle, Meinard Müller, Matthias Lang
Format:	Article
Language:	English
Published:	Ubiquity Press 2025-07-01
Series:	Transactions of the International Society for Music Information Retrieval
Subjects:	automatic drum transcription automatic music transcription dataset audio
Online Access:	https://account.transactions.ismir.net/index.php/up-j-tismir/article/view/244
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849230028309528576
author	Philipp Weber Christian Uhle Meinard Müller Matthias Lang
author_facet	Philipp Weber Christian Uhle Meinard Müller Matthias Lang
author_sort	Philipp Weber
collection	DOAJ
description	Current state‑of‑the‑art automatic drum transcription (ADT) algorithms make use of neural networks. To train such models, large amounts of annotated data are needed. We introduce the Separate–Tracks–Annotate–Resynthesize Drums (STAR Drums) dataset, derived from full audio recordings that include mixtures of drum instruments, melodic instruments, and vocals. First, we separate the music recordings into a drum stem and a non‑drum stem by applying a music source separation algorithm, then automatically annotate the drum stem with an ADT algorithm. The annotations are used for the re‑synthesis of the drum stem using sample‑based virtual drum instruments. Finally, we mix the re‑synthesized drum stem with the original non‑drum stem to obtain the final mix. In summary, STAR Drums includes annotated synthesized drum sounds mixed with real recordings of melodic instruments and vocals, offering several benefits: high temporal accuracy of annotations; training data that include recordings of instruments played by musicians, rather than solely relying on MIDI‑rendered audio; a large number of supported drum classes; the possibility to customize the final mix by, for instance, applying additional processing to the drum stem, as both drum and non‑drum stems are provided; and suitable licenses of audio files for making the dataset fully available to the research community. We demonstrate that, in the context of ADT, training with STAR Drums achieves superior performance compared to training with datasets solely relying on MIDI‑rendered data and that the synthesized nature of the drum stem does not diminish performance.
format	Article
id	doaj-art-902bc4a6d296480699c8993b4b0e460b
institution	Kabale University
issn	2514-3298
language	English
publishDate	2025-07-01
publisher	Ubiquity Press
record_format	Article
series	Transactions of the International Society for Music Information Retrieval
spelling	doaj-art-902bc4a6d296480699c8993b4b0e460b2025-08-21T12:49:42ZengUbiquity PressTransactions of the International Society for Music Information Retrieval2514-32982025-07-0181248–264248–26410.5334/tismir.244244STAR Drums: A Dataset for Automatic Drum TranscriptionPhilipp Weber0Christian Uhle1Meinard Müller2Matthias Lang3Fraunhofer Institute for Integrated Circuits (IIS), ErlangenFraunhofer Institute for Integrated Circuits (IIS), Erlangen; International Audio Laboratories ErlangenFraunhofer Institute for Integrated Circuits (IIS), Erlangen; International Audio Laboratories ErlangenFraunhofer Institute for Integrated Circuits (IIS), ErlangenCurrent state‑of‑the‑art automatic drum transcription (ADT) algorithms make use of neural networks. To train such models, large amounts of annotated data are needed. We introduce the Separate–Tracks–Annotate–Resynthesize Drums (STAR Drums) dataset, derived from full audio recordings that include mixtures of drum instruments, melodic instruments, and vocals. First, we separate the music recordings into a drum stem and a non‑drum stem by applying a music source separation algorithm, then automatically annotate the drum stem with an ADT algorithm. The annotations are used for the re‑synthesis of the drum stem using sample‑based virtual drum instruments. Finally, we mix the re‑synthesized drum stem with the original non‑drum stem to obtain the final mix. In summary, STAR Drums includes annotated synthesized drum sounds mixed with real recordings of melodic instruments and vocals, offering several benefits: high temporal accuracy of annotations; training data that include recordings of instruments played by musicians, rather than solely relying on MIDI‑rendered audio; a large number of supported drum classes; the possibility to customize the final mix by, for instance, applying additional processing to the drum stem, as both drum and non‑drum stems are provided; and suitable licenses of audio files for making the dataset fully available to the research community. We demonstrate that, in the context of ADT, training with STAR Drums achieves superior performance compared to training with datasets solely relying on MIDI‑rendered data and that the synthesized nature of the drum stem does not diminish performance.https://account.transactions.ismir.net/index.php/up-j-tismir/article/view/244automatic drum transcriptionautomatic music transcriptiondatasetaudio
spellingShingle	Philipp Weber Christian Uhle Meinard Müller Matthias Lang STAR Drums: A Dataset for Automatic Drum Transcription Transactions of the International Society for Music Information Retrieval automatic drum transcription automatic music transcription dataset audio
title	STAR Drums: A Dataset for Automatic Drum Transcription
title_full	STAR Drums: A Dataset for Automatic Drum Transcription
title_fullStr	STAR Drums: A Dataset for Automatic Drum Transcription
title_full_unstemmed	STAR Drums: A Dataset for Automatic Drum Transcription
title_short	STAR Drums: A Dataset for Automatic Drum Transcription
title_sort	star drums a dataset for automatic drum transcription
topic	automatic drum transcription automatic music transcription dataset audio
url	https://account.transactions.ismir.net/index.php/up-j-tismir/article/view/244
work_keys_str_mv	AT philippweber stardrumsadatasetforautomaticdrumtranscription AT christianuhle stardrumsadatasetforautomaticdrumtranscription AT meinardmuller stardrumsadatasetforautomaticdrumtranscription AT matthiaslang stardrumsadatasetforautomaticdrumtranscription

STAR Drums: A Dataset for Automatic Drum Transcription

Similar Items