A Sentinel-2 machine learning dataset for tree species classification in Germany

<p>We present a machine learning dataset for tree species classification in Sentinel-2 satellite image time series of bottom-of-atmosphere reflectance. It is geared towards training classifiers but is less suitable for validating the resulting maps. The dataset is based on the German National...

Full description

Saved in:
Bibliographic Details
Main Authors: M. Freudenberg, S. Schnell, P. Magdon
Format: Article
Language:English
Published: Copernicus Publications 2025-02-01
Series:Earth System Science Data
Online Access:https://essd.copernicus.org/articles/17/351/2025/essd-17-351-2025.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841524367922561024
author M. Freudenberg
M. Freudenberg
S. Schnell
P. Magdon
author_facet M. Freudenberg
M. Freudenberg
S. Schnell
P. Magdon
author_sort M. Freudenberg
collection DOAJ
description <p>We present a machine learning dataset for tree species classification in Sentinel-2 satellite image time series of bottom-of-atmosphere reflectance. It is geared towards training classifiers but is less suitable for validating the resulting maps. The dataset is based on the German National Forest Inventory of 2012 as well as analysis-ready satellite imagery computed using the Framework for Operational Radiometric Correction for Environmental monitoring (FORCE) processing pipeline. From the National Forest Inventory data, we extracted the tree positions, filtered 387 775 trees in the upper canopy layer, and automatically extracted the corresponding bottom-of-atmosphere reflectance time series from Sentinel-2 L2A images. These time series are labeled with the corresponding tree species, which allows pixel-wise classification tasks. Furthermore, we provide auxiliary information such as the approximate tree position, the year of possible disturbance events, or the diameter at breast height. Temporally, the dataset spans the years from July 2015 to the end of October 2022, with approx. 75.3 million data points for trees of 48 species and 3 species groups as well as 13.8 million observations for non-tree backgrounds. Spatially, it covers the whole of Germany. The dataset is available at the following DOI <span class="cit" id="xref_paren.1">(<a href="#bib1.bibx9">Freudenberg et al.</a>, <a href="#bib1.bibx9">2024</a>)</span>: <a href="https://doi.org/10.3220/DATA20240402122351-0">https://doi.org/10.3220/DATA20240402122351-0</a>.</p>
format Article
id doaj-art-99140157d3194555a76588369c16a2eb
institution Kabale University
issn 1866-3508
1866-3516
language English
publishDate 2025-02-01
publisher Copernicus Publications
record_format Article
series Earth System Science Data
spelling doaj-art-99140157d3194555a76588369c16a2eb2025-02-03T07:15:38ZengCopernicus PublicationsEarth System Science Data1866-35081866-35162025-02-011735136710.5194/essd-17-351-2025A Sentinel-2 machine learning dataset for tree species classification in GermanyM. Freudenberg0M. Freudenberg1S. Schnell2P. Magdon3Forest Inventory and Remote Sensing, University of Göttingen, Göttingen, GermanyNeural Data Science Group, University of Göttingen, Göttingen, GermanyThünen Institute of Forest Ecosystems, Eberswalde, GermanyFaculty of Resource Management, University of Applied Sciences and Arts (HAWK), Göttingen, Germany<p>We present a machine learning dataset for tree species classification in Sentinel-2 satellite image time series of bottom-of-atmosphere reflectance. It is geared towards training classifiers but is less suitable for validating the resulting maps. The dataset is based on the German National Forest Inventory of 2012 as well as analysis-ready satellite imagery computed using the Framework for Operational Radiometric Correction for Environmental monitoring (FORCE) processing pipeline. From the National Forest Inventory data, we extracted the tree positions, filtered 387 775 trees in the upper canopy layer, and automatically extracted the corresponding bottom-of-atmosphere reflectance time series from Sentinel-2 L2A images. These time series are labeled with the corresponding tree species, which allows pixel-wise classification tasks. Furthermore, we provide auxiliary information such as the approximate tree position, the year of possible disturbance events, or the diameter at breast height. Temporally, the dataset spans the years from July 2015 to the end of October 2022, with approx. 75.3 million data points for trees of 48 species and 3 species groups as well as 13.8 million observations for non-tree backgrounds. Spatially, it covers the whole of Germany. The dataset is available at the following DOI <span class="cit" id="xref_paren.1">(<a href="#bib1.bibx9">Freudenberg et al.</a>, <a href="#bib1.bibx9">2024</a>)</span>: <a href="https://doi.org/10.3220/DATA20240402122351-0">https://doi.org/10.3220/DATA20240402122351-0</a>.</p>https://essd.copernicus.org/articles/17/351/2025/essd-17-351-2025.pdf
spellingShingle M. Freudenberg
M. Freudenberg
S. Schnell
P. Magdon
A Sentinel-2 machine learning dataset for tree species classification in Germany
Earth System Science Data
title A Sentinel-2 machine learning dataset for tree species classification in Germany
title_full A Sentinel-2 machine learning dataset for tree species classification in Germany
title_fullStr A Sentinel-2 machine learning dataset for tree species classification in Germany
title_full_unstemmed A Sentinel-2 machine learning dataset for tree species classification in Germany
title_short A Sentinel-2 machine learning dataset for tree species classification in Germany
title_sort sentinel 2 machine learning dataset for tree species classification in germany
url https://essd.copernicus.org/articles/17/351/2025/essd-17-351-2025.pdf
work_keys_str_mv AT mfreudenberg asentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT mfreudenberg asentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT sschnell asentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT pmagdon asentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT mfreudenberg sentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT mfreudenberg sentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT sschnell sentinel2machinelearningdatasetfortreespeciesclassificationingermany
AT pmagdon sentinel2machinelearningdatasetfortreespeciesclassificationingermany