CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea

Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set o...

Full description

Saved in:
Bibliographic Details
Main Authors: Chaehee An, Minyoung Lee, Eunil Park
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-15058-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849237791540510720
author Chaehee An
Minyoung Lee
Eunil Park
author_facet Chaehee An
Minyoung Lee
Eunil Park
author_sort Chaehee An
collection DOAJ
description Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.
format Article
id doaj-art-8da5d86c16e843d19a7fe5f33e8e1b05
institution Kabale University
issn 2045-2322
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-8da5d86c16e843d19a7fe5f33e8e1b052025-08-20T04:01:52ZengNature PortfolioScientific Reports2045-23222025-08-0115111410.1038/s41598-025-15058-wCRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in KoreaChaehee An0Minyoung Lee1Eunil Park2Department of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityAbstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.https://doi.org/10.1038/s41598-025-15058-wCRxK DatasetCrimesSurveillance
spellingShingle Chaehee An
Minyoung Lee
Eunil Park
CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
Scientific Reports
CRxK Dataset
Crimes
Surveillance
title CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_full CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_fullStr CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_full_unstemmed CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_short CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_sort crxk dataset a multi view surveillance video dataset for re enacted crimes in korea
topic CRxK Dataset
Crimes
Surveillance
url https://doi.org/10.1038/s41598-025-15058-w
work_keys_str_mv AT chaeheean crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea
AT minyounglee crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea
AT eunilpark crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea