CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea

Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set o...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chaehee An, Minyoung Lee, Eunil Park
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-08-01
Series:	Scientific Reports
Subjects:	CRxK Dataset Crimes Surveillance
Online Access:	https://doi.org/10.1038/s41598-025-15058-w
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849237791540510720
author	Chaehee An Minyoung Lee Eunil Park
author_facet	Chaehee An Minyoung Lee Eunil Park
author_sort	Chaehee An
collection	DOAJ
description	Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.
format	Article
id	doaj-art-8da5d86c16e843d19a7fe5f33e8e1b05
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-08-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-8da5d86c16e843d19a7fe5f33e8e1b052025-08-20T04:01:52ZengNature PortfolioScientific Reports2045-23222025-08-0115111410.1038/s41598-025-15058-wCRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in KoreaChaehee An0Minyoung Lee1Eunil Park2Department of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityAbstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.https://doi.org/10.1038/s41598-025-15058-wCRxK DatasetCrimesSurveillance
spellingShingle	Chaehee An Minyoung Lee Eunil Park CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea Scientific Reports CRxK Dataset Crimes Surveillance
title	CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_full	CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_fullStr	CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_full_unstemmed	CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_short	CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
title_sort	crxk dataset a multi view surveillance video dataset for re enacted crimes in korea
topic	CRxK Dataset Crimes Surveillance
url	https://doi.org/10.1038/s41598-025-15058-w
work_keys_str_mv	AT chaeheean crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea AT minyounglee crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea AT eunilpark crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea

CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea

Similar Items