CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea
Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set o...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-08-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-15058-w |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849237791540510720 |
|---|---|
| author | Chaehee An Minyoung Lee Eunil Park |
| author_facet | Chaehee An Minyoung Lee Eunil Park |
| author_sort | Chaehee An |
| collection | DOAJ |
| description | Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model. |
| format | Article |
| id | doaj-art-8da5d86c16e843d19a7fe5f33e8e1b05 |
| institution | Kabale University |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-08-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-8da5d86c16e843d19a7fe5f33e8e1b052025-08-20T04:01:52ZengNature PortfolioScientific Reports2045-23222025-08-0115111410.1038/s41598-025-15058-wCRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in KoreaChaehee An0Minyoung Lee1Eunil Park2Department of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityDepartment of Applied Artificial Intelligence, Sungkyunkwan UniversityAbstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.https://doi.org/10.1038/s41598-025-15058-wCRxK DatasetCrimesSurveillance |
| spellingShingle | Chaehee An Minyoung Lee Eunil Park CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea Scientific Reports CRxK Dataset Crimes Surveillance |
| title | CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea |
| title_full | CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea |
| title_fullStr | CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea |
| title_full_unstemmed | CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea |
| title_short | CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea |
| title_sort | crxk dataset a multi view surveillance video dataset for re enacted crimes in korea |
| topic | CRxK Dataset Crimes Surveillance |
| url | https://doi.org/10.1038/s41598-025-15058-w |
| work_keys_str_mv | AT chaeheean crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea AT minyounglee crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea AT eunilpark crxkdatasetamultiviewsurveillancevideodatasetforreenactedcrimesinkorea |