A file archival integrity check method based on the BiLSTM + CNN model and deep learning

Validating and integrity-checking archives ensures that files are authentic, trustworthy, and usable. In the age of digital technology, historical records must be genuine. Researching in archives raises ethical issues while having little to do with individuals. Traditional archive integrity solution...

Full description

Saved in:
Bibliographic Details
Main Authors: Jinxun Li, Tingjun Wang, Chao Ma, Yunxuan Lin, Qing Yan
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Egyptian Informatics Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110866524001609
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841545996286296064
author Jinxun Li
Tingjun Wang
Chao Ma
Yunxuan Lin
Qing Yan
author_facet Jinxun Li
Tingjun Wang
Chao Ma
Yunxuan Lin
Qing Yan
author_sort Jinxun Li
collection DOAJ
description Validating and integrity-checking archives ensures that files are authentic, trustworthy, and usable. In the age of digital technology, historical records must be genuine. Researching in archives raises ethical issues while having little to do with individuals. Traditional archive integrity solutions have scaling issues, real-time monitoring issues, and missed opportunities. An updated Archive File Integrity Check Method (AFICM) may solve these issues, and the paper explains it. Deep learning allows the combination of a Bidirectional Long-Short Term Memory (Bi-LSTM) with adaptive gating and an adaptive Temporal Convolutional Neural Network (TCNN) with multi-scale temporal attention. This method protects archived material against manipulation, which is crucial. The recommended method extracts complex sequential patterns and variants using adaptive TCNN trained on file data. Next, it analyzes these features using a Bi-LSTM network and attenuation method. It allows it to highlight significant temporal correlations while downplaying irrelevant data selectively. The hybrid model outperforms checksums in accuracy and dependability. It uses adaptive TCNNs for time-related feature extraction and attenuated Bi-LSTM for refinement. The F1 score, recall, accuracy, precision, and AU-ROC are critical measures for model evaluation. The AICM performed well overall, with 97.32% precision and 98.95% accuracy. This integrity check method outperforms others with an F1 score of 97.58, an AU-ROC of 0.983, and a recall rate of 98.18%. The findings set a new standard for archiving system integrity testing by showing the model’s dependability and security in several use scenarios.
format Article
id doaj-art-96c5eab8e65340c4b995088eadb0414e
institution Kabale University
issn 1110-8665
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Egyptian Informatics Journal
spelling doaj-art-96c5eab8e65340c4b995088eadb0414e2025-01-11T06:38:37ZengElsevierEgyptian Informatics Journal1110-86652025-03-0129100597A file archival integrity check method based on the BiLSTM + CNN model and deep learningJinxun Li0Tingjun Wang1Chao Ma2Yunxuan Lin3Qing Yan4Corresponding author.; Hainan Power Grid Co., Ltd, Haikou 570311, ChinaHainan Power Grid Co., Ltd, Haikou 570311, ChinaHainan Power Grid Co., Ltd, Haikou 570311, ChinaHainan Power Grid Co., Ltd, Haikou 570311, ChinaHainan Power Grid Co., Ltd, Haikou 570311, ChinaValidating and integrity-checking archives ensures that files are authentic, trustworthy, and usable. In the age of digital technology, historical records must be genuine. Researching in archives raises ethical issues while having little to do with individuals. Traditional archive integrity solutions have scaling issues, real-time monitoring issues, and missed opportunities. An updated Archive File Integrity Check Method (AFICM) may solve these issues, and the paper explains it. Deep learning allows the combination of a Bidirectional Long-Short Term Memory (Bi-LSTM) with adaptive gating and an adaptive Temporal Convolutional Neural Network (TCNN) with multi-scale temporal attention. This method protects archived material against manipulation, which is crucial. The recommended method extracts complex sequential patterns and variants using adaptive TCNN trained on file data. Next, it analyzes these features using a Bi-LSTM network and attenuation method. It allows it to highlight significant temporal correlations while downplaying irrelevant data selectively. The hybrid model outperforms checksums in accuracy and dependability. It uses adaptive TCNNs for time-related feature extraction and attenuated Bi-LSTM for refinement. The F1 score, recall, accuracy, precision, and AU-ROC are critical measures for model evaluation. The AICM performed well overall, with 97.32% precision and 98.95% accuracy. This integrity check method outperforms others with an F1 score of 97.58, an AU-ROC of 0.983, and a recall rate of 98.18%. The findings set a new standard for archiving system integrity testing by showing the model’s dependability and security in several use scenarios.http://www.sciencedirect.com/science/article/pii/S1110866524001609Archival integrity check methodBidirectional long-short term memoryTemporal convolutional neural network
spellingShingle Jinxun Li
Tingjun Wang
Chao Ma
Yunxuan Lin
Qing Yan
A file archival integrity check method based on the BiLSTM + CNN model and deep learning
Egyptian Informatics Journal
Archival integrity check method
Bidirectional long-short term memory
Temporal convolutional neural network
title A file archival integrity check method based on the BiLSTM + CNN model and deep learning
title_full A file archival integrity check method based on the BiLSTM + CNN model and deep learning
title_fullStr A file archival integrity check method based on the BiLSTM + CNN model and deep learning
title_full_unstemmed A file archival integrity check method based on the BiLSTM + CNN model and deep learning
title_short A file archival integrity check method based on the BiLSTM + CNN model and deep learning
title_sort file archival integrity check method based on the bilstm cnn model and deep learning
topic Archival integrity check method
Bidirectional long-short term memory
Temporal convolutional neural network
url http://www.sciencedirect.com/science/article/pii/S1110866524001609
work_keys_str_mv AT jinxunli afilearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT tingjunwang afilearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT chaoma afilearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT yunxuanlin afilearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT qingyan afilearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT jinxunli filearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT tingjunwang filearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT chaoma filearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT yunxuanlin filearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning
AT qingyan filearchivalintegritycheckmethodbasedonthebilstmcnnmodelanddeeplearning