Information extraction from green channel textual records on expressways using hybrid deep learning

Abstract The expressway green channel is an essential transportation policy for moving fresh agricultural products in China. In order to extract knowledge from various records, this study presents a cutting-edge approach to extract information from textual records of failure cases in the vertical fi...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiaona Chen, Jing Zhang, Weijun Tao, Yinli Jin, Heng Fan
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-82681-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846101296828907520
author Jiaona Chen
Jing Zhang
Weijun Tao
Yinli Jin
Heng Fan
author_facet Jiaona Chen
Jing Zhang
Weijun Tao
Yinli Jin
Heng Fan
author_sort Jiaona Chen
collection DOAJ
description Abstract The expressway green channel is an essential transportation policy for moving fresh agricultural products in China. In order to extract knowledge from various records, this study presents a cutting-edge approach to extract information from textual records of failure cases in the vertical field of expressway green channel. We proposed a hybrid approach based on BIO labeling, pre-trained model, deep learning and CRF to build a named entity recognition (NER) model with the optimal prediction performance. Eight entities are designed and proposed in the NER processing for the expressway green channel. three typical pre-trained natural language processing models are utilized and compared to recognize entities and obtain feature vectors, including bidirectional encoder representations from transformer (BERT), ALBERT, and RoBERTa. An ablation experiment is performed to analyze the influence of each factor on the proposed models. Used the survey data from the expressway green channel management system in Shaanxi Province of China, the experimental results show that the precision, recall, and F1-score of the RoBERTa-BiGRU-CRF model are 93.04%, 92.99%, and 92.99%, respectively. As the results, it is discovered that the text features extracted from pre-training substantially enhance the prediction accuracy of deep learning algorithms. Surprisingly, the RoBERTa model is highly effective in the task for the expressway green channel NER. This study provides a timely and necessary knowledge extraction on the Expressway Green Channel in terms of textual data, offering a systematical explanation of failure cases and valuable insights for future research.
format Article
id doaj-art-d6a08ce1c0ef4a58a37d6f631f727b03
institution Kabale University
issn 2045-2322
language English
publishDate 2024-12-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-d6a08ce1c0ef4a58a37d6f631f727b032024-12-29T12:18:26ZengNature PortfolioScientific Reports2045-23222024-12-0114111510.1038/s41598-024-82681-4Information extraction from green channel textual records on expressways using hybrid deep learningJiaona Chen0Jing Zhang1Weijun Tao2Yinli Jin3Heng Fan4Xi’an Shiyou University School of Electronic EngineeringXi’an Shiyou University School of Electronic EngineeringXi’an Shiyou University School of Electronic EngineeringChang’an University School of Electronic and Control EngineeringXi’an Shiyou University School of Electronic EngineeringAbstract The expressway green channel is an essential transportation policy for moving fresh agricultural products in China. In order to extract knowledge from various records, this study presents a cutting-edge approach to extract information from textual records of failure cases in the vertical field of expressway green channel. We proposed a hybrid approach based on BIO labeling, pre-trained model, deep learning and CRF to build a named entity recognition (NER) model with the optimal prediction performance. Eight entities are designed and proposed in the NER processing for the expressway green channel. three typical pre-trained natural language processing models are utilized and compared to recognize entities and obtain feature vectors, including bidirectional encoder representations from transformer (BERT), ALBERT, and RoBERTa. An ablation experiment is performed to analyze the influence of each factor on the proposed models. Used the survey data from the expressway green channel management system in Shaanxi Province of China, the experimental results show that the precision, recall, and F1-score of the RoBERTa-BiGRU-CRF model are 93.04%, 92.99%, and 92.99%, respectively. As the results, it is discovered that the text features extracted from pre-training substantially enhance the prediction accuracy of deep learning algorithms. Surprisingly, the RoBERTa model is highly effective in the task for the expressway green channel NER. This study provides a timely and necessary knowledge extraction on the Expressway Green Channel in terms of textual data, offering a systematical explanation of failure cases and valuable insights for future research.https://doi.org/10.1038/s41598-024-82681-4Expressway green channelNamed entity recognitionBIO labelingPre-trained modelDeep learning
spellingShingle Jiaona Chen
Jing Zhang
Weijun Tao
Yinli Jin
Heng Fan
Information extraction from green channel textual records on expressways using hybrid deep learning
Scientific Reports
Expressway green channel
Named entity recognition
BIO labeling
Pre-trained model
Deep learning
title Information extraction from green channel textual records on expressways using hybrid deep learning
title_full Information extraction from green channel textual records on expressways using hybrid deep learning
title_fullStr Information extraction from green channel textual records on expressways using hybrid deep learning
title_full_unstemmed Information extraction from green channel textual records on expressways using hybrid deep learning
title_short Information extraction from green channel textual records on expressways using hybrid deep learning
title_sort information extraction from green channel textual records on expressways using hybrid deep learning
topic Expressway green channel
Named entity recognition
BIO labeling
Pre-trained model
Deep learning
url https://doi.org/10.1038/s41598-024-82681-4
work_keys_str_mv AT jiaonachen informationextractionfromgreenchanneltextualrecordsonexpresswaysusinghybriddeeplearning
AT jingzhang informationextractionfromgreenchanneltextualrecordsonexpresswaysusinghybriddeeplearning
AT weijuntao informationextractionfromgreenchanneltextualrecordsonexpresswaysusinghybriddeeplearning
AT yinlijin informationextractionfromgreenchanneltextualrecordsonexpresswaysusinghybriddeeplearning
AT hengfan informationextractionfromgreenchanneltextualrecordsonexpresswaysusinghybriddeeplearning