Multi-level lag scheme significantly improves training efficiency in deep learning: a case study in air quality alert service over sub-tropical area

Abstract In environmental monitoring, deep learning models take as input either past observations or extrapolated values that carry high uncertainty. A lag scheme is commonly applied when building deep learning models for multivar...

Bibliographic Details
Main Authors: Benedito Chi Man Tam, Su-Kit Tang, Alberto Cardoso
Format: Article
Language: English
Published: SpringerOpen 2025-01-01
Series: Journal of Big Data
Subjects: Air quality forecasting; Multi-level lag scheme; Multivariate; Deep learning
Online Access: https://doi.org/10.1186/s40537-024-01043-z
_version_ 1841544588482838528
author Benedito Chi Man Tam
Su-Kit Tang
Alberto Cardoso
collection DOAJ
description Abstract In environmental monitoring, deep learning models take as input either past observations or extrapolated values that carry high uncertainty. A lag scheme is commonly applied when building deep learning models for multivariate time series (MTS) prediction. For adaptive feature engineering, an automated lag scheme is essential to improving training efficiency. In MTS models, the predictive accuracy of artificial neural network (ANN) type models can be improved by including more features. We assume that, among a given set of multivariate features, the timeliness and lag time of the mutual influence differ from one pair of features to another. This research aims to address this with an adaptive approach, namely a multi-level lag scheme. The research methods include a literature review, a survey of relevant technology frontiers, feasibility studies, solution selection and design, modeling, data collection and pre-processing, experiments, evaluation, comprehensive analysis, and conclusions. As a proof of concept, we demonstrate a practical case of a seasonal ANN-type MTS model and a public air quality service. ANN-type models were tested, with ARIMA as the comparison baseline. We used a public data set from more than two base stations, with pollution levels ranging from low to high and covering the southern to northern districts of a small city. Analysis of multiple experimental results shows that the proposed solution effectively improves the training efficiency of the model. This matters because most such models can then be implemented to adaptively use lagged past measurements as input, rather than synchronously feeding in predicted future values, which can greatly improve the model's practical predictive ability.
format Article
id doaj-art-6182438a89864a2db36d2c93bc122b1e
institution Kabale University
issn 2196-1115
language English
publishDate 2025-01-01
publisher SpringerOpen
record_format Article
series Journal of Big Data
spelling doaj-art-6182438a89864a2db36d2c93bc122b1e; 2025-01-12T12:27:12Z; eng; SpringerOpen; Journal of Big Data; 2196-1115; 2025-01-01; 12; 1; 1; 29; 10.1186/s40537-024-01043-z; Multi-level lag scheme significantly improves training efficiency in deep learning: a case study in air quality alert service over sub-tropical area; Benedito Chi Man Tam (Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra (CISUC), University of Coimbra); Su-Kit Tang (Faculty of Applied Sciences, Macao Polytechnic University); Alberto Cardoso (Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra (CISUC), University of Coimbra); https://doi.org/10.1186/s40537-024-01043-z; Air quality forecasting; Multi-level lag scheme; Multivariate; Deep learning
title Multi-level lag scheme significantly improves training efficiency in deep learning: a case study in air quality alert service over sub-tropical area
topic Air quality forecasting
Multi-level lag scheme
Multivariate
Deep learning
url https://doi.org/10.1186/s40537-024-01043-z