An ODE based neural network approach for PM2.5 forecasting

Abstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate p...

Full description

Saved in:

Bibliographic Details
Main Authors:	Md Khalid Hossen, Yan-Tsung Peng, Asher Shao, Meng Chang Chen
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-07-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-025-05958-2
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849344088510300160
author	Md Khalid Hossen Yan-Tsung Peng Asher Shao Meng Chang Chen
author_facet	Md Khalid Hossen Yan-Tsung Peng Asher Shao Meng Chang Chen
author_sort	Md Khalid Hossen
collection	DOAJ
description	Abstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics–regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics–ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.
format	Article
id	doaj-art-d3885a34d6304cc18c8f4d808bd7dcaa
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-07-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-d3885a34d6304cc18c8f4d808bd7dcaa2025-08-20T03:42:45ZengNature PortfolioScientific Reports2045-23222025-07-0115112210.1038/s41598-025-05958-2An ODE based neural network approach for PM2.5 forecastingMd Khalid Hossen0Yan-Tsung Peng1Asher Shao2Meng Chang Chen3Social Networks and Human-Centered Computing, TIGP, Academia SinicaDepartment of Computer Science, National Chengchi UniversityResearch Center for Information Technology Innovation, Academia SinicaResearch Center for Information Technology Innovation, Academia SinicaAbstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics–regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics–ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.https://doi.org/10.1038/s41598-025-05958-2
spellingShingle	Md Khalid Hossen Yan-Tsung Peng Asher Shao Meng Chang Chen An ODE based neural network approach for PM2.5 forecasting Scientific Reports
title	An ODE based neural network approach for PM2.5 forecasting
title_full	An ODE based neural network approach for PM2.5 forecasting
title_fullStr	An ODE based neural network approach for PM2.5 forecasting
title_full_unstemmed	An ODE based neural network approach for PM2.5 forecasting
title_short	An ODE based neural network approach for PM2.5 forecasting
title_sort	ode based neural network approach for pm2 5 forecasting
url	https://doi.org/10.1038/s41598-025-05958-2
work_keys_str_mv	AT mdkhalidhossen anodebasedneuralnetworkapproachforpm25forecasting AT yantsungpeng anodebasedneuralnetworkapproachforpm25forecasting AT ashershao anodebasedneuralnetworkapproachforpm25forecasting AT mengchangchen anodebasedneuralnetworkapproachforpm25forecasting AT mdkhalidhossen odebasedneuralnetworkapproachforpm25forecasting AT yantsungpeng odebasedneuralnetworkapproachforpm25forecasting AT ashershao odebasedneuralnetworkapproachforpm25forecasting AT mengchangchen odebasedneuralnetworkapproachforpm25forecasting

An ODE based neural network approach for PM2.5 forecasting

Similar Items