An ODE based neural network approach for PM2.5 forecasting

Abstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate p...

Full description

Saved in:
Bibliographic Details
Main Authors: Md Khalid Hossen, Yan-Tsung Peng, Asher Shao, Meng Chang Chen
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-05958-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849344088510300160
author Md Khalid Hossen
Yan-Tsung Peng
Asher Shao
Meng Chang Chen
author_facet Md Khalid Hossen
Yan-Tsung Peng
Asher Shao
Meng Chang Chen
author_sort Md Khalid Hossen
collection DOAJ
description Abstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics–regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics–ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.
format Article
id doaj-art-d3885a34d6304cc18c8f4d808bd7dcaa
institution Kabale University
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-d3885a34d6304cc18c8f4d808bd7dcaa2025-08-20T03:42:45ZengNature PortfolioScientific Reports2045-23222025-07-0115112210.1038/s41598-025-05958-2An ODE based neural network approach for PM2.5 forecastingMd Khalid Hossen0Yan-Tsung Peng1Asher Shao2Meng Chang Chen3Social Networks and Human-Centered Computing, TIGP, Academia SinicaDepartment of Computer Science, National Chengchi UniversityResearch Center for Information Technology Innovation, Academia SinicaResearch Center for Information Technology Innovation, Academia SinicaAbstract Predicting time-series data is inherently complex, spurring the development of advanced neural network approaches. Monitoring and predicting PM2.5 levels is especially challenging due to the interplay of diverse natural and anthropogenic factors influencing its dispersion, making accurate predictions both costly and intricate. A key challenge in predicting PM2.5 concentrations lies in its variability, as the data distribution fluctuates significantly over time. Meanwhile, neural networks provide a cost-effective and highly accurate solution in managing such complexities. Deep learning models like Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) have been widely applied to PM2.5 prediction tasks. However, prediction errors increase as the forecasting window expands from 1 to 72 hours, underscoring the rising uncertainty in longer-term predictions. Recurrent Neural Networks (RNNs) with continuous-time hidden states are well-suited for modeling irregularly sampled time series but struggle with long-term dependencies due to gradient vanishing or exploding, as revealed by the ordinary differential equation (ODE) based hidden state dynamics–regardless of the ODE solver used. Continuous-time neural processes, defined by differential equations, are limited by numerical solvers, restricting scalability and hindering the modeling of complex phenomena like neural dynamics–ideally addressed via closed-form solutions. In contrast to ODE-based continuous models, closed-form networks demonstrate superior scalability over traditional deep-learning approaches. As continuous-time neural networks, Neural ODEs excel in modeling the intricate dynamics of time-series data, presenting a robust alternative to traditional LSTM models. We propose two ODE-based models: a transformer-based ODE model and a closed-form ODE model. Empirical evaluations show these models significantly enhance prediction accuracy, with improvements ranging from 2.91 to 14.15% for 1-hour to 8-hour predictions when compared to LSTM-based models. Moreover, after conducting the paired t-test, the RMSE values of the proposed model (CCCFC) were found to be significantly different from those of BILSTM, LSTM, GRU, ODE-LSTM, and PCNN,CNN-LSSTM. This implies that CCCFC demonstrates a distinct performance advantage, reinforcing its effectiveness in hourly PM2.5 forecasting.https://doi.org/10.1038/s41598-025-05958-2
spellingShingle Md Khalid Hossen
Yan-Tsung Peng
Asher Shao
Meng Chang Chen
An ODE based neural network approach for PM2.5 forecasting
Scientific Reports
title An ODE based neural network approach for PM2.5 forecasting
title_full An ODE based neural network approach for PM2.5 forecasting
title_fullStr An ODE based neural network approach for PM2.5 forecasting
title_full_unstemmed An ODE based neural network approach for PM2.5 forecasting
title_short An ODE based neural network approach for PM2.5 forecasting
title_sort ode based neural network approach for pm2 5 forecasting
url https://doi.org/10.1038/s41598-025-05958-2
work_keys_str_mv AT mdkhalidhossen anodebasedneuralnetworkapproachforpm25forecasting
AT yantsungpeng anodebasedneuralnetworkapproachforpm25forecasting
AT ashershao anodebasedneuralnetworkapproachforpm25forecasting
AT mengchangchen anodebasedneuralnetworkapproachforpm25forecasting
AT mdkhalidhossen odebasedneuralnetworkapproachforpm25forecasting
AT yantsungpeng odebasedneuralnetworkapproachforpm25forecasting
AT ashershao odebasedneuralnetworkapproachforpm25forecasting
AT mengchangchen odebasedneuralnetworkapproachforpm25forecasting