Subsampling Algorithms for Irregularly Spaced Autoregressive Models

With the exponential growth of data across diverse fields, applying conventional statistical methods directly to large-scale datasets has become computationally infeasible. To overcome this challenge, subsampling algorithms are widely used to perform statistical analyses on smaller, more manageable...

Bibliographic Details
Main Authors: Jiaqi Liu, Ziyang Wang, HaiYing Wang, Nalini Ravishanker
Format: Article
Language: English
Published: MDPI AG 2024-11-01
Series: Algorithms
Subjects: subsampling; irregularly spaced autoregressive model; time series; big data
Online Access: https://www.mdpi.com/1999-4893/17/11/524
_version_ 1846154666641981440
author Jiaqi Liu
Ziyang Wang
HaiYing Wang
Nalini Ravishanker
author_facet Jiaqi Liu
Ziyang Wang
HaiYing Wang
Nalini Ravishanker
author_sort Jiaqi Liu
collection DOAJ
description With the exponential growth of data across diverse fields, applying conventional statistical methods directly to large-scale datasets has become computationally infeasible. To overcome this challenge, subsampling algorithms are widely used to perform statistical analyses on smaller, more manageable subsets of the data. The effectiveness of these methods depends on their ability to identify and select data points that improve the estimation efficiency according to some optimality criteria. While much of the existing research has focused on subsampling techniques for independent data, there is considerable potential for developing methods tailored to dependent data, particularly in time-dependent contexts. In this study, we extend subsampling techniques to irregularly spaced time series data which are modeled by irregularly spaced autoregressive models. We present frameworks for various subsampling approaches, including optimal subsampling under A-optimality, information-based optimal subdata selection, and sequential thinning on streaming data. These methods use A-optimality or D-optimality criteria to assess the usefulness of each data point and prioritize the inclusion of the most informative ones. We then assess the performance of these subsampling methods using numerical simulations, providing insights into their suitability and effectiveness for handling irregularly spaced long time series. Numerical results show that our algorithms have promising performance. Their estimation efficiency can be ten times as high as that of the uniform sampling estimator. They also significantly reduce the computational time and can be up to forty times faster than the full-data estimator.
format Article
id doaj-art-05976424af3a45e8b9084644d7d5b86a
institution Kabale University
issn 1999-4893
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj-art-05976424af3a45e8b9084644d7d5b86a; 2024-11-26T17:45:30Z; eng; MDPI AG; Algorithms; 1999-4893; 2024-11-01; 17; 11; 524; 10.3390/a17110524; Subsampling Algorithms for Irregularly Spaced Autoregressive Models; Jiaqi Liu (0); Ziyang Wang (1); HaiYing Wang (2); Nalini Ravishanker (3); Department of Statistics, University of Connecticut, Storrs, CT 06269, USA (0-3); https://www.mdpi.com/1999-4893/17/11/524; subsampling; irregularly spaced autoregressive model; time series; big data
title Subsampling Algorithms for Irregularly Spaced Autoregressive Models
title_sort subsampling algorithms for irregularly spaced autoregressive models
topic subsampling
irregularly spaced autoregressive model
time series
big data
url https://www.mdpi.com/1999-4893/17/11/524