Technical variability across the 450K, EPICv1, and EPICv2 DNA methylation arrays: lessons learned for clinical and longitudinal studies

Abstract DNA methylation (DNAm) is the most commonly measured epigenetic mechanism in human populations, with most studies using Illumina arrays to assess DNAm levels. In 2023, Illumina updated their DNAm arrays to the EPIC version 2 (EPICv2), building on prior iterations, namely the EPIC version 1...

Full description

Saved in:
Bibliographic Details
Main Authors: Alexandre A. Lussier, Isabel K. Schuurmans, Anna Großbach, Julie Maclsaac, Kristy Dever, Nastassja Koen, Heather J. Zar, Dan J. Stein, Michael S. Kobor, Erin C. Dunn
Format: Article
Language:English
Published: BMC 2024-11-01
Series:Clinical Epigenetics
Subjects:
Online Access:https://doi.org/10.1186/s13148-024-01761-4
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract DNA methylation (DNAm) is the most commonly measured epigenetic mechanism in human populations, with most studies using Illumina arrays to assess DNAm levels. In 2023, Illumina updated their DNAm arrays to the EPIC version 2 (EPICv2), building on prior iterations, namely the EPIC version 1 (EPICv1) and 450K arrays. Whether DNAm measurements are stable across these three generations of arrays has yet not been investigated, limiting the ability of researchers—especially those with longitudinal data—to compare and replicate results across arrays. Here, we present results from a study of 30 child participants (15 male; 15 female) from the Drakenstein Child Health Study, who had DNAm measured on all three of the latest arrays: 450K, EPICv1, and EPICv2. Using these data, we created an annotation of probe quality across arrays, which includes the intraclass correlations, interquartile ranges, correlations, and array bias (i.e., the extent to which DNAm levels were explained by array type) of all CpGs. We also present results from an analysis of sex differences, where we found that CpGs with lower replicability across arrays had higher array-based variance, suggesting this variance metric help guide replication efforts. We also showed that epigenetic age estimates across arrays were more stable when using the principal component versions of epigenetic clocks. Ultimately, this collection of results provides a framework for investigating the replicability and longitudinal stability of epigenetic changes across multiple versions of Illumina DNAm arrays.
ISSN:1868-7083