TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.

It is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization vari...

Full description

Saved in:
Bibliographic Details
Main Authors: Duo Yu, Matthew Koslovsky, Margaret C Steiner, Kusha Mohammadi, Chenguang Zhang, Michael D Swartz
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0314502
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846129383104839680
author Duo Yu
Matthew Koslovsky
Margaret C Steiner
Kusha Mohammadi
Chenguang Zhang
Michael D Swartz
author_facet Duo Yu
Matthew Koslovsky
Margaret C Steiner
Kusha Mohammadi
Chenguang Zhang
Michael D Swartz
author_sort Duo Yu
collection DOAJ
description It is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization variable selection (EMVS) method to simultaneously detect common and rare variants at the individual variant level using family trio data. TRIO_RVEMVS was assessed in both large (1500 families) and small (350 families) datasets based on simulation. The performance of TRIO_RVEMVS was compared with gene-level kernel and burden association tests that use pedigree data (PedGene) and rare-variant extensions of the transmission disequilibrium test (RV-TDT). At the region level, TRIO_RVEMVS outperformed PedGene and RV-TDT when common variants were included. TRIO_RVEMVS performed competitively with PedGene and outperformed RV-TDT when the analysis was only restricted to rare variants. At the individual variants level, with 1,500 trios, the average true positive rate of individual rare variants that were polymorphic across 500 datasets was 12.20%, and the average false positive rate was 0.74%. In the datasets with 350 trios, the average true and false positive rates of individual rare variants were 13.10% and 1.30%, respectively. When applying TRIO_RVEMVS to real data from the Gabriella Miller Kids First Pediatric Research Program, it identified 3 rare variants in q24.21 and q24.22 associated with the risk of orofacial clefts in the Kids First European population.
format Article
id doaj-art-57876a1bd3bc4eeeb41a9a22e4bc1dda
institution Kabale University
issn 1932-6203
language English
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-57876a1bd3bc4eeeb41a9a22e4bc1dda2024-12-10T05:32:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-011912e031450210.1371/journal.pone.0314502TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.Duo YuMatthew KoslovskyMargaret C SteinerKusha MohammadiChenguang ZhangMichael D SwartzIt is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization variable selection (EMVS) method to simultaneously detect common and rare variants at the individual variant level using family trio data. TRIO_RVEMVS was assessed in both large (1500 families) and small (350 families) datasets based on simulation. The performance of TRIO_RVEMVS was compared with gene-level kernel and burden association tests that use pedigree data (PedGene) and rare-variant extensions of the transmission disequilibrium test (RV-TDT). At the region level, TRIO_RVEMVS outperformed PedGene and RV-TDT when common variants were included. TRIO_RVEMVS performed competitively with PedGene and outperformed RV-TDT when the analysis was only restricted to rare variants. At the individual variants level, with 1,500 trios, the average true positive rate of individual rare variants that were polymorphic across 500 datasets was 12.20%, and the average false positive rate was 0.74%. In the datasets with 350 trios, the average true and false positive rates of individual rare variants were 13.10% and 1.30%, respectively. When applying TRIO_RVEMVS to real data from the Gabriella Miller Kids First Pediatric Research Program, it identified 3 rare variants in q24.21 and q24.22 associated with the risk of orofacial clefts in the Kids First European population.https://doi.org/10.1371/journal.pone.0314502
spellingShingle Duo Yu
Matthew Koslovsky
Margaret C Steiner
Kusha Mohammadi
Chenguang Zhang
Michael D Swartz
TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
PLoS ONE
title TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
title_full TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
title_fullStr TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
title_full_unstemmed TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
title_short TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
title_sort trio rvemvs a bayesian framework for rare variant association analysis with expectation maximization variable selection using family trio data
url https://doi.org/10.1371/journal.pone.0314502
work_keys_str_mv AT duoyu triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata
AT matthewkoslovsky triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata
AT margaretcsteiner triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata
AT kushamohammadi triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata
AT chenguangzhang triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata
AT michaeldswartz triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata