TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.
It is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization vari...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Public Library of Science (PLoS)
2024-01-01
|
| Series: | PLoS ONE |
| Online Access: | https://doi.org/10.1371/journal.pone.0314502 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1846129383104839680 |
|---|---|
| author | Duo Yu Matthew Koslovsky Margaret C Steiner Kusha Mohammadi Chenguang Zhang Michael D Swartz |
| author_facet | Duo Yu Matthew Koslovsky Margaret C Steiner Kusha Mohammadi Chenguang Zhang Michael D Swartz |
| author_sort | Duo Yu |
| collection | DOAJ |
| description | It is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization variable selection (EMVS) method to simultaneously detect common and rare variants at the individual variant level using family trio data. TRIO_RVEMVS was assessed in both large (1500 families) and small (350 families) datasets based on simulation. The performance of TRIO_RVEMVS was compared with gene-level kernel and burden association tests that use pedigree data (PedGene) and rare-variant extensions of the transmission disequilibrium test (RV-TDT). At the region level, TRIO_RVEMVS outperformed PedGene and RV-TDT when common variants were included. TRIO_RVEMVS performed competitively with PedGene and outperformed RV-TDT when the analysis was only restricted to rare variants. At the individual variants level, with 1,500 trios, the average true positive rate of individual rare variants that were polymorphic across 500 datasets was 12.20%, and the average false positive rate was 0.74%. In the datasets with 350 trios, the average true and false positive rates of individual rare variants were 13.10% and 1.30%, respectively. When applying TRIO_RVEMVS to real data from the Gabriella Miller Kids First Pediatric Research Program, it identified 3 rare variants in q24.21 and q24.22 associated with the risk of orofacial clefts in the Kids First European population. |
| format | Article |
| id | doaj-art-57876a1bd3bc4eeeb41a9a22e4bc1dda |
| institution | Kabale University |
| issn | 1932-6203 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | Public Library of Science (PLoS) |
| record_format | Article |
| series | PLoS ONE |
| spelling | doaj-art-57876a1bd3bc4eeeb41a9a22e4bc1dda2024-12-10T05:32:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-011912e031450210.1371/journal.pone.0314502TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.Duo YuMatthew KoslovskyMargaret C SteinerKusha MohammadiChenguang ZhangMichael D SwartzIt is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization variable selection (EMVS) method to simultaneously detect common and rare variants at the individual variant level using family trio data. TRIO_RVEMVS was assessed in both large (1500 families) and small (350 families) datasets based on simulation. The performance of TRIO_RVEMVS was compared with gene-level kernel and burden association tests that use pedigree data (PedGene) and rare-variant extensions of the transmission disequilibrium test (RV-TDT). At the region level, TRIO_RVEMVS outperformed PedGene and RV-TDT when common variants were included. TRIO_RVEMVS performed competitively with PedGene and outperformed RV-TDT when the analysis was only restricted to rare variants. At the individual variants level, with 1,500 trios, the average true positive rate of individual rare variants that were polymorphic across 500 datasets was 12.20%, and the average false positive rate was 0.74%. In the datasets with 350 trios, the average true and false positive rates of individual rare variants were 13.10% and 1.30%, respectively. When applying TRIO_RVEMVS to real data from the Gabriella Miller Kids First Pediatric Research Program, it identified 3 rare variants in q24.21 and q24.22 associated with the risk of orofacial clefts in the Kids First European population.https://doi.org/10.1371/journal.pone.0314502 |
| spellingShingle | Duo Yu Matthew Koslovsky Margaret C Steiner Kusha Mohammadi Chenguang Zhang Michael D Swartz TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. PLoS ONE |
| title | TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. |
| title_full | TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. |
| title_fullStr | TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. |
| title_full_unstemmed | TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. |
| title_short | TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. |
| title_sort | trio rvemvs a bayesian framework for rare variant association analysis with expectation maximization variable selection using family trio data |
| url | https://doi.org/10.1371/journal.pone.0314502 |
| work_keys_str_mv | AT duoyu triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata AT matthewkoslovsky triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata AT margaretcsteiner triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata AT kushamohammadi triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata AT chenguangzhang triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata AT michaeldswartz triorvemvsabayesianframeworkforrarevariantassociationanalysiswithexpectationmaximizationvariableselectionusingfamilytriodata |