Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN

Abstract Deciphering how noncoding DNA determines gene expression is critical for decoding the functional genome. Understanding the transcription effects of noncoding genetic variants are still major unsolved problems, which is critical for downstream applications in human genetics and precision med...

Full description

Saved in:
Bibliographic Details
Main Authors: Yu Wang, Nan Liang, Ge Gao
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-024-55392-7
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841559313570594816
author Yu Wang
Nan Liang
Ge Gao
author_facet Yu Wang
Nan Liang
Ge Gao
author_sort Yu Wang
collection DOAJ
description Abstract Deciphering how noncoding DNA determines gene expression is critical for decoding the functional genome. Understanding the transcription effects of noncoding genetic variants are still major unsolved problems, which is critical for downstream applications in human genetics and precision medicine. Here, we integrate regulatory-specific neural networks and tissue-specific gradient-boosting trees to build SVEN: a hybrid sequence-oriented architecture that can accurately predict tissue-specific gene expression level and quantify the tissue-specific transcriptomic impacts of structural variants across more than 350 tissues and cell lines. We further systematically screen a large-scale structural variants dataset derived from 3622 individuals and clinical structural variants from ClinVar, and provide an overview of transcriptomic impacts of structural variants in population. As a sequence-oriented model, SVEN is also able to predict regulatory effects for small noncoding variants. We expect that SVEN will enable more effective in silico analysis and interpretation of human genome-wide disease-related genetic variants.
format Article
id doaj-art-c37fe0ea49754581b7942193892cde29
institution Kabale University
issn 2041-1723
language English
publishDate 2024-12-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj-art-c37fe0ea49754581b7942193892cde292025-01-05T12:36:17ZengNature PortfolioNature Communications2041-17232024-12-0115111110.1038/s41467-024-55392-7Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVENYu Wang0Nan Liang1Ge Gao2State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking UniversityState Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking UniversityState Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Biomedical Pioneering Innovative Center (BIOPIC) and Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), Peking UniversityAbstract Deciphering how noncoding DNA determines gene expression is critical for decoding the functional genome. Understanding the transcription effects of noncoding genetic variants are still major unsolved problems, which is critical for downstream applications in human genetics and precision medicine. Here, we integrate regulatory-specific neural networks and tissue-specific gradient-boosting trees to build SVEN: a hybrid sequence-oriented architecture that can accurately predict tissue-specific gene expression level and quantify the tissue-specific transcriptomic impacts of structural variants across more than 350 tissues and cell lines. We further systematically screen a large-scale structural variants dataset derived from 3622 individuals and clinical structural variants from ClinVar, and provide an overview of transcriptomic impacts of structural variants in population. As a sequence-oriented model, SVEN is also able to predict regulatory effects for small noncoding variants. We expect that SVEN will enable more effective in silico analysis and interpretation of human genome-wide disease-related genetic variants.https://doi.org/10.1038/s41467-024-55392-7
spellingShingle Yu Wang
Nan Liang
Ge Gao
Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
Nature Communications
title Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
title_full Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
title_fullStr Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
title_full_unstemmed Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
title_short Quantifying the regulatory potential of genetic variants via a hybrid sequence-oriented model with SVEN
title_sort quantifying the regulatory potential of genetic variants via a hybrid sequence oriented model with sven
url https://doi.org/10.1038/s41467-024-55392-7
work_keys_str_mv AT yuwang quantifyingtheregulatorypotentialofgeneticvariantsviaahybridsequenceorientedmodelwithsven
AT nanliang quantifyingtheregulatorypotentialofgeneticvariantsviaahybridsequenceorientedmodelwithsven
AT gegao quantifyingtheregulatorypotentialofgeneticvariantsviaahybridsequenceorientedmodelwithsven