Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle

Abstract Ongoing advances in genome sequencing technologies have enabled the identification of numerous structural variants (SVs) in livestock genomes, which are the main determinants of complex traits due to their impact on gene expression. Thus, this study aimed to detect structural variants using...

Full description

Saved in:
Bibliographic Details
Main Authors: Natalia A. Marín-Garzón, Lucio F. M. Mota, Giovana Vargas, Leonardo M. Arikawa, Larissa F. S. Fonseca, Gerardo A. Fernandes Júnior, Roberto Carvalheiro, Lucia G. Albuquerque
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-14139-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849226333645701120
author Natalia A. Marín-Garzón
Lucio F. M. Mota
Giovana Vargas
Leonardo M. Arikawa
Larissa F. S. Fonseca
Gerardo A. Fernandes Júnior
Roberto Carvalheiro
Lucia G. Albuquerque
author_facet Natalia A. Marín-Garzón
Lucio F. M. Mota
Giovana Vargas
Leonardo M. Arikawa
Larissa F. S. Fonseca
Gerardo A. Fernandes Júnior
Roberto Carvalheiro
Lucia G. Albuquerque
author_sort Natalia A. Marín-Garzón
collection DOAJ
description Abstract Ongoing advances in genome sequencing technologies have enabled the identification of numerous structural variants (SVs) in livestock genomes, which are the main determinants of complex traits due to their impact on gene expression. Thus, this study aimed to detect structural variants using whole genome re-sequencing (WGS) data and identify positional candidate genes and quantitative trait loci (QTL) overlapping the most frequent SV regions in Nellore cattle. The WGS from 151 representative Nellore bulls were analyzed to conduct genome-wide structural variation calling and to identify common SV regions. Gene and QTL information surrounding the most frequent SV regions was identified using the Ensembl Genes and Cattle QTL database. The identified genes were functionally classified for biological mechanisms and pathways (Gene Ontology - GO) using the panther database. A total of 215,031 SVs were identified, with most of them being copy number variants (CNV) (183,032 deletions and 14,013 duplications) and 17,986 inversions (INV). These SVs cover, on average, 4.81% of the autosomal genome. Furthermore, we found 3,752 non-redundant SV regions that are frequent in at least 5% of the bulls. These SV regions mainly correspond to CNV regions (97%) and inversion regions (3%). In total, all SV regions cover 13.13% of the total autosomal genome, with 11.4% attributed to CNV regions and 1.7% to inversion regions. Moreover, we found that 532 SV regions were common in more than 50% of the bulls evaluated and overlapped 130 QTL previously associated with economically important traits related to exterior, health, meat and carcass, milk, production, and reproduction. A total of 1,164 positional candidate genes were identified, with 204 SVRs overlapping these genes. These genes are significantly overrepresented in GO terms related to biological processes (BP), molecular functions (MF), and biochemical pathways, playing an essential role in environmental adaptation mechanisms and feed efficiency indicator traits. Our results suggest that genes surrounding SV regions play key biological functions essential to thermotolerance, immunity, metabolism, tissue integrity, and environmental adaptation in tropical regions.
format Article
id doaj-art-223f7037ffc44b4aa9faf7e85293d858
institution Kabale University
issn 2045-2322
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-223f7037ffc44b4aa9faf7e85293d8582025-08-24T11:22:35ZengNature PortfolioScientific Reports2045-23222025-08-0115112410.1038/s41598-025-14139-0Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattleNatalia A. Marín-Garzón0Lucio F. M. Mota1Giovana Vargas2Leonardo M. Arikawa3Larissa F. S. Fonseca4Gerardo A. Fernandes Júnior5Roberto Carvalheiro6Lucia G. Albuquerque7School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)CSIRO Agriculture and FoodSchool of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP)Abstract Ongoing advances in genome sequencing technologies have enabled the identification of numerous structural variants (SVs) in livestock genomes, which are the main determinants of complex traits due to their impact on gene expression. Thus, this study aimed to detect structural variants using whole genome re-sequencing (WGS) data and identify positional candidate genes and quantitative trait loci (QTL) overlapping the most frequent SV regions in Nellore cattle. The WGS from 151 representative Nellore bulls were analyzed to conduct genome-wide structural variation calling and to identify common SV regions. Gene and QTL information surrounding the most frequent SV regions was identified using the Ensembl Genes and Cattle QTL database. The identified genes were functionally classified for biological mechanisms and pathways (Gene Ontology - GO) using the panther database. A total of 215,031 SVs were identified, with most of them being copy number variants (CNV) (183,032 deletions and 14,013 duplications) and 17,986 inversions (INV). These SVs cover, on average, 4.81% of the autosomal genome. Furthermore, we found 3,752 non-redundant SV regions that are frequent in at least 5% of the bulls. These SV regions mainly correspond to CNV regions (97%) and inversion regions (3%). In total, all SV regions cover 13.13% of the total autosomal genome, with 11.4% attributed to CNV regions and 1.7% to inversion regions. Moreover, we found that 532 SV regions were common in more than 50% of the bulls evaluated and overlapped 130 QTL previously associated with economically important traits related to exterior, health, meat and carcass, milk, production, and reproduction. A total of 1,164 positional candidate genes were identified, with 204 SVRs overlapping these genes. These genes are significantly overrepresented in GO terms related to biological processes (BP), molecular functions (MF), and biochemical pathways, playing an essential role in environmental adaptation mechanisms and feed efficiency indicator traits. Our results suggest that genes surrounding SV regions play key biological functions essential to thermotolerance, immunity, metabolism, tissue integrity, and environmental adaptation in tropical regions.https://doi.org/10.1038/s41598-025-14139-0Copy number variationGains and losses of DNA fragmentsMobile elementsNext-generation sequencingZebu cattle
spellingShingle Natalia A. Marín-Garzón
Lucio F. M. Mota
Giovana Vargas
Leonardo M. Arikawa
Larissa F. S. Fonseca
Gerardo A. Fernandes Júnior
Roberto Carvalheiro
Lucia G. Albuquerque
Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
Scientific Reports
Copy number variation
Gains and losses of DNA fragments
Mobile elements
Next-generation sequencing
Zebu cattle
title Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
title_full Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
title_fullStr Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
title_full_unstemmed Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
title_short Detection and functional assessment of structural variants using whole-genome re-sequencing data in Nellore cattle
title_sort detection and functional assessment of structural variants using whole genome re sequencing data in nellore cattle
topic Copy number variation
Gains and losses of DNA fragments
Mobile elements
Next-generation sequencing
Zebu cattle
url https://doi.org/10.1038/s41598-025-14139-0
work_keys_str_mv AT nataliaamaringarzon detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT luciofmmota detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT giovanavargas detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT leonardomarikawa detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT larissafsfonseca detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT gerardoafernandesjunior detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT robertocarvalheiro detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle
AT luciagalbuquerque detectionandfunctionalassessmentofstructuralvariantsusingwholegenomeresequencingdatainnellorecattle