Resolving the source of branch length variation in the Y chromosome phylogeny

Abstract Background Genetic variation in the non-recombining part of the human Y chromosome has provided important insight into the paternal history of human populations. However, a significant and yet unexplained branch length variation of Y chromosome lineages has been observed, notably amongst th...

Full description

Saved in:
Bibliographic Details
Main Authors: Yaniv Swiel, Janet Kelso, Stéphane Peyrégne
Format: Article
Language:English
Published: BMC 2025-01-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-024-03468-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544618242473984
author Yaniv Swiel
Janet Kelso
Stéphane Peyrégne
author_facet Yaniv Swiel
Janet Kelso
Stéphane Peyrégne
author_sort Yaniv Swiel
collection DOAJ
description Abstract Background Genetic variation in the non-recombining part of the human Y chromosome has provided important insight into the paternal history of human populations. However, a significant and yet unexplained branch length variation of Y chromosome lineages has been observed, notably amongst those that are highly diverged from the human reference Y chromosome. Understanding the origin of this variation, which has previously been attributed to changes in generation time, mutation rate, or efficacy of selection, is important for accurately reconstructing human evolutionary and demographic history. Results Here, we analyze Y chromosomes from present-day and ancient modern humans, as well as Neandertals, and show that branch length variation amongst human Y chromosomes cannot solely be explained by differences in demographic or biological processes. Instead, reference bias results in mutations being missed on Y chromosomes that are highly diverged from the reference used for alignment. We show that masking fast-evolving, highly divergent regions of the human Y chromosome mitigates the effect of this bias and enables more accurate determination of branch lengths in the Y chromosome phylogeny. Conclusion We show that our approach allows us to estimate the age of ancient samples from Y chromosome sequence data and provide updated estimates for the time to the most recent common ancestor using the portion of the Y chromosome where the effect of reference bias is minimized.
format Article
id doaj-art-bd05e46cff124d9582b9a7a6d9c9ac58
institution Kabale University
issn 1474-760X
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series Genome Biology
spelling doaj-art-bd05e46cff124d9582b9a7a6d9c9ac582025-01-12T12:25:57ZengBMCGenome Biology1474-760X2025-01-0126111910.1186/s13059-024-03468-4Resolving the source of branch length variation in the Y chromosome phylogenyYaniv Swiel0Janet Kelso1Stéphane Peyrégne2Department of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyDepartment of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyDepartment of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyAbstract Background Genetic variation in the non-recombining part of the human Y chromosome has provided important insight into the paternal history of human populations. However, a significant and yet unexplained branch length variation of Y chromosome lineages has been observed, notably amongst those that are highly diverged from the human reference Y chromosome. Understanding the origin of this variation, which has previously been attributed to changes in generation time, mutation rate, or efficacy of selection, is important for accurately reconstructing human evolutionary and demographic history. Results Here, we analyze Y chromosomes from present-day and ancient modern humans, as well as Neandertals, and show that branch length variation amongst human Y chromosomes cannot solely be explained by differences in demographic or biological processes. Instead, reference bias results in mutations being missed on Y chromosomes that are highly diverged from the reference used for alignment. We show that masking fast-evolving, highly divergent regions of the human Y chromosome mitigates the effect of this bias and enables more accurate determination of branch lengths in the Y chromosome phylogeny. Conclusion We show that our approach allows us to estimate the age of ancient samples from Y chromosome sequence data and provide updated estimates for the time to the most recent common ancestor using the portion of the Y chromosome where the effect of reference bias is minimized.https://doi.org/10.1186/s13059-024-03468-4Reference biasAncient DNAY chromosomeSequence alignmentMolecular datingMutation rate
spellingShingle Yaniv Swiel
Janet Kelso
Stéphane Peyrégne
Resolving the source of branch length variation in the Y chromosome phylogeny
Genome Biology
Reference bias
Ancient DNA
Y chromosome
Sequence alignment
Molecular dating
Mutation rate
title Resolving the source of branch length variation in the Y chromosome phylogeny
title_full Resolving the source of branch length variation in the Y chromosome phylogeny
title_fullStr Resolving the source of branch length variation in the Y chromosome phylogeny
title_full_unstemmed Resolving the source of branch length variation in the Y chromosome phylogeny
title_short Resolving the source of branch length variation in the Y chromosome phylogeny
title_sort resolving the source of branch length variation in the y chromosome phylogeny
topic Reference bias
Ancient DNA
Y chromosome
Sequence alignment
Molecular dating
Mutation rate
url https://doi.org/10.1186/s13059-024-03468-4
work_keys_str_mv AT yanivswiel resolvingthesourceofbranchlengthvariationintheychromosomephylogeny
AT janetkelso resolvingthesourceofbranchlengthvariationintheychromosomephylogeny
AT stephanepeyregne resolvingthesourceofbranchlengthvariationintheychromosomephylogeny