On the origin and structure of haplotype blocks

Research output: Contribution to journalArticleAcademicpeer-review

23 Citations (Scopus)

Abstract

The term "haplotype block" is commonly used in the developing field of haplotype-based inference methods. We argue that the term should be defined based on the structure of the Ancestral Recombination Graph (ARG), which contains complete information on the ancestry of a sample. We use simulated examples to demonstrate key features of the relationship between haplotype blocks and ancestral structure, emphasizing the stochasticity of the processes that generate them. Even the simplest cases of neutrality or of a "hard" selective sweep produce a rich structure, often missed by commonly used statistics. We highlight a number of novel methods for inferring haplotype structure, based on the full ARG, or on a sequence of trees, and illustrate how they can be used to define haplotype blocks using an empirical data set. While the advent of new, computationally efficient methods makes it possible to apply these concepts broadly, they (and additional new methods) could benefit from adding features to explore haplotype blocks, as we define them. Understanding and applying the concept of the haplotype block will be essential to fully exploit long and linked-read sequencing technologies.

Original languageEnglish
Pages (from-to)1441-1457
Number of pages17
JournalMolecular Ecology
Volume32
Issue number6
DOIs
Publication statusPublished - Mar-2023
Externally publishedYes

Keywords

  • Haplotypes/genetics
  • Algorithms
  • Models, Genetic

Fingerprint

Dive into the research topics of 'On the origin and structure of haplotype blocks'. Together they form a unique fingerprint.
  • ERC Proof of Concept

    Chan, F. (Recipient), 2022

    Prize: Fellowship awarded competitivelyAcademic

Cite this