The Genomics of Hybrid Speciation
Cassandra Nicole Trier
Dissertation presented for the degree of Philosophiae Doctor (PhD)
2018
Centre for Ecological and Evolutionary Synthesis Department of Biosciences
Faculty of Mathematics and Natural Sciences
© Cassandra Nicole Trier, 2018
Series of dissertations submitted to the
Faculty of Mathematics and Natural Sciences, University of Oslo No. 2020
ISSN 1501-7710
All rights reserved. No part of this publication may be
reproduced or transmitted, in any form or by any means, without permission.
Cover: Hanne Baadsgaard Utigard.
Print production: Reprosentralen, University of Oslo.
Science is magic that works.
-Kurt Vonnegut
Acknowledgements
This PhD has been a crazy journey and the truth is, I would have never made it through without support from so many wonderful people along the way. First of all, I would like to thank my supervisors Glenn- Peter Sætre and Kjetill S. Jakobsen. Glenn, you didn’t know what you were signing up for when I knocked on your door as a new masters student looking for a research group! Now, eight years later, I can say that it has been a pleasure being in your research group and I will always be grateful for the guidance and support you have given me through the years. Kjetill, you have always been there if I need advice or just someone to talk to about my project. Your up-beat personality and great sense of humor make you easy to talk to and I always enjoy our chats. Thank you so much for your support.
I would also like to thank my co-authors and friends in the
sparrow group for so many great discussions that left me inspired in my own work. In particular, thank you Angelica, Anna R., Camilla, Caroline, Fabrice, Jo, Mark and Melissah for being a constant source of encouragement through the years. You have always made me feel like my opinion is valuable even when I have doubted so myself; it has meant more to me than you know.
The Centre for Ecological and Evolutionary Synthesis (CEES) has
been such a fantastic work environment where I have made life-long
friends. I feel so fortunate to have worked with such an amazing group
of people. Thank you to all my co-workers for the ridiculous lunchtime
conversations and making CEES a fun place to work. A special thanks
to Anders, Anna M., Bastiaan, Boris, Eric, Heidi, Helle, Inger Maren,
Katie, Kjetil, Luis, Ole Kristian, Olja, Pernille, Ryan, Sanne and Unni for
the fun that also extended outside of work at barbecues, parties,
hikes, kids play-dates, you name it.
To my knitting girls, your love and constant support has meant the world to me. Life is so much easier with friends like you. Thank you for keeping me sane and bringing so much joy and laughter into my life.
I have never stopped missing my friends from California. To Alex, Anna H., Jack, Jiro, Kevin, Nicole, Matt and Tarek, thank you for being life-long friends that keep me grounded and have put up with me all these years!
I would also like to thank all my extended family that held back from teasing me too much about “taking care of the birds” all this time.
Thank you for your continuous love and support. A special thank you to Grandpa Tom who always believed in me and I know would be proud.
To Mom and Dad, it has only been with your unconditional love that I had the courage to come to Norway in the !rst place. You
constantly sacri!ce so I can have a good life, even when it means your only daughter moves across the world. Thank you for always being there for me, believing in me and pushing me to follow my passion.
To my dear husband, Tore, I’m not sure any words su"ce. Not only have you stuck by me as my number one fan throughout the PhD, but you have been my o"cemate, co-author and best friend. People wonder how I could share an o"ce with my husband and the truth is, it was easy because you and our sons are always the highlight of my day.
Thank you for quite simply everything along the way, I couldn’t have done it without you.
And last but not least, my sons. Oliver and Eirik, thank you for everyday reminding me of what is really important in life.
Oslo, June 2018
Table of Contents
List of Papers 1
Summary 2
Introduction 3
Background 3
Hybrid speciation 4
Detecting hybrid origin 6
Hybrid zones 8
The genomic architecture of hybrid species 11
The Passer sparrow system 13
Aims 15
Paper Summaries 16
Discussion 19
Patterns of admixture in a hybrid species 19
Reproductive isolation from the parents 21
Selection within the hybrid lineage 23
Future perspectives and concluding remarks 23
Acknowledgements 25
References 25
Paper I
Paper II
Paper III
List of Papers
Paper I.
C. N. Trier*, J. S. Hermansen*, G.-P. Sætre, R. I. Bailey. (2014) Evidence for mito-nuclear and sex-linked reproductive barriers between the hybrid Italian sparrow and its parent species. PLoS Genetics
10:e1004075.
Paper II.
T.O. Elgvin*, C. N. Trier*, O.K. Tørresen, I. Hagen, S. Lien, M. Ravinet, H.
Jensen, G.-P. Sætre. (2017) The genetic mosaicism of hybrid speciation. Science Advances 3:e1602996.
*These authors contributed equally to the paper.
Paper III.
A. Runemark, C. N. Trier, F. Eroukhmano!, J.S. Hermansen, M.
Matschiner, M. Ravinet, T.O. Elgvin, G.-P. Sætre. (2018) Variation and
constraints in hybrid genome formation. Nature Ecology and Evolution
2:549-556.
Summary
The central question of this thesis is how hybridization can lead to the formation of new species. To approach this question, I used the hybrid Italian sparrow (Passer italiae) to explore the consequences of hybridization on genomic architecture and reproductive isolation at the genomic level. With the use of transcriptomic data and cline analyses in Paper I, we investigated if there is evidence for reproductive barriers between the hybrid species and its parents, the house sparrow (P. domesticus) and Spanish sparrow (P.
hispaniolensis); and if so, which genes or genomic regions may be involved?
We found that there is evidence of reproductive isolation between the Italian sparrow and its parent species and that Z-linked genes and mito-nuclear gene complexes play an integral role.
In Paper II, we sought to examine the genomic architecture of the Italian sparrow in comparison to its parents. This "rst consisted of de novo assembling a high-quality reference genome of the house sparrow. We then mapped whole-genome sequencing data from populations of the parent house and Spanish sparrows, as well as the Italian sparrow to the reference genome. By using comparative genomics, we were able to characterize patterns of admixture and di!erentiation in the Italian sparrow genome in relation to both parent species. We show that the genomic landscape of the Italian sparrow is highly heterogenous with regions inherited alternately from either parent across the genome in a mosaic pattern. High divergence regions between the Italian sparrow and either of its parents were found to be disproportionately located on the Z chromosome and genes involved in body patterning, beak morphology and the immune system were over- represented in these regions. We also found regions where the Italian sparrow is divergent from both parents that may represent areas of novel divergence in this homoploid hybrid lineage.
In Paper III, we utilized genomic data from multiple geographically
isolated Italian sparrow populations from di!erent Mediterranean islands to
explore the extent to which the hybrid genome can vary. We "nd that there is
variation in the genomic combinations that compose a functional hybrid species, yet there are some areas that are invariably inherited from one parent. These regions of genomic constraint are over-represented on the Z chromosome and hold candidate incompatibility loci involved in DNA repair and mito-nuclear function.
Overall, this dissertation helps demonstrate how a new species can arise via hybridization by painting a picture of how admixture can shape the genomes of di!erentiated populations and lead to the formation of reproductive barriers.
Introduction
Background
Hybridization is widespread in nature and can have various impacts on species diversi"cation. Traditionally, interbreeding between distinct populations has been viewed as a detriment to species divergence and a
‘biological mistake’ (Mayr 1932; Fisher 1930). Yet, around 10% of animal and 25% of plant species are known to hybridize (Mallet 2007), making hybridization a prominent feature in nature. Hybridization has even been shown to have shaped the genome of our own species (Sankararaman et al.
2014). In recent years, it has become apparent that hybridization can also serve as a source of genetic variation promoting diversi"cation (Abbott et al.
2013; Seehausen et al. 2014). Since new genetic variation can have adaptive
potential, species divergence can alternatively be facilitated through
hybridization rather than impeded (Grant and Grant 1994; Heliconius
Genome Consortium 2012). In fact, hybridization is particularly common
among rapidly radiating groups (Mallet 2007; Grant et al. 2015) as it can allow
for rapid local adaption through the introgression of new variation subject to
selection (Lamichhaney et al. 2018). In some instances, hybridization can
even give rise to new species through the recombination of parental
genomes, leading to a third population of mixed ancestry that remains distinct from both its parents (Rieseberg 1997; Mallet 2007; Mavarez and Linares 2008). Thus, the creative role of hybridization in evolution spans a continuum from adaptive introgression to hybrid speciation. This thesis focuses on the latter and aims at gaining a better understanding of how hybridization can play a primary role in the origin of new species.
Hybrid speciation
When two di!erentiated populations mate, hybrid o!spring possessing novel, mixed genotypes can arise (Rieseberg 1997; Buerkle et al. 2000;
Coyne and Orr 2004). These hybrid o!spring may have reduced "tness relative to their parent taxa (Mayr 1963; Coyne and Orr 2004) due to genetic incompatibilities leading to inviability or infertility in "rst generation (F1) hybrids (Arnold and Hodges 1995) or ecological intermediacy (Schluter 1993; 1995; Coyne and Orr 2004). However, empirical studies have shown that hybrid genotypes possess a wide range of "tnesses and that hybrids can have equivalent or higher "tness than their parents (Arnold and Hodges 1995; Arnegard et al. 2014).
Hybridization provides the means in which new genetic variation from multiple loci with adaptive potential are transferred simultaneously.
Therefore, adaptive evolution may proceed more rapidly following
hybridization than would be expected from mutations alone (Grant and Grant
1994; Heliconius Genome Consortium 2012; Abbott et al. 2016). F1 hybrids
can also experience an increase in growth rate, size and reproductive
success, known as heterosis or hybrid vigor (Arnold and Hodges 1995),
which may constitute an adaptive advantage relative to its parents. Though
the genetic basis for hybrid vigor is subject to much debate, recent
hypotheses suggest it may be attributed to complementary interactions of
alleles at multiple loci (epistasis) (Baack and Rieseberg 2007). Hybrid vigor is
often broken down in subsequent generations as recessive alleles become
exposed and parental gene combinations are broken up via recombination (Dobzhansky 1948; Templeton 1981; Felsenstein 1981). Yet, if the hybrid lineage is able to persist and develop reproductive barriers against both its parents, it has the potential to become its own species. This process in which interspeci"c hybridization gives rise to novel species is known as hybrid speciation.
An important aspect of hybrid speciation is the hybrid karyotype in relation to its parents. Allopolyploid hybrids maintain a di!erent number of chromosome complements than their parents and consequently develop immediate reproductive isolation. This, combined with potential heterosis, may lead to the establishment of new lineages and has frequently been observed in plant taxa (Rieseberg 1997; Mallet 2007; Rieseberg and Willis 2007; Hegarty and Hiscock 2008; Soltis and Soltis 2009). In contrast, homoploid hybrids, which share the same ploidy level as their parents, may struggle to develop reproductive barriers strong enough to remain distinct from their parents (Baack & Rieseberg 2007). For this reason, homoploid hybrid speciation has historically been considered to be a rare outcome of hybridization (Mallet 2007; Schumer et al. 2014), particularly in animals (Mavarez and Linares 2008). The hybrid must "rst escape genetic incompatibilities and "tness loss, and then the homogenizing e!ect of gene
#ow from its parents; despite complementary ploidy levels and the fact that their parent’s reproductive barriers were su$ciently weak that they interbred in the "rst place (Buerkle et al. 2000; Coyne and Orr 2004). In some instances however, the recombination of parental alleles from initial hybridization could trigger the formation of reproductive barriers between the parents via ecological divergence (Gross and Rieseberg 2005), assortative mating (Mavarez et al. 2006; Melo et al. 2009) or genetic incompatibilities (Rieseberg 1997; Schumer et al. 2014; Abbott et al. 2016). In particular, one mechanism predicted to allow for homoploid hybrid speciation is transgressive segregation - the production of hybrid traits outside the range of its parents - that enable the hybrid to colonize new ecological niches (Rieseberg et al.
1999; Mallet 2007). While there are few well-documented cases of
homoploid hybrid animal species, new methods have led to a growing
number of empirical examples in the past decade in #ies (Schwarz et al.
2005), bats (Larsen et al. 2010), butter#ies (Mavarez and Gonzalez 2006;
Kunte et al. 2011; Heliconius Genome Consortium 2012), birds (Elgvin et al.
2011; Hermansen et al. 2011) and "shes (Salzburger et al. 2002; Meyer et al.
2006; Keller et al. 2013) suggesting that homoploid hybrid speciation may be more common than initially thought (Mavarez and Linares 2008; Abbott et al.
2013).
Detecting hybrid origin
For many years, con#icting phylogenetic trees from nuclear and organellar markers have been used to test for hybrid ancestry in proposed hybrids (Bullini 1994; Dowling and Secor 1997; Soltis and Soltis 2009). In the past, this has proven problematic when few markers are used because incomplete lineage sorting can also produce discordant phylogenies (reviewed in Ballard and Rand 2005). Genetic mosaicism, where there is evidence of alternating inheritance from two parental lineages, has been considered strong evidence for hybrid speciation, but it is also di$cult to demonstrate with few markers since hybridization upon secondary contact can give the same signal (vonHoldt et al. 2011; Schumer et al. 2014). As data sets have gotten bigger and genome-wide markers or whole genome sequencing are becoming commonplace, this has allowed for more robustly testing of hybrid ancestry hypotheses. However, it has also raised new questions in regards to what criteria are needed to determine if a taxon is derived from hybrid speciation.
As the scale of genetic data has gotten larger, it has become
increasingly apparent that hybridization is a prominent feature in nature
(Mallet 2007; Abbott et al. 2013) and the number of cases of proposed
hybrid species has increased dramatically (Schumer et al. 2013; 2014). This
has led to some confusion in the literature as to what constitutes ‘hybrid
speciation’ (Mallet 2007; Mavarez and Linares 2008; Jiggins et al. 2008;
Abbott et al. 2013; 2016). Schumer et al. 2014, argued that many purported examples of hybrid speciation do not have strong enough evidence and proposed criteria required in order to conclude that a species is product of hybrid speciation. They suggest there needs to be evidence of i) reproductive isolation between the hybrid lineage from its parents, ii) signatures of hybridization in the genome, and iii) reproductive isolation being a direct product of hybridization. Since speciation is a continuum and de"ning the point at which two divergent populations become a species has proven di$cult (Mallet 1995), it should come as no surprise that there is a debate on what evidence is required for a species to be considered of hybrid origin. In particular, Feliner et al. 2017 have criticized Schumer et al. 2014, speci"cally criterion iii, arguing that if a hybridization led to an established, ecologically and morphologically distinct hybrid lineage, this should be considered hybrid speciation regardless of whether or not hybridization directly led to reproductive isolation.
There does however appear to be some agreement that support for hybrid speciation should include evidence of both hybrid ancestry and reproductive isolation from the parental lineages. Genomic data provides great opportunities to investigate these aspects of proposed hybrid species.
For instance, since many more genomic regions can be sampled, the
chances of identifying the areas involved in reproductive isolation are higher
(Schumer et al. 2014). Also, large-scale signatures of hybridization such as
genomic mosaicism can help distinguish between patterns of hybridization
from those driven by incomplete lineage sorting, genetic drift or selection
(Nice et al. 2013; Schumer et al. 2014). While there is certainly a gray area in
what is considered hybrid speciation, what is perhaps more fruitful than
squabbling over its de"nition is using newly available genomic data to better
understand the processes involved in hybridization leading to the formation
of new species. In this thesis, the goal was to do just that; use genomic data
to explore how hybridization can result in a new, distinct and reproductively
isolated lineage.
Hybrid zones
Hybrid zones are regions where “genetically distinct groups of individuals meet and mate, resulting in at least some o!spring of mixed ancestry” (Harrison 1993). They provide an excellent opportunity to study reproductive barriers as they are in e!ect natural laboratories where parental ancestry blocks are broken down and new combinations of genes are exposed to selection (Barton and Hewitt 1989; Buerkle et al. 2000; Payseur and Nachman 2005; Harrison and Larson 2014).
Postzygotic reproductive barriers between a hybrid and its parents are often the result of Bateson-Dobzhansky-Muller (BDM) incompatibilities (Bateson 1909, Dobzhansky 1936; Muller 1940; Orr 1996; Turelli and Orr 1995). Under a BDM model, hybrids may receive alleles from two populations that have diverged at di!erent loci without su!ering a loss in "tness in either population, but when these divergent alleles introgress into a new genomic background, they may interact poorly resulting in a "tness loss (Figure 1).
This could be due to selection against heterozygotes, selection against
certain alleles in a foreign genetic background or a combination of the two
(Barton 2001; Buerkle and Rieseberg 2001; Baack and Rieseberg 2007). The
rate of gene #ow of an allelic variant in a hybrid zone is consequently
determined by its e!ect on "tness and linkage disequilibrium to other
genomic regions (Gompert and Buerkle 2011b). This is true for all loci
experiencing divergent selection in the hybrid zone and is expected to be
especially strong for BDM incompatibilities as they often have large "tness
e!ects. Therefore, hybrid zone analyses provide an opportunity for studying
genomic regions that may drive speciation.
Figure 1. BDM model of genetic incompatibilities. After populations of the ancestral genotype split, new mutations arise and reach !xation without a
!tness loss. When the divergent populations meet again and mate, new combinations of incompatible alleles result in a !tness loss in the hybrid o"spring.
By examining patterns of di!erentiation of loci across hybrid zones, it is possible to identify loci that exhibit reduced introgression and consequently are candidates for being involved in reproductive barriers between species (Barton and Hewitt 1985; Gompert and Buerkle 2011a; Gompert et al. 2012).
One manner of quantifying di!erential rates of introgression among loci in hybrid zones is through the use of cline theory. Cline theory measures the gradient in a trait or allele frequency across a geographic or genomic range.
In terms of gene #ow, cline theory can be used to measure rates of gene #ow across a geographic range (geographic clines) (Szymura and Barton 1986;
Carling and Brum"eld 2008; Teeter et al. 2010; Taylor et al. 2012) or into di!erent genomic backgrounds (genomic clines) (Szymura and Barton 1986;
Gompert and Buerkle 2011b; Taylor et al. 2014). Cline theory predicts that the width of a cline is dependent on a balance between selection and dispersal (Slatkin 1973; Barton and Hewitt 1985). Neutral alleles are predicted to introgress proportionally to dispersal distance, i.e. the gene #ow out of the hybrid zone. Meanwhile, alleles that reduce "tness or contribute to assortative mating are expected to introgress less due to strong selection creating narrow clines (Figure 2) (Barton and Hewitt 1989; Harrison 1993;
Buerkle and Lexer 2008; Gompert and Buerkle 2011b). If partial reproductive isolation exists between two taxa at a given locus, steep clines in allele frequencies indicative of reduced introgression would be expected in hybrid zones (Slatkin 1973; Nagylaki 1975; May et al. 2015).
A B A B
A b
A B a B A B
A b
A b a B a B
X
A ba B
Ancestral genotype New mutations Fixation of mutations Hybrid offspring
X