Like many people, from a young age I was obsessed and interested in works of fantasy and science fiction. To feel transported to magical worlds of various imaginative creatures and diverse places. The luxury of being able to separate from the mundanity of reality is one many children (or nostalgic adults) will be able to relate to upon reflection. Worlds that appear far more creative and engaging than our own are intrinsically enticing to the human psyche and the escapism it allows is no doubt an integral part of growing up for many people (especially those who have also dealt or avoided dealing with mental health issues).
The intricate connection to the (super)natural world drove me to fall in love with the natural world. Although there might seem to be an intrinsic contrast between the two – the absence or presence of reality – the truth is that the world is a wondrous place if you observe it through an appropriate lens. Dragons are real, forms of life are astronomically varied and imaginative, and there we are surrounded by the unknown and potentially mythical. To see the awe and mystification on a child’s face when they see a strange or unique animal for the very first time bears remarkable parallels to the expression when we stare into the fantasy of Avatar or The Lord of the Rings.
It might seem common for ‘nerds’ (at least under the traditional definition of being obsessed with particular aspects of pop culture) to later become scientists of some form or another. And I think this is a true reflection: particularly, I think the innate personality traits that cause one to look at the world of fantasy with wonder and amazement also commonly elicits a similar response in terms of the natural world. It is hard to see an example where the CGI’d majesty of contemporary fantasy and sci-fi could outcompete the intrigue generated by real, wondrous plants and animals.
Seeing the divine in the mundane
Although we often require a more tangible, objective justification for research, the connection of people to the diversity of life (whether said diversity is fictitious or not) should be a significant driving factor in the perceived importance of conservation management. However, we are often degraded to somewhat trivial discussions: why should we care about (x) species? What do they do for us? Why are they important?
If we approach the real world and the organisms that inhabit it with truly the same wonder as we approach the fantastical, would we be more successful in preserving biodiversity? Could we reverse our horrific trend of letting species go extinct? Every species on Earth represents something unique: a new perspective, an evolutionary innovation, a lens through which to see the world and its history. Even the most ‘mundane’ of species represent something critical to functionality of ecosystems, and their lack of emphasis undermines their importance.
The biota of Earth are no different to the magical fabled beasts of science fiction and fantasy, and we’re watching it all burn away right in front of our eyes.
This is partly where the concept of a ‘model’ comes into it: it’s much easier to pick a particular species to study as a target, and use the information from it to apply to other scenarios. Most people would be familiar with the concept based on medical research: the ‘lab rat’ (or mouse). The common house mouse (Mus musculus) and the brown rat (Rattus norvegicus) are some of the most widely used models for understanding the impact of particular biochemical compounds on physiology and are often used as the testing phase of medical developments before human trials.
This is the fourth (and final) part of the miniseries on the genetics and process of speciation. To start from Part One, click here.
In last week’s post, we looked at how we can use genetic tools to understand and study the process of speciation, and particularly the transition from populations to species along the speciation continuum. Following on from that, the question of “how many species do I have?” can be further examined using genetic data. Sometimes, it’s entirely necessary to look at this question using genetics (and genomics).
Genetic tools to study species: the ‘Barcode of Life’
A classically employed method that uses DNA to detect and determine species is referred to as the ‘Barcode of Life’. This uses a very specific fragment of DNA from the mitochondria of the cell: the cytochrome c oxidase I gene, CO1. This gene is made of 648 base pairs and is found pretty well universally: this and the fact that CO1 evolves very slowly make it an ideal candidate for easily testing the identity of new species. Additionally, mitochondrial DNA tends to be a bit more resilient than its nuclear counterpart; thus, small or degraded tissue samples can still be sequenced for CO1, making it amenable to wildlife forensics cases. Generally, two sequences will be considered as belonging to different species if they are certain percentage different from one another.
Despite the apparent benefits of CO1, there are of course a few drawbacks. Most of these revolve around the mitochondrial genome itself. Because mitochondria are passed on from mother to offspring (and not at all from the father), it reflects the genetic history of only one sex of the species. Secondly, the actual cut-off for species using CO1 barcoding is highly contentious and possibly not as universal as previously suggested. Levels of sequence divergence of CO1 between species that have been previously determined to be separate (through other means) have varied from anywhere between 2% to 12%. The actual translation of CO1 sequence divergence and species identity is not all that clear.
Gene tree – species tree incongruences
One particularly confounding aspect of defining species based on a single gene, and with using phylogenetic-based methods, is that the history of that gene might not actually be reflective of the history of the species. This can be a little confusing to think about but essentially leads to what we call “gene tree – species tree incongruence”. Different evolutionary events cause different effects on the underlying genetic diversity of a species (or group of species): while these may be predictable from the genetic sequence, different parts of the genome might not be as equally affected by the same exact process.
A classic example of this is hybridisation. If we have two initial species, which then hybridise with one another, we expect our resultant hybrids to be approximately made of 50% Species A DNA and 50% Species B DNA (if this is the first generation of hybrids formed; it gets a little more complicated further down the track). This means that, within the DNA sequence of the hybrid, 50% of it will reflect the history of Species A and the other 50% will reflect the history of Species B, which could differ dramatically. If we randomly sample a single gene in the hybrid, we will have no idea if that gene belongs to the genealogy of Species A or Species B, and thus we might make incorrect inferences about the history of the hybrid species.
There are a number of other processes that could similarly alter our interpretations of evolutionary history based on analysing the genetic make-up of the species. The best way to handle this is simply to sample more genes: this way, the effect of variation of evolutionary history in individual genes is likely to be overpowered by the average over the entire gene pool. We interpret this as a set of individual gene trees contained within a species tree: although one gene might vary from another, the overall picture is clearer when considering all genes together.
In earlier posts on The G-CAT, I’ve discussed the biogeographical patterns unveiled by my Honours research. Another key component of that paper involved using statistical modelling to determine whether cryptic species were present within the pygmy perches. I didn’t exactly elaborate on that in that section (mostly for simplicity), but this type of analysis is referred to as ‘species delimitation’. To try and simplify complicated analyses, species delimitation methods evaluate possible numbers and combinations of species within a particular dataset and provides a statistical value for which configuration of species is most supported. One program that employs species delimitation is Bayesian Phylogenetics and Phylogeography(BPP): to do this, it uses a plethora of information from the genetics of the individuals within the dataset. These include how long ago the different populations/species separated; which populations/species are most related to one another; and a pre-set minimum number of species (BPP will try to combine these in estimations, but not split them due to computational restraints). This all sounds very complex (and to a degree it is), but this allows the program to give you a statistical value for what is a species and what isn’t based on the genetics and statistical modelling.
The end result of a BPP run is usually reported as a species tree (e.g. a phylogenetic tree describing species relationships) and statistical support for the delimitation of species (0-1 for each species). Because of the way the statistical component of BPP works, it has been found to give extremely high support for species identities. This has been criticised as BPP can, at time, provide high statistical support for genetically isolated lineages (i.e. divergent populations) which are not actually species.
Improving species identities with integrative taxonomy
Due to this particular drawback, and the often complex nature of species identity, using solely genetic information such as species delimitation to define species is extremely rare. Instead, we use a combination of different analytical techniques which can include genetic-based evaluations to more robustly assign and describe species. In my own paper example, we suggested that up to three ‘species’ of N. vittata that were determined as cryptic species by BPP could potentially exist pending on further analyses. We did not describe or name any of the species, as this would require a deeper delve into the exact nature and identity of these species.
As genetic data and analytical techniques improve into the future, it seems likely that our ability to detect and determine species boundaries will also improve. However, the additional supported provided by alternative aspects such as ecology, behaviour and morphology will undoubtedly be useful in the progress of taxonomy.
Given the strong influence of genetic identity on the process and outcomes of the speciation process, it seems a natural connection to use genetic information to study speciation and species identities. There is a plethora of genetics-based tools we can use to investigate how speciation occurs (both the evolutionary processes and the external influences that drive it). One clear way to test whether two populations of a particular species are actually two different species is to investigate genes related to reproductive isolation: if the genetic differences demonstrate reproductive incompatibilities across the two populations, then there is strong evidence that they are separate species (at least under the Biological Species Concept; see Part One for why!). But this type of analysis requires several tools: 1) knowledge of the specific genes related to reproduction (e.g. formation of sperm and eggs, genital morphology, etc.), 2) the complete and annotated genome of the species (to be able to find and analyse the right genes properly) and 3) a good amount of data for the populations in question. As you can imagine, for people working on non-model species (i.e. ones that haven’t had the same history and detail of research as, say, humans and mice), this can be problematic. So, instead, we can use other genetic information to investigate and suggest patterns and processes related to the formation of new species.
Is reproductive isolation naturally selected for or just a consequence?
A fundamental aspect of studies of speciation is a “chicken or the egg”-type paradigm: does natural selection directly select for rapid reproductive isolation, preventing interbreeding; or as a secondary consequence of general adaptive differences, over a long history of evolution? This might be a confusing distinction, so we’ll dive into it a little more.
The reproductive incompatibility of two populations (thus making them species) is often intrinsically linked to the genetic make-up of those two species. Some conflicts in the genetics of Population 1 and Population 2 may mean that a hybrid having half Population 1 genes and half Population 2 genes will have serious fitness problems (such as sterility or developmental problems). Dramatic genetic differences, particularly a difference in the number of chromosomes between the two sources, is a significant component of reproductive isolation and is usually to blame for sterile hybrids such as ligers, zorse and mules.
We can study the process of speciation in the natural world without focussing on the ‘reproductive isolation’ element of species identity as well. For many species, we are unlikely to have the detail (such as an annotated genome and known functions of genes related to reproduction) required to study speciation at this level in any case. Instead, we might choose to focus on the different factors that are currently influencing the process of speciation, such as how the environmental, demographic or adaptive contexts of populations plays a role in the formation of new species. Many of these questions fall within the domain of phylogeography; particularly, how the historical environment has shaped the diversity of populations and species today.
Although these can help answer some questions related to speciation, new tools are constantly needed to provide a clearer picture of the process. Understanding how and why new species are formed is a critical aspect of understanding the world’s biodiversity. How can we predict if a population will speciate at some point? What environmental factors are most important for driving the formation of new species? How stable are species identities, really? These questions (and many more) remain elusive for a wide variety of life on Earth.
This is Part 2 of a four part miniseries on the process of speciation: how we get new species, how we can see this in action, and the end results of the process. This week we’re taking a look at how new species are formed from natural selection. For Part 1, on the identity and concept of the species, click here.
The Origin of Species
Despite Darwin’s scientifically ground-breaking revelations over 150 years ago, the truth of the origin of species has remained a puzzling and complex question in biology. While the fundamental concepts of Darwin’s theory remain heavily supported – that groups which become separated from one another and undergo differing evolutionary pathways through natural selection may over time form new species – the mechanisms leading to this are mysterious. Even though the heritable component of evolution (DNA) was not uncovered for a hundred years after publishing ‘On the Origin of Species’, Darwin’s theory can largely explain many patterns of the formation of species on Earth.
The population-speciation continuum
The understanding that groups that are separated progress into species through differential adaptation leads to a phenomenon as the ‘speciation continuum’: all populations exist at some point on the continuum, with those that are most differentiated (i.e. most progressed) are distinct species, whereas those least differentiated are closely related or the same population. Whether or not populations progress along this continuum, and how fast this progression happens, depends on the difference in selective pressure and speed of evolution in the populations. Even if two populations are physically separated, they might not necessarily form new species if the separation is too short-term or if they do not evolve in different ways. Even if they do differentially evolve, whether or not they develop reproductive isolation is not always consistent.
Furthermore, how these populations are changing may affect the rate or success of speciation: if the traits that evolve differently across the population also cause them to be unable to breed, then they may quickly become reproductively isolated and thus new species. For example, Momigliano et al. (2017) demonstrated the fastest known rate of speciation (within 3000 generations) in a marine vertebrate in a species of flounders. Flounders that adapted to a higher salinity environment became reproductively isolated from their sister population as their sperm could not tolerate the high salinity conditions (directly preventing breeding and causing reproductive isolation). This strong and rapid selection to an environment, and its subsequent selection on reproductive ability, was cutely described as a “magic trait”.
Gene flow across populations (through hybridisation) will balance out the different allele frequencies of the two gene pools, preventing adaptive alleles from moving towards fixation as per the standard natural selection process. While the effect of gene flow might slow the process, taking longer for the populations to diverge to the species level, speciation can still be achieved. Thus, the balance of gene flow and adaptive divergence is critical in determining whether ecological speciation is possible.
The reality of species
While the distinction between divergent populations and species might be a complex one, development in genomic technologies and greater understanding of evolutionary patterns is helping us uncover the real origin of species. And while species might not be as concrete a concept as one might expect, understanding the processes that generate new species and diversity is critical for understanding the diversity within nature that we see today, and also the potential diversity for the future (and why protecting said diversity is important!).