I am interested by the challenge of translating domain expertise into testable statistical frameworks. Especially in biology where the difficult-to-code observational / experimental findings often are left out of the modelling process.
The convergence of past and present contributors in the field, their cultural context, and the resulting assumptions, biases, and inferences that shape evolutionary thinking has always interested me. I believe that fully acknowledging and reconciling this historical legacy with the present is crucial for advancing evolutionary research beyond the field's active justification of systemic racism, eugenics, and discrimination. I hope to eventually expand my interest in this convergence into a more formal research effort.
My dissertation explores the intersection of complex traits, methodological approaches in macroevolution, and the evolution of bird migration. It is structured around three key questions:
Through this research, I hope to contribute to the ongoing dialogue about how best to represent and analyze complex biological traits in the era of big data and advanced statistical methods, using bird migration as a case study.
The study of avian migration presents a distinctive challenge due to the many ways a bird can be considered a 'migrant', most of which are difficult to measure directly. This is a common problem in comparative biology, as many times the traits that researchers are interested in are difficult to wrangle into a simple dataset.
Developing effective methods to measure and model these difficult-to-measure axes is essential for understanding the evolution of migration as a trait over long time scales, as well as understanding the relationship between migration and other traits. By analyzing the eBird dataset, this project aims to shed light on the many different ways birds migrate. Whether following predictable patterns or moving in more unpredictable ways, understanding these behaviors can have implications for conservation and ecological studies.
Poster on this research for the Interfaces of Global Change symposium.
The first part of my dissertation is focused on the efficacy of Structured Hidden Markov Models (SHMMs) in identifying threshold traits - discretized traits with underlying one-dimensional continuous variation. Unlike standard, generalized Hidden Markov models, SHMMs allow us to impose specific structures on the hidden states, representing hypotheses about the underlying architecture of the trait. I used SHMMs to model the evolution of discrete threshold characters, manipulating them to represent the quantitative threshold model. This approach allows us to test whether a discrete trait is best described as a one-dimensional threshold trait, even when only discrete data are available.
This method provides a framework for testing explicit hypotheses about trait evolution and validating expert knowledge in character state coding, potentially improving our understanding of complex phenotypic evolution. In no ways am I the first to suggest this methodology, as this work builds off the work of Sergei Tarasov, James Boyko, and others.