Our goal is to combine principled statistical models and large-scale data to answer key questions about human disease:

Which genetic variants lead to disease and how?

Genome-wide association studies (GWAS) can tell us where to look for genetic effects on disease, but not how these effects manifest themselves. Disentangling the underlying biological mechanisms of these loci poses the next great challenge for human genetics. Population-scale bulk/single-cell transcriptomics and epigenomics provide a tool to understand these associations. We aim to develop statistical methods for integrating such molecular data to make sense of GWAS findings and to understand their causal contribution to overall disease biology.

papers

Which patients respond to cancer treatments and why?

Predicting which patients will respond to treatment or develop harmful side-effects is a fundamental goal of precision medicine. We aim to integrate genetics and large-scale electronic health record data to identify meaningful biomarkers of treatment consequences and develop algorithmic models to support clinical decision-making.

papers

What is the influence of germline variation on the tumor?

Cancer is a disease influenced by both germline (host) and somatic (tumor) variation, but these influences have largely been studied in isolation. We are interested in understanding how somatic mutations and tumor evolution are impacted by germline risk for cancer & related traits, genetic ancestry, and individual genetic variants. We empower these studies using large-scale clinical sequencing from tens of thousands of tumors.

papers

What can genetics tell us about how populations have evolved?

Genetic data has been collected from millions of individuals across many populations. We are interested in using this data to broaden our understanding of the genetic relationships within and across populations.

papers