Method for identifying cell-type-specific QTL effects by leveraging allele-specific and total expression.

REF: A novel method to identify cell-type specific regulatory variants and their role in cancer risk. Kalita et al. bioRxiv. 2021

SurvNODE: Neural ODEs for Multi-State Survival Analysis

Method for inferring survival trajectories across multiple states (e.g. illness/death) using neural Ordinary Differential Equations (ODEs).

REF: A General Framework for Survival Analysis and Multi-State Modelling. Groha et al. arXiv. 2021


Method for identifying identical-by-descent segments in large genomic data.

REF: Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations. Saada et al. Nature Communications. 2020

MESC: Mediated Expression Score Regression

Method for quantifying the fraction of disease heritability mediated by all QTL effects.

REF: Quantifying genetic effects on disease mediated by assayed gene expression levels. Yao et al. Nature Genetics. 2020

PLASMA: PopuLation Allele-Specific MApping

Method for fine-mapping functional data using eQTL and allelic-imbalance signal.

REF: Allele-Specific QTL Fine Mapping with PLASMA. Wang et al. AJHG. 2020


Method for identifying context-specific allelic imbalance and building allele-specific predictors.

REF: Allelic imbalance reveals widespread germline-somatic regulatory differences and prioritizes risk loci in Renal Cell Carcinoma. pre-print


Methods and data for performing a transcriptome-wide (or any other *ome-wide) association study. Includes a streamlined pipeline for quantifying genetic components of functional features (such as gene expression), building predictive models, predicting functional features into GWAS data, and joint/conditional analyses of associated features. Also includes a resource of trained predictive models from multiple large cohorts, tissues, and functional assays.

REF: Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics. 2016
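As a rough illustration of the association step described above, the sketch below computes a summary-statistic TWAS Z-score for one gene as a weighted sum of GWAS Z-scores, scaled by the LD among the SNPs in the model. It is a minimal sketch and not the software's own code: the function name `twas_z` and the toy weights, Z-scores, and LD matrix are made up for illustration.

```python
import numpy as np

def twas_z(weights, gwas_z, ld):
    """Summary-based association statistic: w'z / sqrt(w' LD w).

    weights : SNP weights from an expression predictive model (hypothetical values)
    gwas_z  : GWAS Z-scores for the same SNPs, in the same order
    ld      : SNP-by-SNP correlation (LD) matrix from a reference panel
    """
    weights = np.asarray(weights, dtype=float)
    gwas_z = np.asarray(gwas_z, dtype=float)
    ld = np.asarray(ld, dtype=float)
    # Variance of the predicted feature under the reference LD
    denom = np.sqrt(weights @ ld @ weights)
    if denom == 0:
        raise ValueError("weights have zero variance under the LD matrix")
    return float(weights @ gwas_z / denom)

# Toy inputs, made up for illustration only
weights = [0.5, 0.0, -0.3]
gwas_z = [3.0, 1.0, -2.0]
ld = np.eye(3)  # assumes the three SNPs are uncorrelated
z = twas_z(weights, gwas_z, ld)
```

Under the null of no association, a statistic of this form is approximately standard normal, so it can be converted to a p-value in the usual way. A real analysis would also need allele alignment between the model, the GWAS, and the LD reference, which this sketch omits.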


Method for detecting IBD-shared haplotypes and testing them for association with a trait. Infers haplotype clusters from IBD segments (for example, detected by the GERMLINE algorithm below), generating pseudo-SNP data for association testing.

REF: DASH: a method for identical-by-descent haplotype mapping uncovers association with recent variation. The American Journal of Human Genetics. 2011

Data / Pipelines

CWAS: Cistrome-Wide Association Studies

A workflow for training predictive models of the epigenomic “cistrome” and testing for association with GWAS disease data.

REF: Genetic determinants of chromatin reveal prostate cancer risk mediated by context-dependent gene regulation. Baca et al. bioRxiv. 2021


A workflow for germline imputation from tumors with quality control, ancestry inference, and polygenic risk scoring.

REF: Constructing germline research cohorts from the discarded reads of clinical tumor sequences. Gusev et al. Genome Med. 2021

Interactive browser for TWAS results from hundreds of complex traits.

Chromatin TWAS

Data and analysis of chromatin/expression/splicing and schizophrenia (see below for TWAS methods). Includes genome-wide results from TWAS of expression/splicing in three tissues (four studies) and schizophrenia. Also includes genome-wide results from TWAS of expression/splicing and chromatin activity measured by ChIP-Seq in ~100 individuals (two studies).

REF: Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nature Genetics. 2018



This code has been superseded by the FUSION software above. Legacy implementation archived here.

Methods for performing a transcriptome-wide association study. Identifies associations between the genetic component of gene expression and a trait using eQTL and GWAS data only.

REF: Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics. 2016


See GERMLINE2 above

Method for fast, pairwise detection of segments identical by descent. Uses hashing techniques to efficiently identify long stretches of shared DNA between pairs of individuals from array SNP data.

REF: Whole population, genome-wide mapping of hidden relatedness. Genome Research. 2009
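The hashing idea in the description above can be sketched as follows: cut each phased haplotype into fixed-size windows, bucket haplotypes by the exact contents of each window, and report pairs that land in the same bucket. This is only a toy illustration under stated assumptions, not the published implementation: the function names, the string encoding of alleles, and the window size are hypothetical, and the sketch handles exact matches only, with no extension across mismatches or genotyping error.

```python
from collections import defaultdict
from itertools import combinations

def shared_windows(haplotypes, window):
    """Map each pair of haplotype IDs to the window indices where they match exactly.

    haplotypes : dict of ID -> allele string (e.g. "0011..."), all the same length
    window     : number of sites per hashed window (hypothetical parameter)
    """
    n_sites = len(next(iter(haplotypes.values())))
    matches = defaultdict(list)
    for w, start in enumerate(range(0, n_sites - window + 1, window)):
        # Bucket haplotypes by the contents of this window (the hash key)
        buckets = defaultdict(list)
        for hap_id, alleles in haplotypes.items():
            buckets[alleles[start:start + window]].append(hap_id)
        # Only haplotypes in the same bucket are compared, avoiding all-pairs scans
        for ids in buckets.values():
            for a, b in combinations(sorted(ids), 2):
                matches[(a, b)].append(w)
    return dict(matches)

def longest_run(window_indices):
    """Length of the longest run of consecutive window indices (sorted input)."""
    best = run = 0
    prev = None
    for w in window_indices:
        run = run + 1 if prev is not None and w == prev + 1 else 1
        best = max(best, run)
        prev = w
    return best

# Toy data, made up for illustration
haps = {"h1": "00110101", "h2": "00110111", "h3": "11001000"}
pairs = shared_windows(haps, window=4)
```

Pairs whose longest run of matching windows exceeds a chosen length threshold would be candidate IBD segments. Because haplotypes are only compared within a bucket, the cost is driven by the number of matching pairs rather than by all pairs of individuals.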


Genotype phasing by entropy minimization.

REF: Highly scalable genotype phasing by entropy minimization. IEEE/ACM TCBB. 2008