Research
Home Page

My research is in the field of applied statistics, particularly in developing and applying methodologies to solve statistical problems that arise in biology. Exciting statistical challenges are continually arising in molecular biology as experimental techniques change rapidly, creating new and more complicated data problems to be addressed. Statistically, this requires the development of appropriate estimation methodology and high-dimensional inference for new types of high-throughput data, as well as multivariate analysis techniques as tools for integration of heterogeneous sources of data. My research interests lie in developing statistical methods for high-dimensional data arising in the field of biology and genetics. I focus on questions of robust estimation and hypothesis testing for high-throughput biological experiments, in particular gene expression microarrays and next generation sequencing. I am also interested in integration of heterogeneous sources of data, where the data can be multiple experimental platforms or, more generally, arbitrary forms of preexisting biological knowledge such as networks or trees. Statistically, I am interested in questions of high-dimensional inference and multivariate analysis -- problems that arise naturally in trying to create a unified understanding of this type of data.

 


 

Current Research Collaborations

Classification of Cortical Neurons by Single Cell Transcriptomics

    This NIH-supported BRAIN Initiative project aims to provide a suite of technologies for identifying and classifying the diverse cell types in the mammalian nervous system. We are part of multidisciplinary collaboration between 10 research groups at UC Berkeley, headed by John Ngai.

    I jointly lead (with Sandrine Dudoit and Nir Yosef) the computational group responsible for the identification of different cell types and of biomarker targets. We are also responsible for developing quality control and analysis techniques for single cell data.

    See a more detailed description at the Ngai lab webpage.

    Collaborators: John Ngai (Lead PI, MCB), Sandrine Dudoit (Biostatistics), Nir Yosef (CS), Hillel Adesnik (MCB), Helen Bateup (MCB), Dan Feldman (MCB), Jennifer Doudna (MCB, Chemistry, HHMI, LBNL), Dirk Hockemeyer (MCB), Russell Vance (MCB, HHMI)

    UC Berkeley press announcement: September 30, 2014 - NIH awards UC Berkeley $7.2 million to advance brain initiative

Epigenetic Control of Drought Response in Sorghum (EPICON)

  • The goal of this DOE-funded project is to generate a model linking the physiology and genetics of sorghum with its epigenetic regulatory machinery, the dynamics of its microbial community and the composite genetic pathways involved in drought response.

    EPICON's efforts will focus on unraveling the temporal role epigenetic signals play in acclimation to and recovery from drought through effects on individual transcription factors or transcriptional networks that direct entire metabolic pathways. To achieve this goal we will follow responses to water deprivation of two sorghum cultivars differing in their drought responses. Sorghum, a widely cultivated cereal with drought and flood tolerance, offers notable advantages as a bioenergy feedstock due to its relatively reduced environmental footprint versus its close relative, corn. In EPICON's three-year field trial, sorghum will be grown under controlled irrigation conditions. Phenotypic analyses will be conducted to chart growth, flowering, grain and biomass yield, and other observable characteristics. Leaf and root samples will also be taken to perform molecular phenotyping to track spatiotemporal changes in epigenetic, transcriptomic, metabolomic and proteomic footprints. Analysis of the entire data set generated will provide a better understanding of the epigenetic processes responsible for restructuring the metabolic and regulatory landscape of the sorghum genome, and their relationship to drought tolerance.

  • The Purdom group will lead the computational analysis, specifically to integrate the multiple epigenetic datasets to provide understanding of the interplay of these biological processes. We are looking for a post-doc for this project, see advertisement for more information.

  • Collaborators: Peggy Lemaux (Lead PI, PMB UC Berkeley), Devin Coleman-Derr (PMB UC Berkeley, USDA), John Taylor (PMB UC Berkeley), Jeffery A. Dahlberg (Kearney Agricultural Research & Extension Center), Christer Jansson (EMSL, Pacific Northwest National Laboratory), Chia-Lin Wei (Joint Genome Institute)

  • UC Berkeley press announcement: September 28, 2015 - UC Berkeley to spearhead $12.3M project to study crop drought tolerance

Analysis of Mutation Spectrum of tumors

  • This research is centered on the analysis of mutations found by sequencing the DNA of tumor samples. The goal is to understand the mechanisms that lead to the accumulation of DNA mutations in tumors with the goal of better understanding the etiology of cancer.

    Collaborators: Raymond Cho (UCSF), Paul Spellman (Organ Health and Science University)

 


 

Back to top

Last updated 07/23/2015