|
|
Haiyan Huang, PhD
|
|

|
Associate Professor
Department of Statistics
Interdepartmental Group
in Biostatistics
Graduate Group in Computational
and Genomic Biology.
University of
California, Berkeley
CA, 94720, USA
Tel: (510)642-6433
Fax: (510)642-7892
Email: hhuang
AT stat DOT berkeley DOT edu
|
|
|
|
|
|
|
|
|
People
Current Graduate Students:

Former
Graduate Students:
·
Siew-leng Melinda Teng, PhD, 2007 Summer (Thesis
Title: Statistical methods in integrative analysis
of gene expression data with applications to
biological pathways; Current Position: Statistician,
Genentech, Inc.)
·
Na Xu, PhD, 2008 summer, co-advised with Prof. Peter Bickel (Thesis Title: Transcriptome
Detection by Multiple RNA Tiling Array Analysis and Identifying Functional Conserved Non-coding
Elements by Statistical Testing;
Current Position: Statistician,
Genentech, Inc.)
·
Kyungpil Kim, PhD, 2013 (Thesis Title:
Application of Statistical Methods to Integrative Analysis of Genomic Data; Current
Position: Postdoctoral Research Fellow, Children’s Hospital Oakland
Research Institute)
·
Daisy Yan Huang, PhD (Thesis Title:
Overcoming the Small Sample Size Challenge in Differential Gene Expression
Analysis Studies; Current Position: Statistician, Amazon)
·
Hua Chen, Master 2008 (Thesis Title: Bayesian Method for Multi-Loci
Association Study of Human Disease; Current Position: Research fellow, Harvard University)
·
Ling
Meng, Master 2009 (Thesis
Title: Learning Algorithm and Model Selection for Protein-Protein
Interaction Inference in Arabidopsis; Current Position: Research fellow,
UC Berkeley)
Former Postdocs:
·
Ci-Ren Jiang, Postdoc Sept 2009 – Aug 2010 (Current
position: (Tenure-track) Associate Research Fellow of the Institute of Physics,
Academia Sinica, Taipei, Taiwan)
·
Qunhua
Li, Postdoc Sept 2008 – July 2011, co-advised with Professor Peter
Bickel (Current position: Assistant
Professor, Department of Statistics, Penn State University)
Major Collaborators:
·
Peter Bickel, Statistics Department, UC Berkeley
·
Ron Krauss, Children’s Hospital Oakland Research
Institute
·
Xianghong Zhou, Computational and Molecular
Biology, USC
·
Lewis Feldman, Plant & Microbial
Biology, UC Berkeley
·
Lydia Sohn, Mechanical Engineering, UC Berkeley
Teaching
Undergraduate
courses:
STAT
152: Survey Sampling (Falls
2003 – 2006)
BIOE/STAT
C141: Statistics for Bioinformatics (Springs 2004 – 2008)
STAT
131A: Statistical Inferences for Social and Life Scientists (Spring 2009)
STAT
157: Seminar on
Topics in Probability and Statistics (Fall 2009)
STAT
133: Concepts in Computing with Data (Spring 2013)
Master
courses:
STAT
200B: Introduction to Probability and Statistics at an Advanced Level (Springs 2006, 2007, 2011, 2012)
PhD
courses:
STAT 215A: Statistical Models:
Theory and Application (Fall 2011)
STAT 210A: Theoretical
Statistics (Falls
2008
– 2010)
STAT
246: Statistical Genetics (Spring 2009; co-teaching with Prof. S Dudoit)
STAT
C245E/F: Statistical Genomics (Springs 2010, 2012, 2013; co-teaching with Prof.
S Dudoit and Prof. R Nielson)
STAT
272: Statistical Consulting (Fall 2010)
Publications
- Jiang
CR, Liu CC, Zhou XJ, Huang H*. Optimal Ranking in Multi-label
Classification Using Local Precision Rates. Under revision. Statistica
Sinica.
*corresponding author
- Wang
YXR, Jiang K, Feldman LJ, Bickel PJ, Huang H*. Inferring gene networks
using sparse canonical correlation analysis. Under revision. AOAS.
*corresponding author
- Chapman
MR, Balakrishnan KR, Li J, Conboy MJ, Huang H, Mohanty SK, Jabart E, Hack
J, Conboy IM, Sohn LL (2013). Sorting single satellite cells from
individual myofibers reveals heterogeneity in cell-surface markers and
myogenic capacity. Integrative Biology. 5(4):692-702.
- ENCODE
Consortium Project (2012). An Integrated Encyclopedia of DNA Elements in
the Human Genome. Nature. 489, 57-74.
- Kim K,
Teng S, Jiang K, Feldman L, Huang H* (2012). Using biologically
interrelated experiments to identify pathway genes in arabidopsis. Bioinformatics.
28(6), 815-822. [Paper
Link]
*corresponding author
- Gao Q,
Ho C, Jia Y, Li JJ, Huang H* (2012). Biclustering of Linear Patterns in
Gene Expression Data (CLiP). Journal of Computational Biology. 19(6),
619-631.
*corresponding author
- Li JJ,
Jiang CR, Brown BJ, Huang H*, Bickel PJ* (2011). Sparse Linear Modeling of
RNA-seq Data for Isoform Discovery and Abundance Estimation. Proc Natl
Acad Sci. USA. 108 (50) 19867-19872. [Paper
Link]
*co-corresponding
authors
- Li Q,
Brown JB, Huang H, Bickel PJ. (2011). Measuring Reproducibility of
High-throughput Experiments. Annals of Applied Statistics. 5(3),
1752-1779. [Paper
Link]
- Li Y,
Huang H and Cai L (2011) Prediction of Transcriptional Regulatory Networks
for Retinal Development. A chapter in book “Computational Biology and Applied
Bioinformatics” edited by Lopes HS and Cruz LM.
- Durinck
S, Ho C, Wang NJ, Liao W, Jakkula LR, Collisson EA, Pons J, Chan SW, Lam
ET, Chu C, Park K, Hong S, Hur JS, Huh N, Neuhaus IM, Yu SS, Grekin RC, Mauro TM,
Cleaver JE, Kwok P, LeBoit PE, Getz G, Cibulskis K, Aster JC, Huang H,
Purdom E, Li J, Bolund L,
Arron ST, Gray JW, Spellman PT, Cho RJ (2011). Temporal Dissection of
Tumorigenesis in Primary Cancers. Cancer Discovery, 1:137-143.
- Xu N,
Bickel PJ, Huang H* (2010). Genome-wide Detection of Transcribed Regions
through Multiple RNA Tiling Array Analysis. International Journal of Systems and
Synthetic Biology. 1(2) 155-170.
*corresponding
author
- Huang
H*, Liu C, Zhou XJ* (2010). Bayesian Approach to Transforming Public Gene
Expression Repositories into Disease Diagnosis Databases. Proc Natl
Acad Sci. USA. 107 (15) 6823-6828. [Paper Link]
*co-corresponding
authors
·
This paper is selected for issue highlight by PNAS:
http://www.pnas.org/content/107/15/6553.full.pdf+html (it is under the title
"Gene databases mined for diagnoses").
·
This paper has been selected for Faculty of 1000 Biology (http://www.f1000biology.com)
and evaluated by Dr. Russ Altman from Stanford University:
http://www.f1000biology.com/article/id/3925957. (Faculty of 1000 Biology is an award-winning online service
that highlights and evaluates the most interesting papers published in the
biological sciences, based on the recommendations of over 2000 of the world's
top researchers.)
·
This paper has also been reported in the following news reports:
·
GenomeWeb daily news. “Team Develops Proof-of-Principle Diagnostic Database
for Applying Public Gene Expression Data”. March 22, 2010.
·
National Cancer Institute. Research News. “Mathematical Modeling Turns
Gene Expression Data into Disease Diagnostics”
http://physics.cancer.gov/news/2010/april/po_news_b.asp
·
Biocentury and Nature publishing group. “GEO: world of diagnostic
potential,” Haas, M.J. SciBX 3(14); April 8, 2010
- Bickel
PJ, Boley N, Brown JB, Huang H, Zhang NR (2010). Subsampling Methods for Genomic Inference. Annals of Applied Statistics. 4(4)
1660-1697.
(authors
ordered alphabetically)
- Jiang K, Zhu T, Diao Z, Huang H,
Feldman LJ. (2010). The Maize Root Stem Cell Niche: A Partnership between
Two Sister Cell Populations. Planta, 231(2):411-24.
[Paper
Link]
- Bickel
P, Brown B, Huang H, Li Q (2009). An overview of recent
developments in genomics and associated statistical methods. Philosophical Transactions of the Royal Society A 367,
4313-4337. [Paper
Link]
(authors
ordered alphabetically)
- Teng
S, Huang H* (2009). A statistical
framework to infer functional gene
associations from multiple biologically interrelated
microarray experiments. Journal of the American
Statistical Association, June 2009, Vol. 104, No. 486. [Paper Link]
*work with
PhD student
- Wang
F, Jiang T, Sun Z, Teng SL, Luo X, Zhu Z, Zang Y, Zhang H, Yue W, Hong N,
Huang H, Blumberg H, Zhang, D (2009). Neuregulin 1 genetic variation
and anterior cingulum integrity in patients
with schizophrenia and healthy controls. Journal of
Psychiatry & Neuroscience, 2009 May;34(3):181-6. [Paper
Link]
- Liu C,
Hu J, Kalakrishnan M, Huang H*, Zhou XJ* (2009) Integrative disease classification based
on cross-platform microarray Data. BMC Bioinformatics, 2009 Jan;10
Suppl 1:S25. [Paper Link]
*co-corresponding
authors
- Carbonaro
A, Mohanty SK, Huang H, Godley LA and Sohn LL (2008) Cell characterization using a protein-functionalized Pore. Lab Chip, 8(9):1478-85. [Paper
Link]
- Huang
H, Cai L, Wong WH. (2008) Clustering analysis of
SAGE transcription profiles using a Poisson approach. Methods Mol Biol. 2008; 387:185-98. (Book Chapter) [Paper
Link]
- Huang
Y, Li H, Hu H, Yan X, Waterman MS, Huang H, Zhou XJ (2007). Systematic discovery of functional modules and context-specific functional annotation of human genome. Bioinformatics, 23(13):i222-i229. [Paper
Link]
- ENCODE
Consortium (2007)._ Identification and analysis
of functional elements in 1% of
the human genome by the ENCODE pilot project. Nature. 447, 799-816. [Paper
Link]
- Kim K,
Zhang S, Jiang K, Cai L, Lee IB, Feldman LJ, Huang H* (2007). Measuring similarities
between gene expression profiles through data transformations. BMC Bioinformatics, 8:29 (highly accessed
paper). [Paper Link]
*corresponding author; work with
graduate student
- Jiang
K, Zhang S, Lee S, Tsai G, Kim K, Huang H, Zhu T, Feldman LJ (2006). Transcription profile analyses identify
genes and pathways central to root cap functions in Maize. Plant Molecular Biology, 60(3):343-63. [Paper Link]
- Huang
H, Kim K (2006). Unsupervised clustering analysis of gene expression, Chance,
vol. 19, No.3. [Paper Link]
- Zhou
XJ, Kao MJ, Huang H, Wong A, Nunez-Iglesias J, Aparicio OM, Morgan TE, Wong WH (2005). Functional
annotation and network reconstruction through cross-platform
integration of microarray data. Nature Biotechnology, 23(2):238-43. [Paper
Link]
- Zhao
X, Huang H, Speed T (2005). Finding short DNA motifs using permuted Markov
models. Journal
of Computational Biology, 12(6): 894-906 (journal version of the 2004 RECOMB paper; numbered as 16 below) [Paper
Link]
- Huang
H, Kao MJ, Zhou X, Liu JS, Wong WH (2004). Determination of local
statistical significance of patterns in Markov sequences with application
to promoter element
identification.” Journal of Computational Biology, 11(1):1-14.
[Paper
Link]
- Cai
L*, Huang H*, Blackshaw S, Liu JS, Cepko CL, Wong WH (2004). Clustering analysis of SAGE data using a
Poisson approach. Genome
Biology, 5(7):R51. [Paper Link]
*Joint first authors
- Zhao
X, Huang H, Speed T (2004). Finding short DNA motifs using permuted Markov
models. Proceedings
of RECOMB 2004. [Paper Link]
- Blackshaw
S, Harpavat S, Trimarchi J, Cai L, Huang H, Kuo W, Fraioli R, Cho S, Yung R, Asch E, Wong WH, Cepko CL
(2004). Genomic analysis of mouse
retinal development. PLoS Biol,
2(9):E247. [Paper
Link]
- Allinen
M, Beroukhim R, Cai L, Brennan C, Domenici CJ, Huang H, Porter D, Hu M, Chin L, Richardson A, Schnitt
S, Sellers W, Polyak K (2004). Molecular characterization of the tumor microenvironment
in breast cancer. Cancer Cell, 6(1):17-32. [Paper
Link]
- Lippert
RA, Huang H, Waterman MS (2002). Distributional regimes for the number of k-word matches between two
random sequences. Proc Natl Acad Sci. USA, 99(22):13980-9. [Paper Link]
- Huang
H (2002). Error bounds on multivariate normal approximations for word
count statistics. Advances in Applied Probability, 34(3): 559-586. [Paper
Link]
Books Edited
- “Research
in Computational Molecular Biology” (11th Annual International Conference,
RECOMB 2007), edited by Terry Speed and Haiyan Huang, Published by
Springer. [Book
Link]
Articles Submitted or Manuscripts in Preparation
- Li JJ,
Huang H, Bickel PJ, Brenner S (2013). Comparison between developmental
stages of D. melanogaster and C. elegans with modENCODE RNA-Seq data.
Manuscript in Preparation.
- Lee W,
Huang H (2013). Decision Making in Hierarchical Multi-label
Classification. Manuscript in Preparation.