Publications
- N. Meinshausen, G. Rocha, and B. Yu (2007). A tale of three cousins: Lasso, L2Boosting, and Danzig Annals of Statistics (invited discussion on Candes and Tao's Danzig Selector paper)
- V. Vu, B. Yu, and R. Kass (2007). Coverage Adjusted Entropy Estimation. Tech Report. 727 Stat Dept. UCB (accepted by Statistics and Medicine)
- P. Zhao, G. Rocha, and B. Yu (2006). Grouped and hierarchical model selection through composite absolute penalties. Annals of Statistics (to appear)
- N. Meinshausen and B. Yu (2006). Lasso-type recovery of sparse representations for high-dimensional data. Tech. Report, Statistics, UC Berkeley (revised in Aug, 2007)
- B. Yu (2006). Comments on: Regularization in Statistics, by P. J. Bickel and B. Li. Test, vol. 15 (2), pages 314-316.
- J. Gao, H. Suzuki, and B. Yu (2006). Approximation Lasso Methods for Language Modeling. Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 225-232, Sydney.
- T. Shi, B. Yu, E. Clothiaux, and A. Braverman (2006). Daytime Arctic Cloud Detection based on Multi-angle Satellite Data with Case Studies. Journal of American Statistical Association (accepted, 2007)
- B. Yu (2007). Embracing Statistical Challenges in the Information Technology Age Technometrics (special issue on statistics and information technologies). vol. 49 (3), 237-248.
- X. Jiang, Y. Liu, B. Yu and M. Jiang (2007). Comparison of MISR aerosol optical thickness with AERONET measurements in Beijing metropolitan area. Remote Sensing of Environment (Special Issue on Multi-angle Imaging SpectroRadiometer), vol. 107, pp. 45-53.
- Peng Zhao and Bin Yu (2006). On Model Selection Consistency of Lasso. J. Machine Learning Research, 7 (nov), 2541-2567.
- T. Shi, E. E. Clothiaux, B. Yu, A. J. Braverman, and G. N. Groff (2007). Detection of Daytime Arctic Clouds using MISR and MODIS Data. Remote Sensing of Environment (Special Issue on Multi-angle Imaging SpectroRadiometer), vol. 107, pp. 172-184.
- P. Buhlmann and B. Yu (2006). Sparse Boosing Journal of Machine Learning Research ( 7 (June), 1001-1024). This is a shortened and more focused version of Buhlmann and Yu "Boosting, Model Selection, Lasso and Nonnegative Garotte" given below.
- T. Shi and B. Yu (2005). Binning in Gaussian Kernel Regularization. Statistica Sinica (special issue on machine learning), 16, 541-567.
- G. Liang, N. Taft, and B. Yu (2005). A fast lightweight approach to origin-destination IP traffic estimation using partial measurements. Tech Report 687, Statistics Department, UCB (accepted for Special Issue of IEEE-IT and ACM Networks on data networks, Jan. 2006)
- C. D. Giurcaneanu and B. Yu (2005). Efficient algorithms for discrete universal denoising for channels with memeory. Tech. Report 686, Statistics Department, UCB (to appear in Proc. ISIT, Sept. 2005)
- P. Buhlmann and B. Yu (2005). Boosting, Model Selection, Lasso and Nonnegative Garotte. Tech. Report, UC Berkeley.
- Tong Zhang and B. Yu (2005). Boosting with early stopping: convergence and consistency. The Annals of Statistics. Vol. 33, 1538-1579.
- D. J. Diner et al (2004). PARAGON: A Systematic, Integrated Approach to Aerosol Observation and Modeling. American Meterological Society, Oct., 1491-1501.
- P. Zhao and B. Yu (2004). Stagewise Lasso (old title: Boosted Lasso) Tech. Report #678, Statistics, UC Berkeley (December, 2004; revised in April, 2005. accepted by J. Machine Learning Res, 2007).
- R. Jorsten and B. Yu (2004). Compressing genomic and proteomic array images for statistical analyses. Invited chapter in a book on Genomic Engineering.
- T. Shi, B. Yu, E. Clothiaux, A. Braverman (2004). Cloud detection over ice and snow using MISR data. Tech. Report 663, Stat Dept, UCB.
- P. Buhlmann and B. Yu (2004). Discussion on three boosting papers by Jiang, Lugosi and Vayatis, and Zhang Annals of Statistics. 32 (1): 96-101.
- R. Castro, M. Coates, G. Liang, R. Nowak, and B. Yu (2003). Internet Tomography: Recent Developments Statistical Science. Vol. 19(3), 499-517.
- G. Liang and B. Yu (2003). Maximum Pseudo Likelihood Estimation in Network Tomography. IEEE Trans. on Signal Processing (Special Issue on Data Networks). 51(8), 2043-2053
- R. Jornsten, W. Wang, B. Yu, and K. Ramchandran (2003). Microarray image compression: SLOCO and the effects of information loss. Signal Processing Journal (Special Issue on Genomic Signal Processing). 83, 859-869.
- Rebecka Jornsten and Bin Yu (2003). Simultaneous Gene Clustering and Subset Selection for Classification via MDL. Bioinformatics. 19(9): 1100-1109.
- Mark Hansen and Bin Yu (2002). Minimum Description Length Model Selection Criteria for Generalized Linear Models. {\em Science and Statistics: Festschrift for Terry Speed}, IMS Lecture Notes -- Monograph Series, Vol. 40.
- Rebecka Jornsten, and Bin Yu (2002). Multiterminal Estimation: Extensions and a Geometric interpretation. Proceedings of International Symposium on Information Theory (ISIT), June, 2002.
- Peter Buhlmann and Bin Yu (2003). Boosting with the L2 Loss: Regression and Classification. J. Amer. Statist. Assoc. 98, 324-340.
- Mark Coates, Alfred Hero, Robert Nowak, and Bin Yu (2002). Internet Tomography. Signal Processing Magazine. vol. 19, No. 3 (May issue), 47-65.
- Gerald Schuller, Bin Yu, Dawei Huang, and Bern Edler (2002). Perceptual Audio Coding using Pre- and Poster- Filters and Lossless Compression. IEEE Trans. Speech and Audio Processing. Vol. 10 (6), 379-390
- Rebecka Jornsten and Bin Yu (2000). ``Comprestimation": Microarray Images in Abundance. Proc. of Conference on Information Science and Systems. Princeton, March 14-17, 2000.
- Jin Cao, Scott Vander Wiel, Bin Yu, and Zhenyuan Zhu (2000). [ PDF | A Scalable Method for Estimating Network Traffic Matrices from Link Counts. Preprint.
- Peter Buhlmann and Bin Yu (2002). Analyzing Bagging. Annals of Statistics vol. 30, 927-961.
- Peter Buhlmann and Bin Yu (2000a). Discussion. Additive logistic regression: a statistical view of boosting, by Friedman, J., Hastie, T. and Tibshirani, R. Annals of Statistics. Vol. 28, 377-386
- Mark Hansen and Bin Yu (2000). Wavelet thresholding via MDL for natural images. IEEE Trans. Inform. Theory (Special Issue on Information Theoretic Imaging). vol. 46, 1778-1788.
- Rebecka Jornsten and Bin Yu (1999). Insensitivity of model estimation in adaptive scalar-quantization for wavelet subband coding. IEEE Trans. Inform. Theory. submitted.
- Jin Cao, Drew Davis, Scott Vander Wiel and Bin Yu (2000). [ PDF | Time-varying network tomography: router link data. J. Amer. Statist. Assoc. vol. 95, 1063-1075.
- Jorma Rissanen and Bin Yu (2000). Coding and compression: a happy union of theory and practice. J. Amer. Statist. Assoc. (Year 2000 Commemorative Vignette on Engineering and Physical Sciences). vol. 95, 986-988.
- Lei Li and Bin Yu (2000). Iterated logarithm expansions of the pathwise code lengths for exponential families. IEEE Trans. Inform. Theory. vol. 46, 2683-2689.
- Bin Yu (1999). Codes and Models. IMS Special Invited Paper (Talk Slides) , ENAR/IMS/ASA Spring Meeting, Atlanta, March.
[ PDF | PostScript ]
- Mark Hansen and Bin Yu (2001). Model selection and Minimum Description Length principle. J. Amer. Statist. Assoc. vol. 96, 746-774.
- G. Chang, B. Yu and M. Vetterli (2000). Spatially adaptive wavelet thresholding based on context modeling for image denoising. IEEE Trans. Image Processing, vol. 9, 1522-1531.
- G. Chang, B. Yu and M. Vetterli (2000). Adaptive wavelet thresholding for image denoising and compression. IEEE Trans. Image Processing, vol. 9, 1532-1546.
- G. Chang, B. Yu and M. Vetterli (2000). Wavelet thresholding for multiple noisy image copies. IEEE Trans. Image Processing, vol. 9, 1631-1635.
- Y. Yoo, A. Ortega, and B. Yu (1999). Image subband coding using context-based classification and adaptive quantization. IEEE Trans. Image Processing, vol. 8, 1702-1215.
- P. Gong, R. Pu and B. Yu (1999) Conifer species recognition: effects of data transformatoin and band width. Remote Sensing of Environment, (in press).
- B. Yu, M. Ostland, P. Gong and R. Pu (1999). Penalized discriminant analysis of in situ hyperspectral data for conifer species recognition. IEEE Trans. Geoscience and Remote Sensing, in press.
- A. Barron, J. Rissanen, and B. Yu (1998). The Minimum Description Length principle in coding and modeling. (Special Commemorative Issue: Information Theory: 1948-1998) IEEE. Trans. Inform. Th., Oct, 2743-2760.
- B. Yu and P. Mykland (1998). Looking at Markov samplers through cusum path plots: a simple diagnostic idea. Statistics and Computing , 8, 275-286.
- M. Ostland and B. Yu (1997). Exploring quasi Monte Carlo for marginal density approximation. Statistics and Computing, 7, 217-228.
- P. Gong, R. Pu, and B. Yu (1997). Conifer species recognition with in Situ hyperspectral data. Remote Sensing of Environment, 62, 189-200.
- B. Yu and T. P. Speed (1997). Information and the clone mapping of chromosomes. Ann. Statist. 25, 169-185.
- D. Nelson, T. Speed, and B. Yu (1997). The limits of random fingerprinting. Genomics, 40, 1-12.
- B. Yu (1997). Assouad, Fano, and Le Cam. Festschrift for Lucien Le Cam . D. Pollard, E. Torgersen, and G. Yang (eds), pp. 423-435, Springer-Verla g.
- G. Chang, B. Yu, and M. Vetterli (1997). Bridging compression to wavelet thresholding as a denoising method. In Proc. of the 31st Conference on Information Sciences and Systems, The Johns Hopkins Univ, Maryland.
- Y. Yoo, A. Ortega, and B. Yu (1996). Adaptive quantization of image subbands with efficient overhead rate selection. In Proceedings of IEEE International Conference on Image Processing, Lausanne, Switzerland.
- B. Yu (1996). Minimum Description Length Principle: a Review. In Proc. of the 30th Conference on Information Sciences and Systems, Princeton Univ, New Jersy.
- B. Yu (1996). Lower bounds on expected redundancy for nonparametric classes. IEEE Trans. on Information Theory, 42, 272-275.
- J. Rissanen and B. Yu (1995). MDL learning. In Learning and Geometry: Computational Approaches, Progress in Computer Science and Applied Logic, 14, David Kueker and Carl Smith (eds), Birkhäuser, Boston, pp. 3-19.
- B. Yu (1995). Comment: Extracting more diagnostic information from a single run using cusum path plot. Statist. Sci., 10, 54-58.
- P. Mykland, L. Tierney, and B. Yu (1995). Regeneration in Markov Chain samplers. J. Amer. Statist. Assoc., 90, 233-241.
- B. Yu (1994a). Rates of convergence for empirical processes of stationary mixing sequences. Ann. Probab. 22, 94-116.
- M. Arcones and B. Yu (1994a). Central limit theorems for empirical and U-processes of stationary mixing sequences. J. Theor. Probab. 7, 47-71.
- B. Yu (1994b). Lower bound on the expected redundancy for classes of continuous Markov sources. In Statistical Decision Theory and Related Topics V, S. S. Gupta and J. O. Berger (eds), 453-466.
- M. Arcones and B. Yu (1994b). Limit theorems for empirical processes under dependence. In Proc. in Chaos expansions, multiple Itô--Wiener integrals and their applications. 205-221.
- A. R. Barron, Y. Yang and B. Yu (1994). Asymptotically optimal function estimation by minimum complexity criteria. In Proceedings of 1994 International Symposium on Information Theory, pp. 38, Trondheim, Norway.
- B. Yu (1993). Density estimation in the
norm for dependent data with applications to the Gibbs sampler. Ann. Statist. 21, 711-735.
- B. Yu and T. Speed (1993). A rate of convergence result for a universal D-semifaithful code. IEEE Trans. on Information Theory 39, 8813-820.
- T. Speed and B. Yu (1993). Model selection and prediction: normal regression. J. Inst. Statist. Math. 45, 35-54.
- J. Rissanen, T. Speed and B. Yu (1992). Density estimation by stochastic complexity. IEEE Trans. on Information Theory, 38, 315-323.
- B. Yu and T. Speed (1992) Data compression and histograms. Probability Theory and Related Fields, 92, 195-229.
TECHNICAL REPORTS
- B. Yu (1994). Estimating the
error of kernel estimators for Markov samplers. Technical Report 409, Department of Statistics, UC Berkeley.
- B. Yu (1996). A Statistical analysis of adaptive scalar quant ization based on quantized past data.
Last Modified 02/16/1999