This course is an introduction to the theory and application of statistical methods. The topics to be covered are fundamental concepts of mathematical statistics, including survey sampling, estimation and hypothesis testing, topics in descriptive statistics and data analysis, with particular emphasis on graphical displays, aspects of experimental design, and a variety of applications. We will cover most of the material in chapters 7-14 of the text. The computer will play a key role in the course, although no prior experience is assumed; the open source statistical package R will be used in labs to analyze real data sets and conduct simulations. These concrete activities will be a valuable complement to the lectures.
Pre-requisites: Calculus and linear algebra. Statistics 134 or an equivalent course in probability theory
Text: J. A. Rice. Mathematical Statistics and Data Analysis. 3rd Edition. We will cover most of chapters 7-14.
Library reserves: I put two optional books on reserve. I recommend that you check them out sometime during the semester.
John Rice
Office: 425 Evans Hall
Phone: 642-6930
Email: rice AT stat.berkeley.edu
url: www.stat.berkeley.edu/users/rice
Office Hours: 1:30-3:30 Tues and by appointment
Harry Kim
Office:
Email: harrykim AT stat.berkeley.edu
Office Hours:
Tu-Th 8:00-9:30. 102 Moffitt
Friday 1:00-2:00 and 2:00-3:00 332 Evans. The section meeting will be used to instruct and help with computer assignements and to review course material.
Lab webpage
Grades will be based on a midterm, a final exam, homework, and labs.
The midterm will count for 25% of your grade; the score on the midterm will be replaced by the score on the final if the latter is higher. There will be no makeup midterms: if you miss the midterm, the score on it will be your score on the final. The the midterm will be on Oct 11.
Solutions to the midterm
Score distribution on the midterm:
The final exam will count 40%. It will be on Monday Dec 17, 12:30-3:30 pm. There will be no alternative times, so if you can't take the exam at this time, don't take the course.
Homework will be assigned every week and will count 20%. Your two lowest homework grades will be dropped. Assignments will be posted below. Homework will be collected in class on Thursdays.
There will be several labs; this component of the course will count 15%. They require data analysis using the statistical software R (see links below).
You are encouraged to work together with others on the homework, but you must write up your own solutions. The same applies to labs -- you must ultimately do your own computing and writing. So, for example, if a lab assignment involved taking a random sample, your random sample had better not be identical to any other in the class. No collaboration is allowed on exams. Cheating will be taken seriously and the penalties will be severe.
R code for the demo in class on 9/11 - estimating proportion of the earth covered by water. Code may differ slightly from what I did in class
R code for 9/18 demo and data (R workspace)
Demo of Oct 18: geissler.txt geissler.R
Demo of Oct 25: smokepreg.txt delinq.txt cont-table.R
Demo of Oct 30: 2sample.R calcium.csv ozonecontrol.csv ozonetreat.csv
Baseball: baseball.R obp_nl.txt
First midterm from 2004
Second midterm from 2004. solutions
Old final exam
Week of Aug 27: Review 4.3; 7.1-7.3.1
Week of Sept 3: 7.3.2-7.3.3, 7.6, skim 7.5
Week of Sept 10: pages 255-272 of Chapter 8
Week of Sept 17: 8.5.1-8.5.3
Week of Sept 24: (Background: 4.6. Example E p 94.) 8.5.2. Skim 8.7: p 298-302. I hope to begin 8.6
Week of Oct 1: 8.6; 9.1, 9.2
Week of Oct 8: 9.1, 9.2, Midterm
Week of Oct 15: 9.4, 9.5
Week of Oct 22: 9.3, 13.3, 13.4
Week of Oct 29: 11.1-11.3
Week of Nov 5: Read 11.4, start Chapter 14 (14.1, 14.2)
Week of Nov 12: Read 10.7; 14.1 - 14.3
Week of Nov 19: Read 14.1, 14.2
Week of Nov 26: Read 14.3 -14.5
Week of Dec 3:
Homework will be due in class on Thursday unless otherwise specified. Late assignments will not be accepted. The list of homework assignments and due dates follows. Show your work.
Sept 6: Chapter 7: 4, 10, 28, 34, 36, 38 (hint: see Lemma A of Section 7.3)
Sept 13: Chapter 7: 8,14,16 (true or false and why),22,32
Sept 20: Chapter 8: 4ac, 16ab, 18ab, 28, 30 (be careful), 34
Sept 27: Chapter 8: 12, 14, 51, 52abc, 58abc
Oct 4: Chapter 8: 8d, 48,62,66
Oct 11: no homework
Oct 18: Chapter 9: 2,4, 20, 30, 32
Oct 25: Chapter 9: 18, 26,33, 34, 40
Nov 1: Chapter 9: 14, 64c; Chapter 13: 2,6,28
Nov 8: Chapter 11: 2, 8, 14, 24 (U_Y = sum sum Z_ij -- see top page 440), 34 (you can do this problem by hand using the tables or using R Consider power against an alternative of the form mu_X > mu_Y)
Nov 15: Chapter 11: 5, 6, 18, 39, 52adghi
Nov 29: Chapter 14: 3,4,7,14,15,22, 25
Dec 6: Chapter 14: 6, 26, 28, 30, 32