Stat 133 Syllabus
Statistical Thinking and the Data Analysis Cycle
- Data ACQUISITION -- Input/output, regular expressions
- Data CLEANING, verification, and manipulation -- graphics, exploratory data analysis
- Data ORGANIZATION -- data frames, XML, databases
- MODEL the data -- fit statistical models to the data
- Data as a PSEUDO-POPULATION -- assess the fit of the model via the
bootstrap, cross-validation
- SIMULATED data -- simulation studies
Statistical Concepts
- Graphics
- elements of graphing data
- grammar of graphics
- advanced plotting
- Computationally intensive methods
- Hierarchical Bayes
- Nearest Neighbor methods
- Thin plate splines
- Simulation tools
- Bootstrap
- Cross-validation
- Monte Carlo Markov Chain
Computing Concepts
- Programming concepts - e.g. loops, recursion, trees
- Regular expressions and text manipulation
- Relational Databases
- Random number generation
- Representation of numbers in the computer
- Event handling and GUI development
Software
- R - statistical software
- Unix - shell commands
- SQL - Structured Query Language for relational databases
- XML - Extensible Markup language
- Gtk/wXWidgets - Toolkits for creating graphical user interfaces