Stat 131a: Statistical Methods for Data Science

UC Berkeley

Offerings

  1. Fall 2025
  2. Fall 2024
  3. Spring 2023

Overview

This course teaches a broad range of statistical methods that are used to solve data problems. Topics include group comparisons and ANOVA, standard parametric statistical models, multivariate data visualization, multiple linear regression, logistic regression and classification, regression trees and random forests. An important focus of the course is on statistical computing and reproducible statistical analysis. The course and lab include hands-on experience in analyzing real world data from the social, life, and physical sciences. The R statistical language is used.

Logistics

Three hours of lecture and two hours of laboratory per week. Six hours of lecture and four hours of laboratory per week for 8 weeks.

Prerequisites

DATA/COMPSCI/INFO/STAT C8 or STAT 20; and MATH 1A, MATH 51, MATH 16A, or MATH 10A/10B. Strongly recommended corequisite: STAT 33A or STAT 133.