# Class Notes for STAT C100 at University of California - Berkeley (UCB)

BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture Notes - Lecture 14: Data Manipulation Language, Data Definition Language, Database

OC13466332 Page
18 Nov 2018
0
View Document
BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture 13: Classification and logistic regression

OC13466333 Page
24 Oct 2018
0
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 10: Probability Mass Function, Random Variable, Sampling Distribution

OC24968313 Page
13 Oct 2018
0
Sample the data that we have and generalize it to the population that is the group we want to study. Random variable: a variable whose value is determi
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 11: Feature Engineering, Missing Data, Data Science

OC24968318 Page
13 Oct 2018
0
The regression line: given scalar data y and x, find m and b that minimizes the mean squared error (a. k. a. Adding a constant feature function is a co
View Document
BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture 9: Model estimation

OC13466334 Page
2 Oct 2018
0
View Document
BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture Notes - Lecture 8: Mean Squared Error, Uses Of English Verb Forms, Smoothness

OC13466333 Page
25 Sep 2018
0
A model is an idealized representation of a system. There is a percent tip that all customers pay. The parameter theta is determine by the universe (es
View Document
BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture Notes - Lecture 4: Coordinated Universal Time, Greenwich Mean Time, Unix Time

OC13466332 Page
8 Sep 2018
0
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 7: Canonicalization, Regular Expression, List Comprehension

OC24968316 Page
13 Oct 2018
0
Our goal: canonicalization: replace each string with a unique representation, feels very hacky , but this is just how it goes. 169. 237. 46. 168 - - [2
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 2: Apache Spark, Ipython, Data Science

OC24968313 Page
13 Oct 2018
0
View Document
BERKELEYSTAT C100Joshua HugFall

## STAT C100 Lecture Notes - Lecture 7: Box Plot, Jigging, Bar Chart

OC13466333 Page
13 Sep 2018
0
Kaiser study: oakland kaiser mothers, 1960s, measure the babies weight in ounces at birth. Birthweights: density curve (width can be trick to choose fo
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 13: Feature Engineering, Overfitting, Invertible Matrix

OC24968318 Page
13 Oct 2018
0
Fitting linear models, regularization and cross validation (domain) feature engineering linear regression. Turn into the feature matrix with entirely q
View Document
BERKELEYSTAT C100Josh HugFall

## STAT C100 Lecture Notes - Lecture 1: Exploratory Data Analysis, Data Science, Big Data

OC24968313 Page
13 Oct 2018
0
The application of data centric, computational, and inferential thinking to: understand the world (science), solve problems (engineering). Good data an
View Document

## Popular Professors

View all professors (2+)

Class Notes (1,100,000)
US (480,000)
Berkeley (10,000)
STAT (500)
STAT C100 (20)