QBUS3820 Lecture Notes - Lecture 3: Model Selection, Test Statistic, Mahalanobis Distance
QBUS3820: Machine Learning and Data Mining in Business
Lecture 3: Linear Regression and K-Nearest Neighbours
Associate Prof. Peter Radchenko
Semester 1, 2018
Discipline of Business Analytics, The University of Sydney Business School
Lecture 3: Linear Regression and K-Nearest Neighbours
1. Statistical properties of OLS
2. The Gaussian MLR model
3. K-Nearest Neighbours
4. Comparison with linear regression
Statistical properties of OLS
Document Summary
Discipline of Business Analytics, The University of Sydney Business School. Lecture 3: Linear Regression and K-Nearest Neighbours covers the statistical properties of OLS, the Gaussian MLR model, K-nearest neighbours, and a comparison with linear regression. In classical statistics, the population parameter β is fixed and the data is a random sample from the population. We estimate β by applying an estimator β̂ to the data (in our case, the OLS estimator). Because the sample on which the estimator β̂ is computed is random, the value of the estimator is a random variable; the distribution of this random variable is called the sampling distribution. We can study the uncertainty of an estimate by understanding the sampling distribution of the estimator. Suppose that we draw a large number of different datasets D(s) (s = 1, . . . , S). On each of these datasets, we apply the estimator β̂ and obtain a set of estimates {β̂(D(s))}, s = 1, . . . , S. The sampling distribution can be thought of as the distribution of these estimates as we let S grow large.
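The repeated-sampling idea above can be illustrated with a short simulation. This is a minimal sketch, not from the lecture: the model, the true slope beta = 2.0, the noise level, and the no-intercept design are all illustrative choices. Each loop iteration draws a fresh dataset D(s), computes the OLS estimate on it, and the collection of estimates approximates the sampling distribution.

```python
import numpy as np

rng = np.random.default_rng(0)
beta, sigma, n, S = 2.0, 1.0, 50, 5000  # illustrative values, not from the notes

estimates = []
for _ in range(S):
    # Draw a fresh dataset D(s): random x, then y from the model with noise
    x = rng.normal(size=n)
    y = beta * x + rng.normal(scale=sigma, size=n)
    # OLS slope for a single predictor with no intercept
    b_hat = (x @ y) / (x @ x)
    estimates.append(b_hat)

estimates = np.array(estimates)
# The histogram of `estimates` approximates the sampling distribution:
# its mean is close to beta (OLS is unbiased), and its spread measures
# the uncertainty of a single estimate.
print(estimates.mean(), estimates.std())
```

Increasing S makes the empirical distribution of the estimates a better approximation of the true sampling distribution; increasing n shrinks its spread.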