MATH1005 Lecture Notes - Lecture 5: Scatter Plot
Document Summary
Bivariate data involves a pair of variables, and we are interested in the relationship between the 2 variables. Formally, we have pairs (x, y), where x is the independent variable and y is the dependent variable. A scatter plot is a graphical summary of the 2 variables on the same 2D plane, shown as a cloud of points. The cloud can be summarised by 5 numerical summaries: the mean of x, the mean of y, the sd of x, the sd of y, and the correlation between the 2 variables. The correlation describes how tightly the points cluster along a line. If it is strong, the cloud of points is tightly clustered around the line, which means that predictions of one variable from the other will be good. If one variable tends to increase as the other increases, then it is a positive association. The centre of the cloud is the point of averages (x̄, ȳ). The horizontal spread is measured by the sd of x. The vertical spread is measured by the sd of y.
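
As a rough illustration of the 5 numerical summaries, here is a minimal Python sketch. The function name `five_summaries` and the data values are made up for this example and are not from the lecture. It assumes the population sd (divide by n) and computes the correlation as the average product of the standard units; the course may use the sample sd instead, which changes the sd values but not the correlation.

```python
from statistics import fmean, pstdev


def five_summaries(xs, ys):
    """Return (mean_x, mean_y, sd_x, sd_y, r) for paired data (x, y)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 points")

    # Point of averages: the centre of the cloud.
    mean_x, mean_y = fmean(xs), fmean(ys)

    # Horizontal and vertical spread (population sd: an assumption here).
    sd_x, sd_y = pstdev(xs), pstdev(ys)
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined when a variable has no spread")

    # Correlation: average of the products of the standard units.
    products = [
        ((x - mean_x) / sd_x) * ((y - mean_y) / sd_y)
        for x, y in zip(xs, ys)
    ]
    r = sum(products) / len(products)

    return mean_x, mean_y, sd_x, sd_y, r


# Hypothetical data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
mean_x, mean_y, sd_x, sd_y, r = five_summaries(x, y)
print(f"point of averages: ({mean_x}, {mean_y})")
print(f"sd of x: {sd_x:.3f}, sd of y: {sd_y:.3f}, correlation: {r:.3f}")
```

For this made-up data the point of averages is (3, 4) and the correlation is about 0.77, so y tends to increase with x: a fairly strong positive association.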