STAT1008 Lecture Notes - Lecture 2: Attention Deficit Hyperactivity Disorder, Dependent And Independent Variables
The Structure of Data 1.1
● Data:
○ Set of measurements taken on a set of individual units
○ Stored and present in a dataset, comprised of variable measures (variable
= characteristic that is recorded for each case)
○ Unique case = row in dataset
○ Variable = column
○ E.g. If values in kidney cancer dataset are rates of kidney cancer deaths,
then the cases are counties however if the values in kidney cancer dataset
are yes/no then the cases are people
● Categorical Vs Quantitative:
○ Categorical variable divides the cases into groups e.g. Gender
○ Quantitative variable measures a numerical quantity for each case e.g.
exact height
○ Each variable is therefore either categorical or quantitative
○ Quantitative variables would be changed to a categorical variable
● Variables:
○ Explanatory and Response:
■ If we are using one variable to help us understand and predict
values of another variable, we call the former the explanatory
variable and the latter the response variable
■ E.g. Does meditation help reduce stress? Meditation = explanatory
variable stress = response variable
■ E.g. Does sugar consumption increase hyperactivity? Sugar =
explan and hyperactivity = response
find more resources at oneclass.com
find more resources at oneclass.com
Document Summary
Set of measurements taken on a set of individual units. Stored and present in a dataset, comprised of variable measures (variable. = characteristic that is recorded for each case) If values in kidney cancer dataset are rates of kidney cancer deaths, then the cases are counties however if the values in kidney cancer dataset are yes/no then the cases are people. Categorical variable divides the cases into groups e. g. gender. Quantitative variable measures a numerical quantity for each case e. g. exact height. Each variable is therefore either categorical or quantitative. Quantitative variables would be changed to a categorical variable. If we are using one variable to help us understand and predict values of another variable, we call the former the explanatory variable and the latter the response variable. Meditation = explanatory variable stress = response variable.