STAB22H3 Study Guide - Midterm Guide: Observational Study, Joint Probability Distribution, Box Plot

Cases: the objects described by a set of data, e.g. customers, companies, or subjects in a study.
Label: a special variable used in some data sets to distinguish the different cases.
Categorical variable: places a case into one of several groups or categories; displayed with bar graphs and pie charts.
Quantitative variable: takes numerical values for which arithmetic operations make sense; displayed with stemplots (stem-and-leaf plots), histograms, and boxplots.
Distribution of a variable: tells what values the variable takes and how often it takes these values.
Distribution of a categorical variable: lists the categories and gives either the count or the percent of cases that fall in each category.
Histogram: the vertical axis can show frequency (count), percent (relative frequency), or density. Describe the overall pattern by shape (e.g. symmetric or skewed), centre (midpoint), and spread, and note any deviations from that pattern, such as outliers.
Outlier: an individual value that falls outside the overall pattern.
Mean: $\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}$, i.e. the sum of the observations divided by the number of observations n.
First quartile Q1: the median of the observations whose position in the ordered list is to the left of the location of the overall median.
Third quartile Q3: the median of the observations whose position in the ordered list is to the right of the location of the overall median.
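
The mean and quartile definitions above can be checked by hand on a small data set. The following is a minimal Python sketch, not part of the course notes: the function names (`mean`, `median`, `quartiles`) and the sample data are made up for illustration. It assumes the rule as stated in the notes, where each quartile is the median of the observations strictly to one side of the overall median's location. Statistical software often uses slightly different quartile rules, so its results may not match exactly.

```python
def mean(values):
    """Sum of the observations divided by the number of observations."""
    return sum(values) / len(values)


def median(values):
    """Midpoint of the ordered list (average of the two middle values if n is even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def quartiles(values):
    """Return (first quartile, third quartile) using the median-of-halves rule.

    Assumes at least two observations. When n is odd, the overall median
    itself is left out of both halves.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]  # observations to the left of the median's location
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]  # to the right
    return median(lower), median(upper)


# Illustrative data only (not from the course notes).
data = [1, 3, 4, 6, 7, 9, 12]
print(mean(data))       # 6.0
print(median(data))     # 6
print(quartiles(data))  # (3, 9)
```

Here the overall median is 6, the values to its left are 1, 3, 4 (median 3), and the values to its right are 7, 9, 12 (median 9).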