STAT 263 Lecture 2: Week 1 Lecture 2
Document Summary
Location where the data is centered (similar to average of data) Variability how spread out the data in the set is. Outliers are outliers present in the data. We can write raw data in a stem-and-leaf diagram (shown below). The stems are the first digit(s) of the number and the leaves are the last digit of the number data. The frequency of the data is how many leaves there are for a given stem. We can have multiple bins for one stem aswell. Range/(approx. number of classes) = approx. class interval (largest value) (smallest value) = range. If dataset has <200 values, use square root of number of data values to find approx. number of bins. Relative frequency (percentage distribution): the frequency of the data divided by the total cumulative frequency of the whole data set. Total relative frequency is equal to 1. (we can also find cumulative relative frequency).