Textbook Notes (362,776)
Statistics (125)
STAB22H3 (122)
Chapter 4

# Chapter 4.docx

3 Pages
69 Views

School
University of Toronto Scarborough
Department
Statistics
Course
STAB22H3
Professor
Mahinda Samarakoon
Semester
Winter

Description
Stats: Data and Models – Canadian Edition Chapter 4 – Displaying and Summarizing Quantitative Data Histograms - For quantitative variables, there is no obvious way to choose piles – so all the possible values are divided into bins/classes and then the number of cases in each bin/class is counted - The classes and the counts give the distribution of the quantitative variable - The histogram displays the distribution at a glance - Making histograms: aim for 6-10 bins for smaller data sets and 10-25 bins for larger data sets - Spaces in a histogram are actual gaps in the data (regions where there are no observed values), whereas in bar graphs, there are spaces between the bars to separate the counts of the different categories - Relative frequency histogram – replaces the counts with the percentage or proportion of the total number of cases in each bin/class (shape of histogram will be the same) Stem-and-Leaf Display - Like a histogram, but shows the individual values of the data - Turning the stem-and-leaf on its side should show roughly the same shape as the histogram of the same data - The ‘stem’ is the tens digit of the data value (i.e. in 8|4, 8 represents 80) - The ‘leaf’ is the ones digit of the data value (i.e. 8|4, represents 84) - Stem-and-leaf displays show more information than histograms - With larger data sets, leaves 0-4 and leaves 5-9 are divided to make 2 lines (i.e. with the same stem) - 3 digit numbers: the number in the stem can be the hundreds digit, or both the hundreds and the tens digit together (i.e. 546 can be 5|4 or 54|6), leaves can be two digits, but it is unnecessary o Leaves are better left as one digit so that there is more room for the data and computers can better interpret the data Dotplot - A dotplot places a dot along an axis for each case in the data - Like a stem-and-leaf plot, but with dots instead of digits - Good for small data sets Think Before you Draw - Before making a histogram, stem-and-leaf display, or dotplot, check the quantitative data condition – that the data are values of a quantitative variable whose units are known - Discuss shape, spread, and centre when describing distribution The Shape of a Distribution - Humps or peaks in the distribution are called modes o For categorical variables – the mode can be the single value that appears the most often, but this is not acceptable for quantitative variables - Histograms can be unimodal, bimodal, or multimodal - If the histogram can be folded in half (vertically) with the edges matching pretty closely, it is symmetric - The thinner ends of the distribution are called tails - A histogram is skewed to side of the longer tail (if one tail stretches out farther than the other) - Outliers can be the most informative part of the data or it could be an error – don’t just throw it away; point it out, try to explain it, set it aside, rather than have it distort the data analysis The Centre of the Distribution - Mean/ȳ = ∑ o y-bar = the sum of all the values of the variable (y) divided by the number of data values (n) - The mean feels like the centre because it is the point at which the histogram balances - If the data is skewed or has outliers, the centre is not so well defined - Summarizing Skewed Distributions o The mean does not provide a good summary of the distribution when it is skewed or when it contains outliers (it is good when the data is symmetric and unimodal) o Median – value that splits the data in half; splits the histogram into two regions of equal area o The median considers only the order of the values and is therefore resistant to values that are extraordinarily large or small o Whe
More Less

Related notes for STAB22H3

OR

Don't have an account?

Join OneClass

Access over 10 million pages of study
documents for 1.3 million courses.

Join to view

OR

By registering, I agree to the Terms and Privacy Policies
Just a few more details

So we can recommend you notes for your school.