# Week 3 Study Notes

13 Dec 2010
GGR 270 - Lecture 3 - September 29, 2010
Statistics vs. Parameters
īGraphs are limited in what they can tell us
īDifficulty making inferences about a population when looking at a subset or sample
īTherefore, we need to use numerical measures
īMeasures associated with the population are called parameters
īMeasures associated with a sample are called statistics
Measures of the centre
Mean:
īMost commonly used measure of central tendency
īSum of all values or observations divided by the number of observations
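Sample: $\bar{x} = \frac{\sum x_i}{n}$, where $n$ is the number of observations in the sample
Population: $\mu = \frac{\sum x_i}{N}$, where $N$ is the number of observations in the population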
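As a quick check on the definition above (sum of the observations divided by the number of observations), here is a minimal Python sketch. The function name `mean` and the sample values are made up for illustration and are not from the lecture.

```python
def mean(values):
    """Return the sum of the observations divided by their count."""
    if not values:
        raise ValueError("mean of an empty data set is undefined")
    return sum(values) / len(values)


# Hypothetical sample of five observations: 2 + 4 + 4 + 5 + 10 = 25
sample = [2, 4, 4, 5, 10]
sample_mean = mean(sample)  # 25 / 5
```

The same arithmetic applies to a population; only the symbol changes depending on whether the data are a sample or the whole population.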
Median:
īValue occupying the āmiddle positionā in an ordered set of observations
īOrder the observations, lowest to highest, and find the middle position
īExpressed as: .5(n+1)
Mode:
īValue that occurs with the highest frequency
īAllows you to locate the peak of relative frequency histogram
Choosing appropriate measure
The mean is usually the best measure because it uses every observation, so it is sensitive to a change in any single observation
The mean is not a good measure when:
- The distribution is bimodal
- The distribution is skewed
- Outliers (extreme values) are present in the data set
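Position of the measures by distribution shape (left to right along the axis):
- Normal: mean, median and mode coincide at the centre
- Bimodal: mode, mean and median, mode (the mean and median fall between the two peaks)
- Positive skew: mode, median, mean, so mean > median
- Negative skew: mean, median, mode, so mean < median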
*The mean is always dragged in the direction of the skew (toward the longer tail)
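The mean-versus-median comparison above can be turned into a rough check in Python. This is only a sketch of the rule of thumb from these notes, not a formal skewness statistic, and the function name `skew_direction` and the test values are made up.

```python
import statistics


def skew_direction(values):
    """Guess the direction of skew by comparing the mean with the median."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    if mean > median:
        return "positive"   # mean dragged toward a long right tail
    if mean < median:
        return "negative"   # mean dragged toward a long left tail
    return "none"           # mean equals median, no skew indicated
```

For instance, the values 1, 2, 2, 3, 10 have a mean of 3.6 and a median of 2, so the check reports a positive skew.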
Measures of dispersion
