STA130H1 Lecture Notes - Lecture 2: Random Variable, Data Wrangling, Linear Combination

29 Jan 2020

Document Summary

A histogram is a graphical method for visualizing the distribution of a single variable. To construct a basic histogram: (1) divide the range of the data into intervals (called bins); (2) count the number of observations contained in each bin; (3) plot rectangles with height equal to the count from (2) and width equal to the width of the bin. Different bin widths will yield different histograms. The bins of the histogram are the intervals:

Statistical data is obtained by observing (random) variables. A random variable can be given a precise mathematical definition, which we will cover later in the course. Collecting this data will generate three variables: height, years, and sex.

There are three interrelated rules which make a dataset tidy: "For a given dataset, it is usually easy to figure out what are observations and what are variables, but it is surprisingly difficult to precisely define variables and observations in general." (Wickham, 2014)
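The histogram construction described in the summary can be sketched in a few lines of Python. This is only an illustration, not the course's own code: the function name `histogram_counts`, the choice of half-open bins `[left, right)`, and the sample `heights` values are all assumptions made for the example.

```python
from collections import Counter


def histogram_counts(data, bin_width, start=None):
    """Return ((left, right), count) pairs for equal-width bins [left, right).

    Hypothetical helper: `start` is the left edge of the first bin and
    defaults to the smallest observation.
    """
    if start is None:
        start = min(data)
    if start > min(data):
        raise ValueError("start must not exceed the smallest observation")

    # Steps (1) and (2): assign each observation to a bin and count per bin.
    counts = Counter()
    for x in data:
        index = int((x - start) // bin_width)
        counts[index] += 1

    # Report every bin up to the last occupied one, including empty bins.
    last = max(counts)
    return [
        ((start + i * bin_width, start + (i + 1) * bin_width), counts.get(i, 0))
        for i in range(last + 1)
    ]


# Made-up heights in centimetres, used only to illustrate the steps.
heights = [150, 152, 158, 160, 161, 165, 170, 171, 179, 185]

# Step (3): draw one text "rectangle" per bin, with length equal to the count.
for width in (10, 20):
    print(f"bin width = {width}")
    for (left, right), count in histogram_counts(heights, width, start=150):
        print(f"  [{left}, {right}): {'#' * count}")
```

Running the loop with bin widths 10 and 20 on the same data gives two differently shaped pictures, which is the point made in the summary about bin width.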
