[STAT 3201] - Final Exam Guide - Everything you need to know! (34 pages long)

520 views34 pages

Document Summary

What is big data: a dataset that is complex (structured or unstructured) and large in dimensionality and number of observations. Common examples: social media such as twitter or facebook, genomics whole genome sequencing, medical imaging, high-frequency finance, video analysis youtube, neuroscience brain connectivity. These datasets pose new computational and statistical challenges. Working with big data requires knowledge from statistics, computer science, application field (bioinformatics or business), etc. Probabilities quantify uncertainty regarding the occurrence of events. Variables: a quantity whose value is determined by the result of a chance process, any characteristic or measurement that differs from individual to individual. Sample mean: the average of a portion of the population. Sample median: the middle value of a portion of the population. Comparing means and medians: outlier, an observation that is numerically distant from the rest of the data; can be caused by errors in data collection.

Get access

Grade+20% off
$8 USD/m$10 USD/m
Billed $96 USD annually
Grade+
Homework Help
Study Guides
Textbook Solutions
Class Notes
Textbook Notes
Booster Class
40 Verified Answers

Related textbook solutions

Related Documents