IST 195 Lecture Notes - Lecture 2: Yelp, Data Management, Oakland Athletics
Document Summary
A plethora of data is being collected and warehoused. Facebook users generate 200-400 tb of pictures every day. Twitter has 500 million tweets per day. The nsa collects about 29 pb of data every day. All the data in the world - basketball court. Data nsa collects - dime on basketball court. Ex: oakland a"s used data analytics to create winning team - rather than biases. 4 types of data: unstructured, the structure is not formally defined or anticipated, social media, rss feeds, videos, docs, pdfs, graphics, hard to analyze, semi-structure, hybrid data, emails, word documents with tables b. i. Unstructured: body of email - can contain text, tables, attachments, etc: structured, highly organized and manageable a. i. Weather, yelp, uber: geofence: taking a physical space and building a digital fence (area) around it b. i. b. ii. Offers special content to phones within that area. Companies would write checks to their database vendor to process large amounts of data.