COMMERCE 2KA3 Lecture Notes - Lecture 5: Big Data, Data Warehouse, Data Cleansing
Document Summary
A methodology for documenting databases, illustrating the relationships between various entities in the database. Using databases to keep track of basic transactions. Companies also need databases to provide information that will help the company run more efficiently and make better decisions. Accelerate simple queries against large volumes of structured and unstructured data. Main disadvantage: security; we don't know who can access our data in the cloud. Need new technologies and tools capable of maintaining and analyzing non-traditional data. A subset of a data warehouse containing only a portion of the organization's data for a specific function or population of users. Enables distributed parallel processing of huge amounts of data across inexpensive computers. We focus on different factors in our analysis (ex. Provides insights into corporate data by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behaviour.
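The idea of a subset of a data warehouse that holds only the data for one function or group of users can be illustrated with a small sketch. This is only an illustration under stated assumptions, not how any particular warehouse product works: the table `warehouse_sales`, the view `marketing_mart`, and the sample rows are all hypothetical, and an in-memory SQLite database stands in for the warehouse.

```python
import sqlite3

# Hypothetical stand-in for a data warehouse: one in-memory table.
# Table name, column names, and rows are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE warehouse_sales (region TEXT, department TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO warehouse_sales VALUES (?, ?, ?)",
    [
        ("East", "Marketing", 120.0),
        ("West", "Marketing", 80.0),
        ("East", "Finance", 200.0),
    ],
)

# The subset for one function: a view exposing only the marketing
# portion of the warehouse data.
conn.execute(
    "CREATE VIEW marketing_mart AS "
    "SELECT region, amount FROM warehouse_sales "
    "WHERE department = 'Marketing'"
)

mart_rows = conn.execute(
    "SELECT region, amount FROM marketing_mart ORDER BY region"
).fetchall()
print(mart_rows)  # [('East', 120.0), ('West', 80.0)]
```

Users of the view see only the marketing rows, while the finance row remains in the underlying warehouse table.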