COMP 541 Lecture Notes - Lecture 3: Hortonworks, Enterprise Search, Hsqldb
Document Summary
Draw diagrams of the Hadoop platform, application framework, and/or ecosystem, and explain the following terms:
GFS: in the Hadoop context, the Google File System, the distributed file system whose design inspired HDFS. The same acronym is also used for a genetic fuzzy system, a fuzzy system augmented with an evolutionary learning process.
Amazon S3: a service for storing data on, and retrieving it from, Amazon's servers.
YARN: one of the key features of Hadoop 2, the second generation of Apache Hadoop.
Security: Apache Knox as a reverse proxy (with contributions from Hortonworks), or Apache Accumulo for cell-level security.
M-Brain: a technology partner that helps companies navigate a turbulent, ever-expanding business environment and cope with information overload.
MapReduce: a programming model for processing large data sets with a parallel, distributed algorithm on a cluster.
Apache Tez: an extensible framework for building high-performance batch and interactive data processing applications, coordinated by YARN in Apache Hadoop.
Apache Pig: a high-level platform for creating programs that run on Apache Hadoop.
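To make the MapReduce programming model more concrete, here is a minimal single-process sketch of a word count in plain Python. It does not use the Hadoop API and runs nothing in parallel. The function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are hypothetical, chosen only to mirror the map, shuffle, and reduce stages that a real cluster would distribute across machines.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all emitted values by key.

    On a real cluster the framework does this between the map and reduce
    stages; here it is an in-memory dictionary (an assumption of this sketch).
    """
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, one "record" per string.
    docs = ["pig runs on hadoop", "tez runs on yarn", "yarn is part of hadoop"]
    print(word_count(docs))
```

Because each call to map_phase depends only on its own line, and each call to reduce_phase only on its own key, both stages could be split across many machines, which is the property the model relies on.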