- Instructor: Suneel Kumar
- Lectures: 11
- Quizzes: 3
- Students: 4
- Duration: 10 weeks
Welcome to AppsTek’s Big Data Specialization! We are excited and looking forward to learning about you. Big Data requires new programming frameworks and systems. For this course, we don’t need programming knowledge or experience, but we do want to clear the foundation of some of the key concepts.
A good data analyst is one who turns data into relevant information, information into useful insights and insight into a business decision. Big data is data that contains greater variety, arriving in increasing volumes and with ever-higher velocity.
Around 2005, people began to realize just how much data users generated through Facebook, YouTube, and other online services. Hadoop was developed that same year. The development of open-source frameworks, such as Hadoop (and more recently, Spark) was essential for the growth of big data because they make big data easier to work with and cheaper to store.
Program Curriculum
- Understanding Big Data and Hadoop/Brief history of Hadoop
- Hadoop Architecture and HDFS
- Hadoop MapReduce Framework
- Advanced MapReduce
- Pig, Hive, Advanced Hive
- Data form databases with Sqoop
- Introduction to HBase & exploring master and region servers in HBase
- Processing Distributed Data with Apache Spark
- Streaming on Kafka
- Security components in Hadoop
- Live use cases hands on Hadoop Project