Hadoop is one of the most widely adopted platforms for storing and processing big data. It is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. Many enterprises already use Hadoop internally or have started to adopt it. Because the technology is relatively new, there is a large talent gap. This course explains Hadoop concepts in simple terms, and the labs familiarize you with how Hadoop is used on the kinds of real projects organizations are working on.
Instructor-led live class - 15 hrs
Hands-on - 10 hrs
Project work - 20 hrs
Printed Study Material or e-book
Access to screencasts (24x7 access) - 10 hrs duration
Please leave your email and we will notify you when new schedules are out.
This module introduces you to the world of big data.
Introduction to big data
What the modern computing problems are
How Hadoop resolves these problems
This module helps you set up Hadoop and configure it.
Introduction to Hadoop
HDFS, YARN and MapReduce
Architecture of Hadoop
How MapReduce works
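To give a feel for how MapReduce works before the labs, here is a minimal sketch of the map, shuffle and reduce steps in plain Python. It is an illustration only: the function names (map_phase, shuffle, reduce_phase, word_count) are made up for this example and are not part of any Hadoop API, and everything runs in a single process, whereas Hadoop distributes the same steps across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key.

    In Hadoop the framework performs this step between map and reduce;
    here it is simulated with an in-memory dictionary.
    """
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in order over a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["Hadoop stores data", "Hadoop processes data"]
    print(word_count(sample))
```

The mapper and the reducer each see only one line or one key at a time, which is what lets a framework such as Hadoop run many of them in parallel on different machines.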
This module shows you how to program in Hadoop. Many open source tools are available for Hadoop, including HBase, Hive, Pig, Oozie and ZooKeeper. Although each tool has its own purpose and development console, the basic concepts remain the same. This module walks you through the open source tools for Hadoop.
This module enables you to start using one of the Hadoop tools/projects, called Pig.
Pig uses a scripting language called Pig Latin, a high-level scripting language
Using Java libraries in Pig
Writing UDFs in Pig
HDFS is a file system, so database operations are not possible on HDFS directly. HBase gives database capabilities to Hadoop.
Learn the Hadoop project Hive.
This module familiarizes you with other important projects in the Hadoop ecosystem.
Introduction to Oozie.
Introduction to Flume.
Introduction to Sqoop.
Introduction to ZooKeeper.
This module covers advanced topics that help you in real-life projects.
Best practices for Hadoop development
Implementation of workflows
Who should take this course?
Hadoop is a widely used technology, and practitioners are migrating to it. Hadoop has tools for people from an application development background as well as a database background. This course is intended for professionals who are interested in moving into the world of big data.
The course has no specific prerequisites. It includes modules for professionals from a database background as well as an application programming background. The course is designed to introduce students to the concepts of big data and then ease them into coding in Hadoop.