Apache Hadoop Developer Training

Overview

Hadoop developer training is a 5 day course on Apache Hadoop aimed at experienced programmers who are new to big data world. Put your existing programming and problem solving skills to solve problems in big data using hadoop.

Course takes a practitioner approach to content. It will teach you from basic syntax and semantics to advanced problem solving using hadoop. We focus heavily on hands on to make sure each and every concept is clear to you. We stress on building distributed mind set from day one. Each section has a practical focus, mixing presentation with in-depth hands-on labs and exercises.

If you are experienced developer who want to take first steps in Hadoop, this course is right for you.

Prerequisites

  • Programming experience in Java.
  • Basic familiarity with Unix/Linux is highly recommended
  • Prior knowledge about Hadoop is not required.

Course content

  • Introduction
    • Why Bigdata?
    • Introduction to Hadoop
    • Hadoop Architecture
    • Introduction to Hadoop 2.0
    • Introduction to Map/Reduce
  • HDFS
    • Hadoop installation
    • Introduction to HDFS
    • HDFS interfaces
    • HDFS ETL
    • HDFS api java examples
  • Map/Reduce
    • Mapper and Reduce API
    • Configuration API
    • Custom writables
    • Chain mapper and chain reducer
    • Combiners
    • Reduce side Join
    • Distributed Cache
    • Map side join
    • Job dependency management
    • Sequence files
    • Custom input format

    All the above api's with real world example code.

  • Hive
    • Introduction to Hive
    • Hive installation
    • Hive query language Hands on
    • Partitions
    • Bucketing
    • Indexing
    • UDF / UDAF
    • Hive Serde
    • Hive bucketing
  • Pig
    • Introduction to Pig
    • Pig installation
    • Pig latin Hands on
    • Pig UDF
Course Summary
Length
5 days

Audience

Developers who are looking to understand Big data and hadoop technologies.

Frameworks covered

  • Map/Reduce
  • HDFS
  • Hive
  • Pig
  • Others on demand
Public and Corporate courses

We run public courses every few months. Public courses are open to all. Get in touch to find out when the next course is scheduled.

Corporate courses are run exclusively for your team. You have complete control over the syllabus and schedule. Let us know your custom requirements on the booking form and we'll customize the material appropriately.