Big Data Course - Apponix Academy



Course Description

Big Data Course Objectives

 
  • Understand the fundamentals of Hadoop Hive
  • Know indexing and map-side joins in Hive
  • Learn how to handle complex data types
  • Acquire an in-depth understanding of HDFS (Hadoop Distributed File System)
  • Learn Pig, Sqoop, HBase and YARN at an advanced level
  • Create an Ubuntu image in VMware
  • Understand computational tools at the application level
  • Handle data and statistical analysis more effectively

Big Data Hadoop Course Syllabus

 
  • 1: Hadoop Introduction
  • Introduction to Data and System
  • Types of Data
  • Traditional ways of dealing with large data and their problems
  • Types of Systems & Scaling
  • What is Big Data
  • Challenges in Big Data
  • Challenges in Traditional Application
  • New Requirements
  • What is Hadoop? Why Hadoop?
  • Brief history of Hadoop
  • Features of Hadoop
  • Hadoop and RDBMS
  • Overview of the Hadoop Ecosystem
  • 2: Hadoop Installation
  • Installation in detail
  • Creating an Ubuntu image in VMware
  • Downloading Hadoop
  • Installing SSH
  • Configuring Hadoop, HDFS & MapReduce
  • Downloading, Installing & Configuring Hive
  • Downloading, Installing & Configuring Pig
  • Downloading, Installing & Configuring Sqoop
  • Configuring Hadoop in Different Modes
  • 3: Hadoop Distributed File System (HDFS)
  • File System - Concepts
  • Blocks
  • Replication Factor
  • Version File
  • Safe mode
  • Namespace IDs
  • Purpose of NameNode
  • Purpose of DataNode
  • Purpose of Secondary NameNode
  • Purpose of JobTracker
  • Purpose of TaskTracker
  • HDFS Shell Commands – copy, delete, create directories etc.
  • Reading and Writing in HDFS
  • Differences between Unix commands and HDFS commands
  • Hadoop Admin Commands
  • Hands on exercise with Unix and HDFS commands
  • Read / Write in HDFS – Internal Process between Client, NameNode & DataNodes
  • Accessing HDFS using Java API
  • Various Ways of Accessing HDFS
  • Understanding HDFS Java classes and methods
  • Commissioning / Decommissioning DataNodes
  • Balancer
  • Replication Policy
  • Network Distance / Topology Script
  • 4: MapReduce Programming
  • About MapReduce
  • Understanding block and input splits
  • MapReduce Data types
  • Understanding Writable
  • Data Flow in MapReduce Application
  • Understanding MapReduce problem on datasets
  • MapReduce and Functional Programming
  • Writing MapReduce Application
  • Understanding Mapper function
  • Understanding Reducer Function
  • Understanding Driver
  • Usage of Combiner
  • Usage of Distributed Cache
  • Passing the parameters to mapper and reducer
  • Analysing the Results
  • Log files
  • Input Formats and Output Formats
  • Counters; Skipping Bad and Unwanted Records
  • Writing Joins in MapReduce with Two Input Files; Join Types
  • Executing a MapReduce Job - Insights
  • Exercises on MapReduce

Institute Overview

Pune, Maharashtra, India

Setting a benchmark in the industry, Apponix Technologies Private Limited is a training and recruitment company located in Bangalore, delivering classroom and online training across India, the UK and the USA.
