Hadoop Training

Hadoop Training in Chennai

Dhaksha Technology is the No. 1 Big Data Hadoop training institute in Chennai, offering real-time, hands-on Hadoop training. We provide training across the Hadoop ecosystem, including Hive, Pig, HDFS, and Sqoop. Big Data technology is mainly used to store and analyze huge amounts of structured and unstructured data. We provide training in Hadoop development, Hadoop testing, and Hadoop administration.

Hadoop Training 

Today, most top companies store and analyze huge volumes of data from areas such as banking, telecommunications, e-commerce, hospitality, airlines, sensors, social networks such as Facebook, and online shopping. The data is so large that it becomes difficult to access or process with conventional database management systems, which handle it slowly. Apache Hadoop, an open-source framework, processes huge volumes of data quickly.

We provide many examples based on real-time tasks across the different ecosystem tools. We also teach basic UNIX commands for accessing the HDFS file system and for writing UNIX shell scripts.

Hive Training in Chennai  

Dhaksha Technology offers Hive training in Medavakkam. Apache Hive is used to store structured data in HDFS. Basic SQL knowledge is enough to work in the Hive environment. Hive supports different file formats, such as text file, RCFile, ORC, SequenceFile, and Avro, and files can be stored in compressed form using bzip2, gzip, LZO, and similar codecs. Hive offers several optimization techniques (file format choice, map join, partitioning) to improve query performance. Hive stores data in a distributed manner and is used to extract, transform, and load (ETL) data.
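
To illustrate one of the optimization techniques mentioned above, here is a minimal Python sketch of the idea behind a map join: the smaller table is loaded into an in-memory lookup table and the larger table is streamed past it, so no shuffle or reduce phase is needed. This is a toy illustration, not Hive code; the function name `map_join` and the sample tables are made up for the example.

```python
# Toy illustration of the idea behind a Hive map join (not Hive code).
# The small table is held in memory as a hash map; the large table is
# streamed row by row and matched against it.

def map_join(large_rows, small_rows, key):
    """Inner-join two lists of dicts on `key`, hashing the smaller side."""
    lookup = {}
    for row in small_rows:
        lookup.setdefault(row[key], []).append(row)

    joined = []
    for row in large_rows:
        for match in lookup.get(row[key], []):
            merged = dict(match)   # copy so the lookup table is not mutated
            merged.update(row)
            joined.append(merged)
    return joined


# Hypothetical sample data.
departments = [
    {"dept_id": 1, "dept_name": "Sales"},
    {"dept_id": 2, "dept_name": "Support"},
]
employees = [
    {"emp": "Asha", "dept_id": 1},
    {"emp": "Ravi", "dept_id": 2},
    {"emp": "Meena", "dept_id": 3},  # no matching department, dropped by the inner join
]

result = map_join(employees, departments, "dept_id")
for row in result:
    print(row["emp"], row["dept_name"])
```

In Hive itself, this behaviour is typically requested through the `hive.auto.convert.join` setting or a `MAPJOIN` hint rather than written by hand.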

We offer real-time Hive training in Medavakkam with a limited number of batches and flexible timing. Dhaksha Technology is a leader in Hadoop training in Medavakkam, with sessions led by industry experts using real-time scenarios.

All topics are covered with real-time examples. Theory books and interview questions are provided at the sample class.

Sqoop Training in Chennai

Sqoop is an Apache open-source tool for transferring bulk data from relational databases such as Teradata, Netezza, Oracle, MySQL, Postgres, and HSQLDB to HDFS. It can also export data from HDFS back to a relational database. With Sqoop we can import all tables or selected tables from a database into HDFS. We can also use incremental import to fetch only the rows newly added to a table. Sqoop also supports importing data from an RDBMS directly into a Hive table. Dhaksha Technology provides Sqoop training in Medavakkam.
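
The incremental import idea described above can be sketched in a few lines of Python. This is a toy illustration of the logic only, not Sqoop itself: an in-memory SQLite table stands in for the source RDBMS, a Python list stands in for the data landed in HDFS, and the function name `incremental_import` and the `orders` table are hypothetical.

```python
# Toy illustration of incremental (append-mode) import logic, not Sqoop itself.
import sqlite3


def incremental_import(conn, table, check_column, last_value):
    """Return rows whose check column exceeds last_value, plus the new high-water mark."""
    # Table and column names are trusted constants in this sketch.
    query = f"SELECT * FROM {table} WHERE {check_column} > ? ORDER BY {check_column}"
    cursor = conn.execute(query, (last_value,))
    columns = [d[0] for d in cursor.description]
    rows = cursor.fetchall()
    if rows:
        last_value = rows[-1][columns.index(check_column)]
    return rows, last_value


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, "pen"), (2, "book")])

imported = []    # stand-in for data already landed in HDFS
last_value = 0   # nothing imported yet

# The first run behaves like a full load.
first_batch, last_value = incremental_import(conn, "orders", "id", last_value)
imported.extend(first_batch)

# A new row arrives in the source table.
conn.execute("INSERT INTO orders VALUES (3, 'lamp')")

# The second run picks up only the new row.
second_batch, last_value = incremental_import(conn, "orders", "id", last_value)
imported.extend(second_batch)

print(second_batch)
print(last_value)
```

Sqoop exposes the same idea through its `--incremental append` mode together with the `--check-column` and `--last-value` options.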

Sqoop features:

  • Full Load
  • Incremental Load
  • Parallel import/export
  • Import results of SQL query
  • Compression
  • Connectors for all major RDBMS
  • Kerberos Security Integration
  • Load data directly into Hive/HBase
  • Support for Accumulo, MySQL, Postgres, and HSQLDB

Pig Training in Chennai

Spark Training in Chennai

Apache Spark is a newer processing engine for huge amounts of data. Spark is developed under the Apache Software Foundation. For in-memory workloads, Spark can run up to 100 times faster than Hadoop MapReduce, because it keeps data in memory where possible instead of writing intermediate results to disk.

Dhaksha Technology provides real-time Spark training in Medavakkam, with sessions available on weekdays and weekends. We cover all the main Spark topics, such as RDDs, shared variables, transformations and actions, Spark SQL, DataFrames, and Datasets. The Spark engine processes both batch and real-time streaming data. Spark programs can be developed in Scala or Java.
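
The difference between transformations and actions mentioned above is easiest to see in a small example. The following Python sketch is a toy stand-in for the RDD programming model, not the Spark API: the class `ToyRDD` and its methods are invented for illustration. Transformations only record the work to be done, and nothing runs until an action is called.

```python
# Toy stand-in for the RDD programming model (not the Spark API).
# Transformations only record work to do; an action triggers execution.

class ToyRDD:
    def __init__(self, data, steps=None):
        self._data = data
        self._steps = steps or []   # recorded transformations (the "lineage")

    # Transformations: return a new ToyRDD, compute nothing yet.
    def map(self, fn):
        return ToyRDD(self._data, self._steps + [("map", fn)])

    def filter(self, pred):
        return ToyRDD(self._data, self._steps + [("filter", pred)])

    def _run(self):
        for item in self._data:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    item = fn(item)
                elif not fn(item):
                    keep = False
                    break
            if keep:
                yield item

    # Actions: run the recorded steps and return a result.
    def collect(self):
        return list(self._run())

    def count(self):
        return sum(1 for _ in self._run())


calls = []

def square(x):
    calls.append(x)   # record when the function actually runs
    return x * x

numbers = ToyRDD([1, 2, 3, 4, 5])
squares_of_evens = numbers.filter(lambda x: x % 2 == 0).map(square)

print(len(calls))                   # still 0: nothing has executed yet
print(squares_of_evens.collect())   # the action triggers the computation
```

In real Spark the recorded steps are also what lets the engine recompute lost partitions and plan work across the cluster; this sketch only shows the lazy-evaluation part.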

Spark's ability to process large amounts of structured and unstructured data, together with the need for greater speed in real-time analytics, has made it a real alternative for Big Data computation.

Big Data certification

  1. Cloudera Certified Administrator for Apache Hadoop (CCAH)
  2. Cloudera Certified Professional: Data Scientist (CCP: DS)
  3. Cloudera Certified Professional Data Engineer
  4. EMC Data Scientist Associate (EMCDSA)


Hadoop Course Content 

Hadoop – Big Data Overview 

Hadoop Course Syllabus

  • HDFS – Architecture

  • Hadoop – Installation

  • Hadoop1 Vs Hadoop2

  • Hadoop – NameNode

  • Hadoop – DataNode

  • Hadoop – JobTracker and TaskTracker

  • Hadoop – Basic Commands

  • Hadoop – Replication Factor

  • Hadoop – Rack Awareness

  • Hadoop –  MapReduce 

  • Hadoop – Introduction


Hadoop – HIVE

2.1 About Hive

2.2 Advantages and disadvantages of Hive

2.3 Difference between Hive and other databases

2.4 Hive data types

2.5 Hive DDL

     2.5.1 Create database

     2.5.2 Internal table (managed table)

     2.5.3 External table

     2.5.4 Alter table, drop table, truncate table

2.6 Hive DML

     2.6.1 Load, update, delete, select

2.7 Hive commands

     2.7.1 show, desc, describe formatted, describe extended

2.8 Hive partition

     2.8.1 Hive partition types: static partition, dynamic partition, buckets

     2.8.2 Difference between static and dynamic partition

     2.8.3 Difference between partition and buckets

2.9 Hive file format

2.10 Hive file compression



  • Hive Built-in Operators

  • Hive Built-in Functions

  • Hive View and Indexes

  • Hive File Format

  • Hive File Compression


  • HiveQL Select Where

  • HiveQL Select Order By

  • HiveQL Select Group By

  • HiveQL Select Joins

Hadoop – SQOOP   

  • SQOOP Overview

  • SQOOP Import Data

  • Full table

  • Only Subset

  • Target Directory

  • Protecting Password

  • File format other than CSV

  • Compressing, Controlling Parallelism

  • All tables Import

SQOOP Incremental Import

  • Import only New data

  • Last Imported data

  • Storing Password in Metastore

  • Sharing Metastore between Sqoop Clients

  • SQOOP Free Form Query Import

  • SQOOP Export data to RDBMS, Hive and HBase

Hadoop – MapReduce 

Hadoop – PIG 

  • Pig Overview

  • Pig Architecture

  • Pig Execution Types

  • Pig Grunt Shell

  • Pig Installation

Load & Store Operators

  • Pig Reading Data

  • Pig Storing Data

Diagnostic Operators

  • Pig Diagnostic operator

  • Pig Describe Operator

  • Pig Explain Operator

  • Pig Illustrate Operator

Grouping & Joining

  • Pig Group Operator

  • Pig Cogroup Operator

  • Pig Join Operator

  • Pig Cross Operator

Combining & Splitting

  • Pig Union Operator

  • Pig Split Operator


  • Pig Filter Operator

  • Pig Distinct Operator

  • Pig Foreach Operator


  • Pig Order By

  • Pig Limit Operator

Built-in Functions

  • Pig Eval Function

  • Pig Load & Store Function

  • Pig Bag & Tuple Function

  • Pig String Function

  • Pig Date-Time Function

  • Pig Math Function

Other Modes Of Execution

  • Pig User Defined Function

  • Pig Running Scripts

Hadoop – HBase

  • HBase Overview

  • HBase Architecture

  • HBase Shell

  • HBase General Commands

  • HBase Admin API

  • HBase Create Table

  • HBase Listing Table

  • HBase Disabling a Table

  • HBase Enabling a Table

  • HBase Describe & Alter

  • HBase Exists

  • HBase Drop Table

  • HBase Shutting Down

  • HBase Client API

  • HBase Create Data

  • HBase Update Data

  • HBase Read Data

  • HBase Delete Data

  • HBase Scan

  • HBase Count & Truncate

  • HBase Security

Hadoop – YARN


Hadoop – SPARK

  • Spark Architecture

  • Spark – RDD

  • RDD – Transformation and Action

  • Spark – Ecosystem

  • Spark – SQL

  • Spark – Streaming

  • Spark – DataFrame

  • Spark – Dataset

  • Spark Vs MR


Reasons to Choose Dhaksha Technology

  1. More than 1,000 students trained with real-time examples.

  2. Placement tie-ups with more than 550 companies.

  3. Small training batches (3-7 students)

  4. Flexible timing

  5. Real-time Guidance

  6. Resume Preparation

  7. Certification support
