Apache Hadoop
Are you having a problem with processing extensive data? Why not go for the Apache Hadoop course!!
This course is for amateur software engineers or businessmen who might want to comprehend the instruments used to fight and examine ample information. Apache Hadoop preparing will assist candidates with understanding the capacity of the executives, Hadoop filesystem, creation, and the board of Hadoop bunch. Students can make this course as their career option as it helps a lot with data information.
Apache Hadoop Eligibility Criteria
Candidate should meet the following basic criteria for the above mentioned course:
- Basic knowledge of Linux Administration, Java Programming and Hadoop administration
- Prior Knowledge of Apache Interest in data management and analysis, including cloud systems and technology.
Apache Hadoop Course Syllabus
Before enrolling into the course, students must have a brief idea on the syllabus. Below mentioned are some of the important topics that are covered in the entire course.
MODULE I – HADOOP BASICS
7 VIDEOS WHICH CONSIST OF
- Hadoop Stack Basics
- The Apache Framework: Basic Modules
- Hadoop Distributed File System (HDFS)
- The Hadoop “Zoo”
- Hadoop Ecosystem Major Components
- Exploring the Cloudera VM: Hands-On Part 1
- Exploring the Cloudera VM: Hands-On Part 2
4 READINGS
- Apache Hadoop Ecosystem
- Lesson 1 Slides (PDF)
- Hardware & Software Requirements
- Lesson 2 Slides – Cloudera VM Tour
1 PRACTICE EXERCISE
- Basic Hadoop Stack
MODULE II – INTRODUCTION TO HADOOP STACK
10 VIDEOS
- Overview of the Hadoop Stack
- The Hadoop Distributed File System (HDFS) and HDFS
- MapReduce Framework and YARN
- The Hadoop Execution Environment
- YARN, Tez, and Spark
- Hadoop Resource Scheduling
- Hadoop-Based Applications
- Introduction to Apache Pig
- Introduction to Apache HIVE
- Introduction to Apache HBASE
6 READINGS
- Hadoop Basics – Lesson 1 Slides
- Lesson 2: Hadoop Execution Environment – Slides
- Lesson 3: Hadoop-based Applications Overview – All Slides
- Command list for Applications Slide
- Tips to handle service connection errors
- References for Application
3 PRACTICE EXERCISE
- Overview of Hadoop Stack
- Hadoop Execution Environment
- Hadoop Applications
MODULE III – INTRODUCTION TO HADOOP DISTRIBUTED FILE SYSTEM
9 VIDEOS
- Overview of HDFS Architecture
- The HDFS Performance Envelope
- Read/Write Processes in HDFS
- HDFS Tuning Parameters
- HDFS Performance and Robustness
- Overview of HDFS Access, APIs, and Applications
- HDFS Commands
- Native Java API for HDFS
- REST API for HDFS
5 READINGS
- Lesson 1: Introduction to HDFS – Slides
- HDFS references
- Lesson 2: HDFS Performance and Tuning – Slides
- HDFS Access, APIs
- Lesson 3: HDFS Access, APIs, Applications – Slides
3 PRACTICE EXERCISES
- HDFS Architecture
- HDFS performance, tuning, and robustness
- Accessing HDFS
MODULE IV – INTRODUCTION TO MAP/ REDUCE
9 VIDEOS
- Introduction to Map/Reduce
- The Map/Reduce Framework
- A MapReduce Example: Wordcount in detail
- MapReduce: Intro to Examples and Principles
- MapReduce Example: Trending Wordcount
- MapReduce Example: Joining Data
- MapReduce Example: Vector Multiplication
- Computational Costs of Vector Multiplication
- MapReduce Summary
3 READINGS
- Lesson 1: Introduction to MapReduce – Slides.
- A note on debugging map/reduce programs.
- Lesson 2: MapReduce Examples and Principles – Slides
1 PRACTICE EXERCISE
- Lesson 1 Review
MODULE V – SPARK
10 VIDEOS
- Introduction to Apache Spark
- Architecture of Spark
- Resilient Distributed Datasets
- Spark Transformations
- Wide Transformations
- Directed Acyclic Graph (DAG) Scheduler
- Actions in Spark
- Memory Caching in Spark
- Broadcast Variables
- Accumulators
4 READINGS
- Setup PySpark on the Cloudera VM
- Lesson 1: Intro to Apache Spark – Slides
- Lesson 2: RDD and Transformations – Slides
- Lesson 3: Scheduling, Actions, Caching – Slides
3 PRACTICE PAPERS
- Spark Lesson 1
- Spark Lesson 2
- Spark Lesson 3
Top Recruiters for Apache Hadoop
Getting placed in a good company is like a dream come true for every student. Some of the major recruiters who are hiring students from Apache Hadoop are;
- Air India
- Airtel
- BAJAJ Allianz
- HCL
- Infosys
- Indian Oil
- Wipro
- Nestle
- Intel
- HP
Colleges/Institute offering Apache Hadoop Courses in India
Below mentioned are some of the colleges and Institutions offering Apache Hadoop courses;or improving your enlisting approach.
American Management and Technology College, Jaro Education. |
Total Fees: 39,385 INR | 6 months | Apply Now |
Aptech Computer Education, West Mumbai |
Total Fees: INR 16,500 | 3 months | Apply Now |
Mapping Minds, Delhi |
Total Fees: INR 25,000 | 15 weeks | Apply Now |
Indian Institute of Hardware Technology Ltd (IIHT), Kalkaji, Delhi |
Total Fees: 30,000 INR | 3 months | Apply Now |
Manipal Global Academy of Data Science, Bangalore |
Total Fees: 15,220 INR | 3 months | Apply Now |