With this Big Data Hadoop course, you will learn the big data framework using Hadoop and Spark, including HDFS, YARN, and MapReduce. The course also covers using Pig, Hive, and Impala to process and analyse large datasets stored in HDFS, and using Sqoop and Flume for data ingestion.

You will be shown real-time data processing using Spark, including functional programming in Spark, implementing Spark applications, understanding parallel processing in Spark, and using Spark RDD optimisation techniques. You will also learn the various interactive algorithms in Spark and use Spark SQL for creating, transforming, and querying data frames.

Finally, you will be required to execute real-life, industry-based projects using CloudLab in the domains of banking, telecommunication, social media, insurance, and e-commerce.

Access duration
  • 1 year access to the platform
Big Data Hadoop and Spark Developer online course details
  • 16 lessons
  • Free course included - Apache Kafka
  • Free course included - Core Java
  • 5 real-life industry projects
  • Duration of 24 hours
  • Access 24/7
  • There is no exam available, but you must complete 85% of the course and complete one simulation test, with a minimum score of 60%, to obtain a certificate.




Discounted price

€314.10 until 30/04/2021

Course access duration

1 year




By the end of the course you will be able to understand:

  • The different components of the Hadoop ecosystem, such as Hadoop 2.7, YARN, MapReduce, Pig, Hive, Impala, HBase, Sqoop, Flume, and Apache Spark
  • Hadoop Distributed File System (HDFS) and YARN architecture
  • MapReduce and its characteristics, along with advanced MapReduce concepts
  • Different types of file formats, Avro schemas, using Avro with Hive and Sqoop, and schema evolution
  • Flume, including its architecture, sources, sinks, channels, and configurations
  • HBase, its architecture and data storage, and the difference between HBase and an RDBMS
  • Resilient distributed datasets (RDDs) in detail
  • The common use cases of Spark and various interactive algorithms

You will also be able to:

  • Ingest data using Sqoop and Flume
  • Create databases and tables in Hive and Impala, understand HBase, and use Hive and Impala for partitioning
  • Gain a working knowledge of Pig and its components
  • Do functional programming in Spark, and implement and build Spark applications
  • Gain an in-depth understanding of parallel processing in Spark and Spark RDD optimisation techniques
  • Create, transform and query data frames with Spark SQL

Who it is for

Big data career opportunities are on the rise and Hadoop is quickly becoming a must-know technology in big data architecture. Big Data training is suitable for IT, data management, and analytics professionals, including:

  • Software developers and architects
  • Analytics professionals
  • Senior IT professionals
  • Testing and mainframe professionals
  • Data management professionals
  • Business intelligence professionals
  • Project managers
  • Aspiring data scientists
  • Graduates looking to build a career in big data analytics


The course covers the following topics:

  • Course introduction 
  • Lesson 1 - Introduction to big data and Hadoop ecosystem 
  • Lesson 2 - HDFS and YARN 
  • Lesson 3 - MapReduce and Sqoop 
  • Lesson 4 - Basics of Hive and Impala 
  • Lesson 5 - Working with Hive and Impala 
  • Lesson 6 - Types of data formats 
  • Lesson 7 - Advanced Hive concepts and data file partitioning
  • Lesson 8 - Apache Flume and HBase 
  • Lesson 9 - Pig 
  • Lesson 10 - Basics of Apache Spark 
  • Lesson 11 - RDDs in Spark 
  • Lesson 12 - Implementation of Spark applications 
  • Lesson 13 - Spark parallel processing 
  • Lesson 14 - Spark RDD optimisation techniques 
  • Lesson 15 - Spark algorithm 
  • Lesson 16 - Spark SQL 
  • FREE COURSE - Apache Kafka
  • FREE COURSE - Core Java


There are no prerequisites for this course. However, some knowledge of Core Java and SQL is beneficial. We offer a complimentary self-paced online course, "Java essentials for Hadoop", if you need to brush up on your Core Java skills.

Instructor language


Course material language



With the purchase of any online package, iLEARN offers free access for 30 days to one online course chosen from the following:

iLEARN also offers the option of adding the related exam at a reduced price.

Big Data Hadoop and Spark Developer, 1 year, without exam: online course in English delivered by iLEARN Innovative Learning

iCONS - Innovative Consulting S.r.l.
Galleria J.F. Kennedy 10/A
20831 Seregno (MB) - Italy

0039 0362 330107
[email protected]

2020 © iCONS - Innovative Consulting S.r.l.

iLEARN is a business unit of iCONS - Innovative Consulting Srl - VAT number 03334560962
iCONS - Innovative Consulting srl is certified ISO 9001 for training and consulting services.
