This course provides a practical introduction to the Apache Hadoop ecosystem. You will learn the basic skills needed to analyze and manage large, unstructured datasets. The course covers core concepts such as the data lake, MapReduce, and Spark-based analytics. You will install and configure Hadoop on your own computer using the Hortonworks HDP sandbox. The course also covers the Hadoop Distributed File System (HDFS): its architecture and how to use it in real-world situations. It is suitable for beginners and for those looking to expand their data analytics skills. By the end, you will understand the fundamentals of Hadoop and Spark for scalable data processing.

Hadoop and Spark Fundamentals: Unit 1
This course is part of the Hadoop and Spark Fundamentals Specialization

Instructor: Pearson
What you'll learn
Understand the core concepts of Hadoop, including its architecture, data lake metaphor, and the role of MapReduce and Spark in big data analytics.
Install and configure a full-featured Hadoop and Spark environment on your desktop or laptop using the Hortonworks HDP sandbox.
Navigate and utilize the Hadoop Distributed File System (HDFS), including advanced features like high availability and federation.
Gain hands-on experience running Hadoop and Spark applications, preparing you for real-world data analytics challenges.
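
As a rough illustration of the MapReduce model named in the objectives above, the following sketch counts words in plain Python. It is not Hadoop code and is not taken from the course: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical, and everything runs in one process, whereas Hadoop distributes these phases across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the list of counts for one word into a total."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    sample = ["Hadoop stores data", "Spark processes data"]
    print(word_count(sample))
```

The map and reduce functions touch only one line or one key at a time, which is what lets a real framework run many copies of them in parallel on different machines.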
Details to know

August 2025
3 assignments

There is 1 module in this course
This module introduces the fundamentals of Hadoop and Spark, starting with core concepts and the transformative impact of Hadoop on data management. It guides learners through installing a full-featured Hadoop environment on a desktop or laptop using the Hortonworks HDP sandbox or direct installation. The module also covers the Hadoop Distributed File System (HDFS), highlighting its architecture, advantages for big data, navigation tools, and advanced features. A bonus lesson provides essential Linux command line skills for beginners.
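
To give a feel for why the HDFS architecture suits large files, the sketch below estimates how a single file is split into blocks and replicated. The 128 MB block size and replication factor of 3 are assumptions, chosen because they are common Hadoop defaults; the module description does not state them, and a given cluster may be configured differently. The function name `hdfs_footprint` is hypothetical.

```python
import math

# Assumed values (common Hadoop defaults, not stated in the course description).
BLOCK_SIZE_MB = 128
REPLICATION = 3


def hdfs_footprint(file_size_mb, block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Estimate block count, total block replicas, and raw storage for one file.

    The last block of a file only occupies as much space as it actually
    holds, so raw storage is the file size times the replication factor.
    """
    blocks = math.ceil(file_size_mb / block_size_mb)
    replicas = blocks * replication
    raw_storage_mb = file_size_mb * replication
    return blocks, replicas, raw_storage_mb


if __name__ == "__main__":
    blocks, replicas, raw_mb = hdfs_footprint(1000)
    print(f"{blocks} blocks, {replicas} block replicas, {raw_mb} MB of raw storage")
```

Under these assumptions a 1000 MB file becomes 8 blocks stored as 24 replicas, which is what allows different machines to read and process different parts of the same file at once.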
What's included
18 videos, 3 assignments
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
Financial aid available.