Apache Hadoop is the classical framework for processing Big Data, and Spark is a new in-memory processing engine.
Hadoop Developer Foundation | Working with Hadoop, HDFS, Hive, Yarn, Spark and More is a lab-intensive hands-on Hadoop course that explores processing large data streams in the Hadoop Ecosystem. Working in a hands-on learning environment, students will learn techniques and tools for ingesting, transforming, and exporting data to and from the Hadoop Ecosystem for processing, as well as processing data using Map/Reduce, and other critical tools including Hive and Pig. Towards the end of the course, we’ll introduce other useful tools such as Spark and Oozie and discuss essential security in the ecosystem.
NOTE: This course agenda can be adjusted to add review and discussion of pending desired exam and Certifications as needed. We’ll collaborate with your organization to tune the agenda as needed to accommodate additional prep topics and review.
This “skills-centric” course is about 50% hands-on lab and 50% lecture, designed to train attendees in core big data/ Spark development and use skills, coupling the most current, effective techniques with the soundest industry practices. Throughout the course students will be led through a series of progressively advanced topics, where each topic consists of lecture, group discussion, comprehensive hands-on lab exercises, and lab review.
Working in a hands-on learning environment led by our expert Hadoop team, students will explore:
Need different skills or topics? If your team requires different topics or tools, additional skills or custom approach, this course may be further adjusted to accommodate. We offer additional Big Data / Data Science, Hadoop, development, programming, analytics, Python/R, Spark, and other related topics that may be blended with this course for a track that best suits your needs.
This in an intermediate-level course is geared for experienced developers seeking to be proficient in Hadoop, Spark tools & related technologies. Attendees should be experienced developers who are comfortable with programming languages. Students should also be able to navigate Linux command line, and who have basic knowledge of Linux editors (such as VI / nano) for editing code.
In order to gain the most from this course, attending students should be:
Please see the Related Courses tab for specific Pre-Requisite courses, Related Courses or Follow On training options. Our team will be happy to help you with recommendations for next steps in your Learning Journey
Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We will work with you to tune this course and level of coverage to target the skills you need most. Each section below also has an accompanying hands-on lab sprcific to the topics and concepts in that chaper. Please inquire for additional details.
Day One
Introduction to Hadoop
HDFS
Day Two
YARN
Data Ingestion
HBase
Oozie
Day Three
Working with Hive
Hive (Advanced)
Day Four
Hive in Cloudera (or tools of choice)
Working with Spark
Spark Basics
Spark Shell
RDDs (Condensed coverage)
Spark Dataframes & Datasets
Spark SQL
Spark API programming (Scala and Python)
Spark and Hadoop
Capstone project (Optional)
Optional Additional Topics – Please Inquire for Details
Machine Learning (ML / MLlib)
GraphX
Spark Streaming
Student Materials: Each participant will receive a Student Guide with course notes, code samples, software tutorials, step-by-step written lab instructions, diagrams and related reference materials and resource links. Students will also receive the project files (or code, if applicable) and solutions required for the hands-on work.
Hands-On Setup Made Simple! Our dedicated tech team will work with you to ensure our ‘easy-access’ cloud-based course environment is accessible, fully-tested and verified as ready to go well in advance of the course start date, ensuring a smooth start to class and effective learning experience for all participants. Please inquire for details and options.
Live scheduled classes are listed below or browse our full course catalog anytime
Check out custom training solutions planned around your unique needs and skills.
Exclusive materials, ongoing support and a free live course refresh with every class.
Mix, Match & Master!
2FOR1: Two Courses, One Price!
Enroll in *any* two public courses (for 2023 *OR* 2024 dates!) by October 31, for one price! Learn something new, or share the promo!
Special Offers
Limited Offer for most courses.
SAVE 50%