Exploring Big Data & Data Analysis in the Big Data Ecosystem

Gain the Skills Required to Use Integrated Big Data Solutions to Acquire, Process, Integrate and Analyze Big Data

TTDS6615

Intermediate

4 Days

Course Overview

In the Exploring Big Data course, you will learn to use an Integrated Big Data Solution to acquire, process, integrate, and analyze big data. You will be introduced to Big Data Cloud Service and will expand your Big Data technology portfolio by learning a wide range of big data acquisition, processing, integration, and analysis techniques. You will also explore engineered systems for Big Data, which provide a variety of data integration and analysis capabilities. Analysis options include Big Data SQL, Data Mining, R Enterprise, and Big Data Discovery.

Course Objectives

This course is approximately 50% hands-on, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises.  Our engaging instructors and mentors are highly experienced practitioners who bring years of current "on-the-job" experience into every classroom.  

Working in a hands-on learning environment, led by our Big Data expert instructor, students will learn to:

  • Define Big Data and Identify Big Data Use Cases
  • Review Big Data Management Architecture and Engineered Systems
  • Describe an Integrated Big Data Solution and its components
  • Examine MapReduce programs and balance MapReduce jobs
  • Use NoSQL Database
  • Use XQuery for Hadoop
  • Install, use, and administer the Big Data Appliance
  • Provide data security and enable resource management
  • Use the BigDataLite Virtual Machine
  • Use the Hadoop Distributed File System (HDFS) to store, distribute, and replicate data across the nodes in the Hadoop cluster.
  • Acquire big data using the HDFS Command Line Interface, Flume, and NoSQL Database.
  • Use MapReduce and YARN for distributed processing of the data stored in the Hadoop cluster.
  • Process big data using MapReduce, YARN, Hive, Pig, XQuery for Hadoop, Solr, and Spark.
  • Integrate big data and warehouse data using Sqoop, Big Data Connectors, Copy to BDA, Big Data SQL, Data Integrator, and GoldenGate.
  • Analyze big data using Big Data SQL, Advanced Analytics technologies, and Big Data Discovery.
  • Use and manage Big Data Appliance.
  • Secure your data.
  • Understand the key features and benefits of Big Data Cloud Service

Need different skills or topics? If your team requires different topics or tools, additional skills, or a custom approach, this course can be adjusted to accommodate them. We offer additional Big Data / Data Science, Hadoop, programming, analytics, Python/R, and other related topics that may be blended with this course for a track that best suits your team. We will collaborate with you to understand your needs and target the course to your specific learning objectives and goals.

Course Prerequisites

This course is geared for attendees who wish to use an Integrated Big Data Solution to acquire, process, integrate, and analyze big data.

Attendees should possess the following incoming skills:

  • Basic to intermediate IT skills and Big Data knowledge
  • Good foundational mathematics or logic skills
  • Basic Linux skills, including familiarity with command-line commands such as ls, cd, cp, and su

Please see the Related Courses tab for specific Pre-Requisite courses, Related Courses that offer similar skills or topics, and next-step Learning Path recommendations.

Course Agenda

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We will work with you to tune this course and level of coverage to target the skills you need most.

  • Big Data and the Information Management System
  • Using Big Data Lite Virtual Machine
  • Introduction to the Big Data Ecosystem
  • Introduction to the Hadoop Distributed File System (HDFS)
  • Acquire Data using CLI, Fuse-DFS, and Flume
  • Using and Administering NoSQL Database
  • Introduction to MapReduce
  • Using YARN to Manage Resources
  • Overview of Apache Hive and Apache Pig
  • Overview of Cloudera Impala, Solr, and Apache Spark
  • Using XQuery for Hadoop
  • Options for Integrating Your Big Data
  • Using Big Data SQL
  • Using Advanced Analytics
  • Introducing Big Data Discovery
  • Using the Big Data Appliance (BDA)
  • Managing the Big Data Appliance
  • Balancing MapReduce Jobs
  • Securing Your Data on the BDA
  • Introduction to Big Data Cloud Service (BDCS)

Course Materials

Each student will receive a Student Guide with course notes, code samples, software tutorials, step-by-step written lab instructions, diagrams and related reference materials and links (as applicable). Students will also receive the project files (or code, if applicable) and solutions required for the hands-on work.

Lab Setup Made Simple. All course labs and solutions, data sets, Tableau course software (limited version, for course use only), detailed courseware, lab guides and resources (as applicable) are provided for attendees in our easy-access remote lab environment, which requires no installation, for the duration of the course. Our tech team will help set up, test and verify lab access for each attendee prior to the course start date, ensuring a smooth start to class and a successful hands-on course experience for all participants.
