Hands-on Data Analysis with Pandas

Quick Start to Using the Pandas Library to Reshape, Clean, Aggregate, Analyze & Visualize Your Data

TTPS4878

Intermediate

3 Days

Course Overview

Data analysis has become a necessary skill in a variety of domains where knowing how to work with data and extract insights can generate significant value. Geared for data team members with prior Python scripting experience, Hands-on Data Analysis with Pandas will show you how to analyze your data, get started with machine learning, and work effectively with Python libraries often used for data science, such as pandas, NumPy, matplotlib, seaborn, and scikit-learn.

Using real-world datasets, you will learn how to use the powerful pandas library to perform data wrangling to reshape, clean, and aggregate your data. Then, you will be able to conduct exploratory data analysis by calculating summary statistics and visualizing the data to find patterns. In the concluding lessons, you will explore some applications of anomaly detection, regression, clustering, and classification using scikit-learn to make predictions based on past data.  Students will leave the course armed with the skills required to use pandas to ensure the veracity of their data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets.
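
To give a flavor of the wrangling workflow described above (clean, reshape, aggregate), here is a minimal sketch. It is not taken from the course labs: the temperature readings, column names, and station labels are made up for illustration, and the steps shown are only one reasonable way to combine these pandas calls.

```python
import pandas as pd

# Hypothetical wide-format temperature readings (invented for illustration).
raw = pd.DataFrame({
    "date": ["2021-01-01", "2021-01-01", "2021-01-02", "2021-01-03"],
    "station": ["A", "A", "A", "B"],
    "TMIN": [-2.0, -2.0, 1.0, None],
    "TMAX": [5.0, 5.0, 7.0, 9.0],
})

# Clean: parse the date strings and drop the exact duplicate row.
clean = raw.assign(date=pd.to_datetime(raw["date"])).drop_duplicates()

# Reshape: wide to long, one row per (date, station, measurement).
long = clean.melt(
    id_vars=["date", "station"],
    var_name="measurement",
    value_name="temp_c",
)

# Handle missing data: here we simply drop readings with no value.
long = long.dropna(subset=["temp_c"])

# Aggregate: mean temperature per station and measurement type.
summary = long.groupby(["station", "measurement"])["temp_c"].mean().unstack()

print(summary)
```

Dropping missing readings is a simplification chosen for this sketch; depending on the dataset, imputing or flagging them may be more appropriate.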

Course Objectives

This course is approximately 50% hands-on, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises.  Our engaging instructors and mentors are highly experienced practitioners who bring years of current "on-the-job" experience into every classroom. 

Working in a hands-on learning environment, guided by our expert team, attendees will learn to:

  • Understand how data analysts and scientists gather and analyze data
  • Perform data analysis and data wrangling using Python
  • Combine, group, and aggregate data from multiple sources
  • Create data visualizations with pandas, matplotlib, and seaborn
  • Apply machine learning (ML) algorithms to identify patterns and make predictions
  • Use Python data science libraries to analyze real-world datasets
  • Use pandas to solve common data representation and analysis problems
  • Build Python scripts, modules, and packages for reusable analysis code
  • Perform efficient data analysis and manipulation tasks using pandas
  • Apply pandas to different real-world domains with the help of step-by-step demonstrations
  • Get accustomed to using pandas as an effective data exploration tool.

Need different skills or topics?  If your team requires different topics or tools, additional skills, or a custom approach, this course can be adjusted to accommodate them.  We offer additional Python, data science, AI / machine learning, and other related topics that may be blended with this course for a track that best suits your needs. Our team will collaborate with you to understand your needs and will target the course to your specific learning objectives and goals.

Course Prerequisites

This course is geared for Python-experienced attendees who want the skills needed to use pandas to ensure the veracity of their data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets.

Take Before: Students should have attended the following course(s) or have at least equivalent skills:

  • TTDS6600     Understanding Data Science | A Technical Overview (1 day; helpful but not required)
  • TTPS4800     Introduction to Python Programming (3 days)

Please see the Related Courses tab for specific course recommendations and links.

Course Agenda

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We’ll work with you to tune this course and level of coverage to target the skills you need most.

  1. Introduction to Data Analysis
     • Fundamentals of data analysis
     • Statistical foundations
     • Setting up a virtual environment
  2. Working with Pandas DataFrames
     • Pandas data structures
     • Bringing data into a pandas DataFrame
     • Inspecting a DataFrame object
     • Grabbing subsets of the data
     • Adding and removing data
  3. Data Wrangling with Pandas
     • What is data wrangling?
     • Collecting temperature data
     • Cleaning up the data
     • Restructuring the data
     • Handling duplicate, missing, or invalid data
  4. Aggregating Pandas DataFrames
     • Database-style operations on DataFrames
     • DataFrame operations
     • Aggregations with pandas and numpy
     • Time series
  5. Visualizing Data with Pandas and Matplotlib
     • An introduction to matplotlib
     • Plotting with pandas
     • The pandas.plotting subpackage
  6. Plotting with Seaborn and Customization Techniques
     • Utilizing seaborn for advanced plotting
     • Formatting
     • Customizing visualizations
  7. Financial Analysis - Bitcoin and the Stock Market
     • Building a Python package
     • Data extraction with pandas
     • Exploratory data analysis
     • Technical analysis of financial instruments
     • Modeling performance
  8. Rule-Based Anomaly Detection
     • Simulating login attempts
     • Exploratory data analysis
     • Rule-based anomaly detection
  9. Getting Started with Machine Learning in Python
     • Learning the lingo
     • Exploratory data analysis
     • Preprocessing data
     • Clustering
     • Regression
     • Classification
  10. Making Better Predictions - Optimizing Models
     • Hyperparameter tuning with grid search
     • Feature engineering
     • Ensemble methods
     • Inspecting classification prediction confidence
     • Addressing class imbalance
     • Regularization
  11. Machine Learning Anomaly Detection
     • Exploring the data
     • Unsupervised methods
     • Supervised methods
     • Online learning
  12. The Road Ahead
     • Data resources
     • Practicing working with data
     • Python practice

Course Materials

All course software (limited versions, for course use only), courseware files or course notes (as applicable), labs / data sets and solutions (as applicable) are provided for you in our “easy access / no install required” high-speed remote lab environment.  In most cases, we can also offer a local (non-cloud) setup as an alternative. Either way, our dedicated live tech team works with every student before the course start date to make sure everyone has working access and is ready to go, for a smooth delivery and a great hands-on experience. All your coursework can be accessed or downloaded after class, so you never lose your work or materials. Please ask for details.

Attend a Course

Upcoming open enrollment dates for this course are listed below. Register online, or call 844-475-4559 toll free to connect with our Registrar for assistance. If you need additional date options, please contact us for scheduling.

Course Title                          Days      Date                Time                        Price
Hands-on Data Analysis with Pandas    3 Days    Jan 19 to Jan 21    10:00 AM to 06:00 PM EST    $2,395.00
Hands-on Data Analysis with Pandas    3 Days    Mar 2 to Mar 4      10:00 AM to 06:00 PM EST    $2,395.00
Hands-on Data Analysis with Pandas    3 Days    Apr 13 to Apr 15    10:00 AM to 06:00 PM EST    $2,395.00
Hands-on Data Analysis with Pandas    3 Days    May 18 to May 20    10:00 AM to 06:00 PM EST    $2,395.00
Hands-on Data Analysis with Pandas    3 Days    Jun 22 to Jun 24    10:00 AM to 06:00 PM EST    $2,395.00

Year-End Savings!
Register today to receive *50% off all 2021 Public Classes*!  Check out our Current Offers for Individuals, Teams and Organizations to Learn for Less!
