Information Technology
Hands on Training icon
Hands On Training
Hands on Training icon

Introduction to PySpark

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Beginner

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

This course will show you how to use Spark with Python. Spark allows you to perform parallel computations using large data sets. It is easy to integrate into Python. PySpark, the Python package that makes all of this magic possible, is responsible. This package allows you to access data on flights between Portland, Washington and Seattle. This package will show you how to manage the data and build a machine-learning pipeline that predicts if flights will be delayed. To get into high-performance machine learning, you can spark your Python code!

blur
blur

Highlights

blur

Pedagogy

Top 30 Percentile

blur

Rating & Reviews

Top 30 Percentile

blur

Parameters

cv-icon

Pedagogy

This course empowers you with essential Python Programming skills, enabling practical application in your daily life. Gain proficiency to effectively apply these skills in real-world situations, enhancing your capabilities and experiences. An exceptional course in Python Programming, this stands out for its Self Paced learning approach. Learners have the flexibility to progress at their own speed, tailoring the experience to their individual needs. With a focus on cultivating industry-relevant skills, this course ensures that learners attain a skillset aligned with current industry demands.

cv-icon

Rating & Reviews

This highly acclaimed course is among the top-rated in Python Programming, boasting a rating greater than 4 and an overall rating of 5.0. Its exceptional quality sets it apart, making it an excellent choice for individuals seeking top-notch learning experience in Python Programming.

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

Prerequisites/Requirements

Introduction to Python

What You Will Learn

Learn to implement distributed data management and machine learning in Spark using the PySpark package

In this course, you'll learn how to use Spark from Python!

You'll use this package to work with data about flights from Portland and Seattle

You'll learn to wrangle this data and build a whole machine learning pipeline to predict whether or not flights will be delayed

Course Instructors

Author Image

Nick Solomon

Data Scientist

Nick has a degree in mathematics with a concentration in statistics from Reed College. He's worked on many data science projects in the past, doing everything from mapping crime data to developing ne...
Author Image

Lore Dirick

Director of Data Science Education at Flatiron School

Lore is a data scientist with expertise in applied finance. She obtained her PhD in Business Economics and Statistics at KU Leuven, Belgium. During her PhD, she collaborated with several banks workin...

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover