Machine Learning with PySpark

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

Spark is a powerful tool for Big Data. Spark transparently manages the allocation of compute tasks within a cluster. This allows for quick operations and lets you concentrate on the analysis, not worrying about the technical details. In this course you'll learn how to get data into Spark and then delve into the three fundamental Spark Machine Learning algorithms: Linear Regression, Logistic Regression/Classifiers, and creating pipelines. This course will also cover analysing large quantities of spam text messages as well as flight delays. This will give you the knowledge and skills to harness Spark's power for Machine Learning projects.

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

Prerequisites/Requirements

Introduction to PySpark

Statistical Thinking in Python (Part 1)

What You Will Learn

Learn how to make predictions with Apache Spark

In this course you'll learn how to get data into Spark and then delve into the three fundamental Spark Machine Learning algorithms: Linear Regression, Logistic Regression/Classifiers, and creating pipelines

Along the way you'll analyse a large dataset of flight delays and spam text messages

Course Instructors

Author Image

Andrew Collier

Data Scientist @ Exegetic Analytics

Andrew Collier is a Data Scientist, working mostly in R and Python but also dabbling in a wide range of other technologies. When not in front of a computer he spends time with his family and runs obsessively.

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover