Handling Streaming Data with GCP Dataflow

Course Cover
compare button icon

Course Features

icon

Duration

192 minutes

icon

Delivery Method

Online

icon

Available on

Downloadable Courses

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Advanced

icon

Teaching Type

Self Paced

icon

Video Content

192 minutes

Course Description

Dataflow allows developers the ability to transform and process data with simple, intuitive APIs. Dataflow uses the Apache Beam architecture to unify batch and stream processing. This course, Handling streaming data with GCP Dataflow, will show you how the GCP offers a variety of connectors that allow you to connect the Dataflow service to other GCP services. Next, you'll stream live Twitter feeds from the Pub/Sub messaging system and create your pipeline to process these messages. You will also learn how to create pipelines that have a side input and branching pipelines in order to send your final results to multiple sources. After completing this course, you will be able to create complex Dataflow pipelines and integrate them with other Google services. You can also test these pipelines on Google Cloud Platform.

Course Overview

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

What You Will Learn

In this course, handling streaming data with gcp dataflow, you will discover the gcp provides a wide range of connectors to integrate the dataflow service with other gcp services such as the pub/sub messaging service and the bigquery data warehouse

First, you will see how you can integrate your dataflow pipelines with other services to use as a source of streaming data or as a sink for your final results

Next, you will stream live twitter feeds to the pub/sub messaging service and implement your pipeline to read and process these twitter messages

Finally, you will implement pipelines with a side input, and branching pipelines to write your final results to multiple sinks

When you are finished with this course you will have the skills and knowledge to design complex dataflow pipelines, integrate these pipelines with other google services, and test and run these pipelines on the google cloud platform

Course Instructors

Author Image

Janani Ravi

Instructor

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework...
Course Cover