Artificial Intelligence & Data Science
Star icon
Most Popular
Trending Arrow Icon
Trending
Hands on Training icon
Hands On Training
Star icon
Trending Arrow Icon
Hands on Training icon

NoSQL, Big Data, and Spark Foundations Specialization

Course Cover
compare button icon

Course Features

icon

Duration

4 months

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Beginner

icon

Effort

2 hours per week

icon

Teaching Type

Self Paced

Course Description

In the data management field, Big Data Engineers and NoSQL-skilled professionals are highly in demand. This specialization is for people who want to learn the fundamental skills needed for working with Big Data and Apache Spark. The three courses are packed with information and cover the most popular NoSQL databases, Apache Cassandra and MongoDB, as well as Apache Spark, which is an analytics engine that allows for large-scale data processing. After a brief overview of the various NoSQL data repositories (not just SQL), you will be able to work with many of them, including MonogoDB, Cassandra, IBM Cloudant and MonogoDB. You will perform data management tasks such as inserting, updating and deleting data, creating and replicating databases, querying, indexing, aggregated & sharding it, and querying. You'll then gain a basic understanding of Big Data technologies like Hadoop, MapReduce and HDFS. Next, you will learn more about Apache Spark, Spark Dataframes and Spark SQL. Finally, you will be able to scale Spark with Kubernetes and the Spark Application UI. The final course will teach you how to use Spark Structured Streaming SparkML for extract, transform and load processing (ETL) as well as machine learning tasks. This specialization is for those who are just starting out in NoSQL or Big Data, whether they are a Data Engineer, Software Developer or IT Architect.

Course Overview

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Instructor-Moderated Discussions

Skills You Will Gain

What You Will Learn

Work with NoSQL databases to insert, update, delete, query, index, aggregate, and shard/partition data

Develop hands-on NoSQL experience working with MongoDB, Apache Cassandra, and IBM Cloudant

Develop foundational knowledge of Big Data and gain hands-on lab experience using Apache Hadoop, MapReduce,  Apache Spark, Spark SQL, and Kubernetes

Perform Extract, Transform and Load (ETL) processing and Machine Learning model training and deployment with Apache Spark

Course Instructors

Author Image

Rav Ahuja

Global Program Director

Rav Ahuja is a Global Program Director at IBM. Rav leads growth strategy, curriculum creation, and partner programs for the IBM Skills Network. He is the architect for the IBM Data Science Profession...
Author Image

Ramesh Sannareddy

Data Engineering Subject Matter Expert

Ramesh Sannareddy holds a Bachelors Degree in Information Systems (Birla Institute of Technology, Pilani). He has two and a half decades of experience in Information Technology Infrastructure Managem...
Author Image

Steve Ryan

Instructor & Content Developer

Steve Ryan is based in the UK and works for Skill-Up Technologies as a Senior Instructional Designer and Content Developer. His background is in IT, and had been a Microsoft Certified Trainer since 1...
Author Image

Karthik Muthuraman

Software Engineer (Machine Learning)

Karthik Muthuraman is a Software Engineer and Data Scientist at IBM’s Center for Open Source Data & AI Technologies (CODAIT). At CODAIT he works on solving ML problems using open source tools and mod...
Course Cover