Star icon
Most Popular

Data Analyst with Python

blur

Learn Path Description

Gain the career-building Python skills you need to succeed as a data analyst. No coding experience required. In this track, you’ll learn how to import, clean, manipulate, and visualize data—all integral skills for any aspiring data professional or researcher. Through interactive exercises, you’ll get hands-on with some of the most popular Python libraries, including pandas, NumPy, Matplotlib, and many more. You’ll also gain experience of working with real-world datasets, including data from the Titanic and from Twitter’s streaming API, to grow your data manipulation and exploratory data analysis skills, before moving on to learn the SQL skills you'll need to query data from databases and join tables. Start this track, grow your Python and SQL skills, and begin your journey to becoming a confident data analyst.

Skills You Will Gain

Courses In This Learning Path

blur
icon

Total Duration

4 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to Data Science in Python

Start your journey to Data Science. Data Science is not a programming language. Bayes, the kidnapped Golden Retriever, will be solved by data. Additionally, you'll learn Python syntax as well as popular Data Science modules such Matplotlib (for charts & graphs) or Pandas (for table data).

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Intermediate Python

Python is essential for any data scientist looking to be a data scientist. Learn how to visualize real data with Matplotlib's functions and how to use data structure like the pandas DataFrame and dictionary. You will be able combine the knowledge you have gained about control flow and boolean reasoning in Python to solve a case using hacker stats.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Data Manipulation with pandas

Pandas is the most popular Python library and it's used for data manipulation. This course will show you how to manipulate DataFrames. This course will teach you how to extract, filter, and transform real data for analysis. Pandas will teach you the basics of data science. To learn how to clean up, calculate, import, and visualize statistics using pandas, you will be using real-world data like global temperature time series or Walmart sales figures. This will allow you to improve the power of Python.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Joining Data with pandas

A Data Scientist should be able work with multiple data sets. Pandas are a crucial component of the Python data scientist ecosystem. There are 5 million pandas questions on Stack Overflow. Pandas will teach you how to combine, organise, join and reshape multiple DataFrames. You will be using datasets from the World Bank as well as the City Of Chicago. This course will provide a solid foundation for data-joining using pandas.

blur
icon

Total Duration

4 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to Data Visualization with Matplotlib

Visualizing data in plots and figures helps you to see patterns and provides insight. Visualizations can also be useful for data analysts or other data users. This course will show you how to use Matplotlib which is a powerful Python data visualization program. Matplotlib allows you to create rich visualizations using many types of data. This course will show you how to create visualizations using different types of data. These visualizations can also be automated, customized, and shared.

blur
icon

Total Duration

4 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to Data Visualization with Seaborn

Seaborn is a powerful Python library that makes it easy to create informative and attractive visualizations. This course will introduce Seaborn and show you how to visualize data with box plots and scatter plots. While you examine survey responses about student hobbies and factors that influence academic success, this will be done. Seaborn's benefits as a statistical visualization tool will be discussed, along with how it calculates confidence intervals. Seaborn will be used in a variety of situations to help you analyze and communicate your data.

blur
icon

Total Duration

16 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Data Manipulation with Python

Data in real life is messy. Pandas libraries are valuable because they make it easy to organize data. Pandas makes data manipulation easy by allowing you to extract, filter, and transform data in DataFrames. This allows for fast and reliable data analysis. This is the right track to help you improve your data wrangling abilities. As you learn to use pandas to create multiple DataFrames, you will also be able to analyze real-world data. Additionally, you will get hands-on experience in creating visualizations, merging, and combining data. You'll use your newly acquired data manipulation skills to analyze how weather and gender affect police behavior at the end of this track. This track will teach you how to save time when manipulating data with pandas.

blur
icon

Total Duration

3 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to Importing Data in Python

Data scientists are responsible to clean, manipulate, and interpret data. Before you can do all of this, however, you need to know how to import data in Python. This course will show you how to import data from flat files such as.txt and.csv into Python. It also covers files native to other programs like Excel spreadsheets, Stata files and SAS files. This course also covers relational databases like SQLite or PostgreSQL.

blur
icon

Total Duration

2 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Intermediate Importing Data in Python

Data scientists are responsible to clean, manipulate, and interpret data. Before you can do all of this, however, you need to know how to import data in Python. This course teaches you how to import data from flat files such as.txt and.csv into Python. It also covers files native to other programs like Excel spreadsheets, Stata files and SAS files. This course will help you expand your knowledge and teach you how to pull data from Application Programming Interfaces APIs. For example, the Twitter streaming API allows us to stream live Tweets.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Cleaning Data in Python

Data cleansing is a crucial task in data science, as it ensures accurate results and prevents incorrect conclusions. Data scientists spend roughly 80% of their time cleaning and manipulating data, with only 20% dedicated to analysis. Without proper data cleaning, both machine learning and data analysis will fail.

This course on data science with Python focuses on teaching you how to identify, treat, and fix various data-cleaning problems. It covers topics such as dealing with wrong data types, ensuring data falls within the correct range, handling missing data, and managing record linking.

By taking this course, you will learn how to clean datasets using Python. It provides a comprehensive understanding of the techniques and tools necessary to clean and manipulate data effectively. The course emphasizes practical applications and provides hands-on experience in Python.

Some of the key keywords associated with this course include data science with Python training, cleaning the data in Python, Python for data analysis, Python and data science course, advanced Python for data science, and more. By enrolling in this course, you will gain expertise in utilizing Python libraries for data science and analysis.

Overall, this course is designed to equip you with the essential skills and knowledge required to effectively clean and manipulate data for accurate analysis. It is ideal for anyone interested in pursuing a career in data science or for professionals seeking to enhance their existing skills in Python and data cleaning techniques.

blur
icon

Total Duration

13 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Importing & Cleaning Data with Python

Working in Python requires that you know how to prepare your data. You must know what to do before you can get the real insights. This track will teach you how to import data from many sources including.csv and.xls. Once you have learned how to import your data, it is time to prepare your data to be analysed. As you work with real-world data like restaurant reviews, you'll learn how to deal with incorrect data types, fix missing data, and link records. The Tweepy package will allow you to access Twitter's API and scrape the internet for data. This track will give you the data prep skills that you need to clean up your data.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Exploratory Data Analysis in Python

How can data be transformed into answers? Exploratory data analysis allows for you to examine datasets, answer queries, and visualize the results. This course will help you validate and clean data, visualize relationships between variables, answer questions, and use regression models for explaining and predicting. The course will cover data related to health and demographics, including the National Survey of Family Growth and the General Social Survey. These methods can be used in any field of science, engineering or business. Pandas is a Python library that lets you work with data. You can also find other core Python libraries, such as SciPy for regression or Matplotlib to visualize data. These skills and tools will enable you to make fascinating discoveries and work with real data.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Analyzing Police Activity with pandas

Once you have mastered the basics of pandas you can use your knowledge to answer questions about real data. The Stanford Open Policing Project dataset and its effects on police behavior will be examined. This will give you more experience in cleaning up data, creating visualizations, and manipulating time series data. Analyzing police activity using pandas will provide valuable experience in data analysis from start to finish. This will help you prepare for your data science career.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Working with Categorical Data in Python

Data scientists need to be able use non-numerical data such as marital status and blood type to understand, summarize, or use them. This course will show you how to use pandas and seaborn to visualize and manipulate categorical information. Through hands-on exercises, you will learn how to use pandas and seaborn to visualize and manipulate categorical data. You will be able use a variety datasets to improve your ability to work in categorical data. This includes Las Vegas trip reviews, and the characteristics of adoptable dogs.

blur
icon

Total Duration

4 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to SQL

Data scientists transform raw data into actionable insights. Relational databases are structured tables. These databases store large amounts of data from all over the world, including transaction histories for customers and electronic medical records. Data scientists use these tables to modify and extract data using SQL language. This course will teach you syntax for SQL common to multiple databases such as PostgreSQL and MySQL. This course will provide all the information that you need to get started working with databases.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Streamlined Data Ingestion with pandas

Before you can analyze the data, you need to first acquire it. This course will show you how to build pipelines to import data from common storage formats. Pandas, a Python library specializing in analytics, can be used to extract data. This can include spreadsheets of survey responses that are linked to a database of public service requests. Learn how to modify imports and fix problems such as incorrect data types. Learn how to create custom data sets from various sources.

blur
icon

Total Duration

4 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Introduction to Relational Databases in SQL

SQL is the most common way to query databases. Databases can also be used for other purposes. You can model many phenomena and their relationships. This will help you ensure your data is consistent and has a better structure. You will see this when you examine real data about questionable university affiliations. This will show you how databases can be used for your benefit column-by-column and table-by-table. You will learn how to create tables and relationships as well as how to maintain data integrity. You will also learn about constraints and other unique features.

blur
icon

Total Duration

5 hours

icon

Level

Intermediate

icon

Learn Type

Certifications

Joining Data in SQL

Once you've completed the Introduction to SQL course and have a good understanding of the basics of SQL, it's time to improve your queries with joins or relational set theory. This course will teach you about the power and benefits of joining tables, as well as the unique characteristics of cities around the world. Learn inner and outer joins as well as self, semi, and anti joins. Cross joins are a vital tool in any PostgreSQL wizards' toolbox. After you've mastered intersections, exception clauses, and unions using simple diagrams and examples you won't have to worry about setting it up theory. You will also learn about subqueries. These ideas can also be visualized using Venn diagrams or other linking illustrations.

blur
icon

Total Duration

4 hours

icon

Level

Beginner

icon

Learn Type

Certifications

Introduction to Databases in Python

This course will cover the basics of Python and SQL. Because databases are everywhere, analysts, data scientists, engineers, and others need to interact with them constantly, this course will teach you the basics of SQL and Python. SQLAlchemy is a Python SQL toolkit SQLAlchemy that makes it easy query, build, and write to databases like SQLite and MySQL.

blur