Information Technology
Hands on Training icon
Hands On Training
Hands on Training icon

Web Scraping in Python

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

Data science has always recognized the importance of tools that allow for the retrieval and analysis of information stored on the Internet. This course will show you how to navigate HTML code and create tools that automatically crawl websites. Although we will be using the Python library scrapy to scrape, many of these techniques can be used with other Python libraries like BeautifulSoup and Selenium. This course will give you a solid understanding of the html structure and the tools needed to access it. It is also possible to create simple scrapy spiders which can crawl the web on a large scale.

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

projects-img

Case Studies, Captstone Projects

Skills You Will Gain

Prerequisites/Requirements

Intermediate Python

What You Will Learn

Learn to retrieve and parse information from the internet using the Python library scrapy

Upon the completion of this course, you will have a strong mental model of html structure, will be able to build tools to parse html code and access desired information, and create a simple scrapy spiders to crawl the web at scale

Course Instructors

Author Image

Thomas Laetsch

Data Scientist at New York University

Since January 2016, Thomas Laetsch has been a Moore-Sloan Post-Doctoral Associate in the Center for Data Science at NYU. In 2012, he received his PhD in mathematics from the University of California,...

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover