Playing TicTacToe with Reinforcement Learning and OpenAI Gym

Course Cover
compare button icon

Course Features

icon

Duration

45 minutes

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

45 minutes

Course Description

Learn to design and train an agent that is never lost when playing TicTacToe by using an Reinforcement Learning algorithm known as Temporal Difference Learning and Open AI GymReinforcement Learning is a distinct type of machine learning that is different from unsupervised and supervised learning. It's learning how to perform a task by trial and trial and. It is also an effective method to teach robots how to engage in games. The OpenAI Gym is a Python library that will standardize the interaction between the Agent/User/Robot with the environment to allow interaction with a range different Gym environments. Reinforcement Learning along with OpenAI Gym is a great combination that can be utilized to develop games that incorporate the AI aspect.In this Guided Project, you will be taught ways to work with the OpenAI Gym environment. We will work with an environment that was designed specifically to play TicTacToe and you will be taught how to create customized environments. We will also learn about reinforcement learning and an algorithm called Temporal Difference Learning, and how to create an Agent that uses Temporal Difference Learning to play TicTacToe. We will also play TicTacToe using our specially trained agent and our environment. We will also look at an illustration of an TicTacToe game that uses a graphic user interface.

Course Overview

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Instructor-Moderated Discussions

projects-img

Case Studies, Captstone Projects

Skills You Will Gain

Prerequisites/Requirements

Just a web browser and Python programming knowledge are required

What You Will Learn

You'll be able to create an agent that uses Temporal Difference Learning to play TicTacToe

You'll be able to explain what Reinforcement Learning is

You'll be able to explain what Temporal Difference Learning is

You'll be able to play some games against the trained agent

You'll be able to train and Test the agents using the TicTacToe environment

You'll be able to work with an OpenAI Gym environment and the TicTacToe environment

You'll be able to install a custom OpenAI Gym environment

Course Instructors

Author Image

Azim Hirjani

Cognitive Data Scientist at IBM

Azim Hirjani is a Data Scientist Intern at IBM and is pursuing a BS in Computer Science from the University of Toronto. He creates content for various IBM Data Science courses on platforms such as Co...
Course Cover