Hands On Training

Playing TicTacToe with Reinforcement Learning and OpenAI Gym

via

Cognitive Class

Course Features

Duration

45 minutes

Delivery Method

Online

Available on

Limited Access

Accessibility

Mobile, Desktop, Laptop

Language

English

Subtitles

English

Level

Intermediate

Teaching Type

Self Paced

Video Content

45 minutes

Course Description

Learn to design and train an agent that is never lost when playing TicTacToe by using an Reinforcement Learning algorithm known as Temporal Difference Learning and Open AI GymReinforcement Learning is a distinct type of machine learning that is different from unsupervised and supervised learning. It's learning how to perform a task by trial and trial and. It is also an effective method to teach robots how to engage in games. The OpenAI Gym is a Python library that will standardize the interaction between the Agent/User/Robot with the environment to allow interaction with a range different Gym environments. Reinforcement Learning along with OpenAI Gym is a great combination that can be utilized to develop games that incorporate the AI aspect.In this Guided Project, you will be taught ways to work with the OpenAI Gym environment. We will work with an environment that was designed specifically to play TicTacToe and you will be taught how to create customized environments. We will also learn about reinforcement learning and an algorithm called Temporal Difference Learning, and how to create an Agent that uses Temporal Difference Learning to play TicTacToe. We will also play TicTacToe using our specially trained agent and our environment. We will also look at an illustration of an TicTacToe game that uses a graphic user interface.

Course Overview

International Faculty

Post Course Interactions

Instructor-Moderated Discussions

Case Studies, Captstone Projects

Skills You Will Gain

Prerequisites/Requirements

Just a web browser and Python programming knowledge are required

What You Will Learn

You'll be able to create an agent that uses Temporal Difference Learning to play TicTacToe

You'll be able to explain what Reinforcement Learning is

You'll be able to explain what Temporal Difference Learning is

You'll be able to play some games against the trained agent

You'll be able to train and Test the agents using the TicTacToe environment

You'll be able to work with an OpenAI Gym environment and the TicTacToe environment

You'll be able to install a custom OpenAI Gym environment

Course Content

Expand all sections

Module 1: Install a custom OpenAI Gym environment

Module 2: OpenAI Gym environment and the TicTacToe environment

Module 3: What Reinforcement Learning is

Module 4: What Temporal Difference Learning is

Course Instructors

Azim Hirjani

Cognitive Data Scientist at IBM

Azim Hirjani is a Data Scientist Intern at IBM and is pursuing a BS in Computer Science from the University of Toronto. He creates content for various IBM Data Science courses on platforms such as Co...