Sample-based Learning Methods

Master reinforcement learning algorithms through hands-on implementation of Monte Carlo and TD methods.

This course cannot be purchased separately - to access the complete learning experience, graded assignments, and earn certificates, you'll need to enroll in the full Reinforcement Learning Specialization program. You can audit this specific course for free to explore the content, which includes access to course materials and lectures. This allows you to learn at your own pace without any financial commitment.

4.8

(1,225 ratings)

33,124 already enrolled

Instructors:

Martha White

Adam White

English

پښتو, বাংলা, اردو, 3 more

This course includes

22 Hours

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

Free

Add to compare

What you'll learn

Implement Temporal-Difference learning and Monte Carlo methods

Understand exploration strategies in sampled experience

Apply TD algorithm for value function estimation

Master Expected Sarsa and Q-learning implementation

Develop model-based approaches using Dyna architecture

Skills you'll gain

Reinforcement Learning

Machine Learning

Temporal Difference Learning

Monte Carlo Methods

Q-Learning

Function Approximation

AI Systems

Algorithm Implementation

Python Programming

Model-Based Planning

This course includes:

3 Hours PreRecorded video

4 quizzes, 1 assignment

Access on Mobile, Tablet, Desktop

FullTime access

Shareable certificate

Closed caption

Get a Completion Certificate

Share your certificate with prospective employers and your professional network on LinkedIn.

Created by

University of Alberta

Provided by

Coursera

Top companies offer this course to their employees

Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.

There are 5 modules in this course

This advanced course focuses on sample-based learning methods in reinforcement learning. Students learn about Monte Carlo methods and temporal difference learning for estimating value functions from actual experience. The curriculum covers key algorithms including Q-learning, Expected Sarsa, and the Dyna architecture. Through hands-on programming assignments, students implement these methods to solve practical problems while understanding the balance between exploration and exploitation in learning processes.

Welcome to the Course!

Module 1 · 0 Hours to complete

Monte Carlo Methods for Prediction & Control

Module 2 · 3 Hours to complete

Temporal Difference Learning Methods for Prediction

Module 3 · 5 Hours to complete

Temporal Difference Learning Methods for Control

Module 4 · 5 Hours to complete

Planning, Learning & Acting

Module 5 · 7 Hours to complete

Fee Structure

Instructors

Martha White

4.7 rating

802 Reviews

99,327 Students

4 Courses

Innovator in Machine Learning and Reinforcement Learning

Martha White is an Assistant Professor in the Department of Computing Sciences at the University of Alberta, where she specializes in developing algorithms for agents that learn continuously from streams of data, focusing on representation learning and reinforcement learning. As a Principal Investigator at the Alberta Machine Intelligence Institute (AMII) and director of the Reinforcement Learning and Artificial Intelligence Lab (RLAI), she is at the forefront of research aimed at enhancing adaptive learning systems. Martha holds a Ph.D. in Computing Science from the University of Alberta and has published extensively on topics related to machine learning, including over 40 papers in top-tier journals and conferences. Her commitment to advancing knowledge in AI is complemented by her passion for mentoring emerging researchers and promoting diversity in computing. Outside of her academic pursuits, Martha enjoys soccer, outdoor activities, cooking, and reading science fiction, reflecting her diverse interests beyond the realm of technology.

Adam White

4.7 rating

802 Reviews

99,327 Students

4 Courses

Leader in Reinforcement Learning and Artificial Intelligence

Adam White is an Assistant Professor in the Department of Computing Sciences at the University of Alberta and a Senior Research Scientist at DeepMind. His research centers on artificial intelligence, specifically on replicating or simulating human-level intelligence in both physical and simulated agents. Adam's work explores how intelligence can be modeled through reinforcement learning agents that interact with unknown environments, learning from scalar reward signals rather than explicit feedback. He has taught courses on Reinforcement Learning and Artificial Intelligence at both the University of Alberta and Indiana University, contributing to the education of future leaders in AI. As a Principal Investigator of the Reinforcement Learning and Artificial Intelligence Lab (RLAI) and a fellow at the Alberta Machine Intelligence Institute (AMII), Adam is dedicated to advancing the field through innovative research and practical applications. His contributions include developing new algorithms for reinforcement learning, creating the RL-Glue framework for reinforcement learning experiments, and demonstrating learning capabilities in mobile robots. Outside of academia, Adam enjoys playing Gaelic football and exploring the natural world, reflecting his diverse interests beyond his professional pursuits.

This course includes

22 Hours

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

Free

Add to compare

Testimonials

Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.

4.8 course rating

1,225 ratings

Frequently asked questions

Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.

When will I have access to the lectures and assignments?

What will I get if I subscribe to this Specialization?

What is the refund policy?

Is financial aid available?