Master reinforcement learning algorithms through hands-on implementation of Monte Carlo and TD methods.
Master reinforcement learning algorithms through hands-on implementation of Monte Carlo and TD methods.
This course cannot be purchased separately - to access the complete learning experience, graded assignments, and earn certificates, you'll need to enroll in the full Reinforcement Learning Specialization program. You can audit this specific course for free to explore the content, which includes access to course materials and lectures. This allows you to learn at your own pace without any financial commitment.
4.8
(1,225 ratings)
33,124 already enrolled
Instructors:
Martha White
Adam White
English
پښتو, বাংলা, اردو, 3 more
What you'll learn
Implement Temporal-Difference learning and Monte Carlo methods
Understand exploration strategies in sampled experience
Apply TD algorithm for value function estimation
Master Expected Sarsa and Q-learning implementation
Develop model-based approaches using Dyna architecture
Skills you'll gain
This course includes:
3 Hours PreRecorded video
4 quizzes, 1 assignment
Access on Mobile, Tablet, Desktop
FullTime access
Shareable certificate
Closed caption
Get a Completion Certificate
Share your certificate with prospective employers and your professional network on LinkedIn.
Created by
Provided by
Top companies offer this course to their employees
Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.
There are 5 modules in this course
This advanced course focuses on sample-based learning methods in reinforcement learning. Students learn about Monte Carlo methods and temporal difference learning for estimating value functions from actual experience. The curriculum covers key algorithms including Q-learning, Expected Sarsa, and the Dyna architecture. Through hands-on programming assignments, students implement these methods to solve practical problems while understanding the balance between exploration and exploitation in learning processes.
Welcome to the Course!
Module 1 · 0 Hours to complete
Monte Carlo Methods for Prediction & Control
Module 2 · 3 Hours to complete
Temporal Difference Learning Methods for Prediction
Module 3 · 5 Hours to complete
Temporal Difference Learning Methods for Control
Module 4 · 5 Hours to complete
Planning, Learning & Acting
Module 5 · 7 Hours to complete
Fee Structure
Instructors
Martha White
4.7 rating
802 Reviews
99,327 Students
4 Courses
Innovator in Machine Learning and Reinforcement Learning
Martha White is an Assistant Professor in the Department of Computing Sciences at the University of Alberta, where she specializes in developing algorithms for agents that learn continuously from streams of data, focusing on representation learning and reinforcement learning. As a Principal Investigator at the Alberta Machine Intelligence Institute (AMII) and director of the Reinforcement Learning and Artificial Intelligence Lab (RLAI), she is at the forefront of research aimed at enhancing adaptive learning systems. Martha holds a Ph.D. in Computing Science from the University of Alberta and has published extensively on topics related to machine learning, including over 40 papers in top-tier journals and conferences. Her commitment to advancing knowledge in AI is complemented by her passion for mentoring emerging researchers and promoting diversity in computing. Outside of her academic pursuits, Martha enjoys soccer, outdoor activities, cooking, and reading science fiction, reflecting her diverse interests beyond the realm of technology.
Adam White
4.7 rating
802 Reviews
99,327 Students
4 Courses
Leader in Reinforcement Learning and Artificial Intelligence
Adam White is an Assistant Professor in the Department of Computing Sciences at the University of Alberta and a Senior Research Scientist at DeepMind. His research centers on artificial intelligence, specifically on replicating or simulating human-level intelligence in both physical and simulated agents. Adam's work explores how intelligence can be modeled through reinforcement learning agents that interact with unknown environments, learning from scalar reward signals rather than explicit feedback. He has taught courses on Reinforcement Learning and Artificial Intelligence at both the University of Alberta and Indiana University, contributing to the education of future leaders in AI. As a Principal Investigator of the Reinforcement Learning and Artificial Intelligence Lab (RLAI) and a fellow at the Alberta Machine Intelligence Institute (AMII), Adam is dedicated to advancing the field through innovative research and practical applications. His contributions include developing new algorithms for reinforcement learning, creating the RL-Glue framework for reinforcement learning experiments, and demonstrating learning capabilities in mobile robots. Outside of academia, Adam enjoys playing Gaelic football and exploring the natural world, reflecting his diverse interests beyond his professional pursuits.
Testimonials
Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.
4.8 course rating
1,225 ratings
Frequently asked questions
Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.