A Complete Reinforcement Learning System (Capstone)

Start date: 04/22/2022 Duration: Unknown

Platform: CourseraArchive

Course category: Other

University or institution: CourseraNew

Course homepage: https://www.coursera.org/archive/complete-reinforcement-learning-system

Course details

In this final course, you will put together your knowledge from Courses 1, 2 and 3 to implement a complete RL solution to a problem. This capstone will let you see how each component (problem formulation, algorithm selection, parameter selection and representation design) fits together into a complete solution, and how to make appropriate choices when deploying RL in the real world. This project will require you to implement both the environment to simulate your problem, and a control agent with neural network function approximation. In addition, you will conduct a scientific study of your learning system to develop your ability to assess the robustness of RL agents. To use RL in the real world, it is critical to (a) appropriately formalize the problem as an MDP, (b) select appropriate algorithms, (c) identify which choices in your implementation will have large impacts on performance and (d) validate the expected behaviour of your algorithms. This capstone is valuable for anyone who is planning on using RL to solve real problems. To be successful in this course, you will need to have completed Courses 1, 2, and 3 of this Specialization or the equivalent. By the end of this course, you will be able to complete an RL solution to a problem, from problem formulation through appropriate algorithm selection and implementation to an empirical study of the solution's effectiveness.

Course syllabus

Milestone 1: Formalize Word Problem as MDP
Milestone 2: Choosing The Right Algorithm
Milestone 3: Identify Key Performance Parameters
Milestone 4: Implement Your Agent