APP下载
反馈
加州大学伯克利分校 2017:深度增强学习课程
本课程共57集 翻译完 欢迎学习

课程介绍:https://www.youtube.com/playlist?list=PLkFD6_40KJIwTmSbCv9OVJB3YaO4sFwkX CS294-112 Deep Reinforcement Learning Sp17 课程主页:http://rll.berkeley.edu/deeprlcourse/

立即播放
用手机看
课程免费缓存,随时观看~
扫码下载网易公开课APP
收藏
课程列表
【第13集】Learning policies by imitating optimal controllers (Levine)(上)
【第14集】Learning policies by imitating optimal controllers (Levine)(中)
【第15集】Learning policies by imitating optimal controllers (Levine)(下)
【第16集】RL definitions, value iteration, policy iteration (Schulman)(上)
【第17集】RL definitions, value iteration, policy iteration (Schulman)(中)
【第18集】RL definitions, value iteration, policy iteration (Schulman)(下)
【第22集】Learning Q-functions: Q-learning, SARSA, and others (Schulman)(上)
【第23集】Learning Q-functions: Q-learning, SARSA, and others (Schulman)(中)
【第24集】Learning Q-functions: Q-learning, SARSA, and others (Schulman)(下)
【第25集】Advanced Q-learning: replay buffers, target networks, double Q-learning (Sc(上)
【第26集】Advanced Q-learning: replay buffers, target networks, double Q-learning (Sc(中)
【第27集】Advanced Q-learning: replay buffers, target networks, double Q-learning (Sc(下)
【第31集】Inverse RL: acquiring objectives from demonstration (Finn)(上)
【第32集】Inverse RL: acquiring objectives from demonstration (Finn)(中)
【第33集】Inverse RL: acquiring objectives from demonstration (Finn)(下)
【第34集】Advanced policy gradients: natural gradient and TRPO (Schulman)(上)
【第35集】Advanced policy gradients: natural gradient and TRPO (Schulman)(中)
【第36集】Advanced policy gradients: natural gradient and TRPO (Schulman)(下)
【第37集】Policy gradient variance reduction and actor-critic algorithms (Schulman)(上)
【第38集】Policy gradient variance reduction and actor-critic algorithms (Schulman)(中)
【第39集】Policy gradient variance reduction and actor-critic algorithms (Schulman)(下)
【第40集】Summary of policy gradients and temporal difference methods (Schulman)(上)
【第41集】Summary of policy gradients and temporal difference methods (Schulman)(中)
【第42集】Summary of policy gradients and temporal difference methods (Schulman)(下)
【第46集】Parallel RL algorithms, open problems and challenges in deep reinforcement(上)
【第47集】Parallel RL algorithms, open problems and challenges in deep reinforcement(中)
【第48集】Parallel RL algorithms, open problems and challenges in deep reinforcement(下)
【第52集】Neural Architecture Search with Reinforcement Learning: Quoc Le and Barret Z(上)
【第53集】Neural Architecture Search with Reinforcement Learning: Quoc Le and Barret Z(中)
【第54集】Neural Architecture Search with Reinforcement Learning: Quoc Le and Barret Z(下)
【第55集】Generalization and Safety in Reinforcement Learning and Control: Aviv Tamar(上)
【第56集】Generalization and Safety in Reinforcement Learning and Control: Aviv Tamar(中)
【第57集】Generalization and Safety in Reinforcement Learning and Control: Aviv Tamar(下)
查看全部课程
相关推荐
12:41
1-严重暴力犯罪心理解析(上)
3.6万播放
13:56
侵权责任法 第二十讲(上)
1488播放
03:31
脑洞!数以千计的行星和太阳大战巨型...
821播放
08:14
第24节_元素定位-xpath
1140播放
03:55
什么是摄影里的糖水片 它有什么优缺...
1404播放
01:45
这三句话,教你夸对孩子 !
704播放
00:11
动作名字:华尔兹大环舞
1452播放
10:34
内力公式在叠块模型中的应用2(下)
805播放
00:40
血压高头昏脑胀,菊花茶里加两物
1482播放
02:30
清洁工将十条毒蛇,放进富豪的游泳池
938播放
08:32
0.1 质点运动学(上)
1018播放
09:12
08 【判断推理】第8讲_Bili...
664播放
30:29
2018法考-刑法119必背01-...
5615播放