Get Bonus Downloads Here.url
0.18 KB
~Get Your Files Here !
01 - Introduction
001 Introduction.html
0.07 KB
002 Reinforcement Learning series.html
0.68 KB
003 Google Colab.mp4
5.77 MB
003 Google Colab_en.vtt
1.75 KB
004 Where to begin.html
0.07 KB
02 - Refresher The Markov Decision Process (MDP)
001 Elements common to all control tasks.mp4
38.74 MB
001 Elements common to all control tasks_en.vtt
5.98 KB
002 The Markov decision process (MDP).mp4
25.10 MB
002 The Markov decision process (MDP)_en.vtt
5.62 KB
003 Types of Markov decision process.mp4
8.68 MB
003 Types of Markov decision process_en.vtt
2.16 KB
004 Trajectory vs episode.mp4
4.94 MB
004 Trajectory vs episode_en.vtt
1.09 KB
005 Reward vs Return.mp4
5.29 MB
005 Reward vs Return_en.vtt
1.60 KB
006 Discount factor.mp4
14.77 MB
006 Discount factor_en.vtt
4.07 KB
007 Policy.mp4
7.41 MB
007 Policy_en.vtt
2.08 KB
008 State values v(s) and action values q(s,a).mp4
4.28 MB
008 State values v(s) and action values q(s,a)_en.vtt
1.17 KB
009 Bellman equations.mp4
12.41 MB
009 Bellman equations_en.vtt
2.99 KB
010 Solving a Markov decision process.mp4
14.14 MB
010 Solving a Markov decision process_en.vtt
3.18 KB
03 - Refresher Monte Carlo methods
001 Monte Carlo methods.mp4
13.73 MB
001 Monte Carlo methods_en.vtt
3.34 KB
002 Solving control tasks with Monte Carlo methods.mp4
23.79 MB
002 Solving control tasks with Monte Carlo methods_en.vtt
7.04 KB
003 On-policy Monte Carlo control.mp4
20.44 MB
003 On-policy Monte Carlo control_en.vtt
4.56 KB
04 - Refresher Temporal difference methods
001 Temporal difference methods.mp4
12.62 MB
001 Temporal difference methods_en.vtt
3.59 KB
002 Solving control tasks with temporal difference methods.mp4
14.52 MB
002 Solving control tasks with temporal difference methods_en.vtt
3.61 KB
003 Monte Carlo vs temporal difference methods.mp4
8.87 MB
003 Monte Carlo vs temporal difference methods_en.vtt
1.60 KB
004 SARSA.mp4
17.77 MB
004 SARSA_en.vtt
3.90 KB
005 Q-Learning.mp4
11.08 MB
005 Q-Learning_en.vtt
2.53 KB
006 Advantages of temporal difference methods.mp4
3.71 MB
006 Advantages of temporal difference methods_en.vtt
1.15 KB
05 - Refresher N-step bootstrapping
001 N-step temporal difference methods.mp4
12.51 MB
001 N-step temporal difference methods_en.vtt
3.36 KB
002 Where do n-step methods fit.mp4
11.15 MB
002 Where do n-step methods fit_en.vtt
2.65 KB
003 Effect of changing n.mp4
28.01 MB
003 Effect of changing n_en.vtt
4.64 KB
06 - Refresher Brief introduction to Neural Networks
001 Function approximators.mp4
36.32 MB
001 Function approximators_en.vtt
8.57 KB
002 Artificial Neural Networks.mp4
24.35 MB
002 Artificial Neural Networks_en.vtt
3.88 KB
003 Artificial Neurons.mp4
25.64 MB
003 Artificial Neurons_en.vtt
5.82 KB
004 How to represent a Neural Network.mp4
38.16 MB
004 How to represent a Neural Network_en.vtt
7.27 KB
005 Stochastic Gradient Descent.mp4
49.84 MB
005 Stochastic Gradient Descent_en.vtt
6.40 KB
006 Neural Network optimization.mp4
23.39 MB
006 Neural Network optimization_en.vtt
4.40 KB
07 - Refresher REINFORCE
001 Policy gradient methods.mp4
21.65 MB
001 Policy gradient methods_en.vtt
4.74 KB
002 Representing policies using neural networks.mp4
27.76 MB
002 Representing policies using neural networks_en.vtt
5.19 KB
003 Policy performance.mp4
8.52 MB
003 Policy performance_en.vtt
2.57 KB
004 The policy gradient theorem.mp4
15.88 MB
004 The policy gradient theorem_en.vtt
3.84 KB
005 REINFORCE.mp4
13.24 MB
005 REINFORCE_en.vtt
4.15 KB
006 Parallel learning.mp4
12.34 MB
006 Parallel learning_en.vtt
3.57 KB
007 Entropy regularization.mp4
23.15 MB
007 Entropy regularization_en.vtt
6.63 KB
008 REINFORCE 2.mp4
10.89 MB
008 REINFORCE 2_en.vtt
2.36 KB
08 - PyTorch Lightning
001 PyTorch Lightning.mp4
32.01 MB
001 PyTorch Lightning_en.vtt
9.27 KB
002 Link to the code notebook.html
0.07 KB
09 - REINFORCE for continuous control tasks
001 REINFORCE for continuous action spaces.html
0.07 KB
10 - Advantage Actor Critic (A2C)
001 A2C.mp4
50.09 MB
001 A2C_en.vtt
10.59 KB
11 - Generalized Advantage Estimation (GAE)
001 Generalized Advantage Estimation.html
0.07 KB
12 - Proximal Policy Optimization (PPO)
001 Proximal Policy Optimization.html
0.07 KB
13 - Phasic PPO
001 Phasic PPO.html
0.07 KB
Bonus Resources.txt
0.38 KB
Feel free to post any comments about this torrent, including links to Subtitle, samples, screenshots, or any other relevant information, Watch [ FreeCourseWeb com ] Udemy - Advanced Reinforcement Learning - policy gradient methods Online Free Full Movies Like 123Movies, Putlockers, Fmovies, Netflix or Download Direct via Magnet Link in Torrent Details.