Other
[FreeCoursesOnline Me] Coursera - Practical Reinforcement Learning
Torrent info
Name:[FreeCoursesOnline Me] Coursera - Practical Reinforcement Learning
Infohash: 31B47A1285DF93A33F1C80A563FD43B322FC434D
Total Size: 1.41 GB
Magnet: Magnet Download
Seeds: 0
Leechers: 0
Stream: Watch Full Movies @ LimeMovies
Last Updated: 2026-01-19 16:47:10 (Update Now)
Torrent added: 2018-09-28 10:34:46
Torrent Files List
001.Welcome (Size: 1.41 GB) (Files: 111)
001.Welcome
001. Why should you care.mp4
001. Why should you care.srt
002. Reinforcement learning vs all.mp4
002. Reinforcement learning vs all.srt
002.Reinforcement Learning
003. Multi-armed bandit.mp4
003. Multi-armed bandit.srt
004. Decision process & applications.mp4
004. Decision process & applications.srt
003.Black box optimization
005. Markov Decision Process.mp4
005. Markov Decision Process.srt
006. Crossentropy method.mp4
006. Crossentropy method.srt
007. Approximate crossentropy method.mp4
007. Approximate crossentropy method.srt
008. More on approximate crossentropy method.mp4
008. More on approximate crossentropy method.srt
004.All the cool stuff that isn't in the base track
009. Evolution strategies core idea.mp4
009. Evolution strategies core idea.srt
010. Evolution strategies math problems.mp4
010. Evolution strategies math problems.srt
011. Evolution strategies log-derivative trick.mp4
011. Evolution strategies log-derivative trick.srt
012. Evolution strategies duct tape.mp4
012. Evolution strategies duct tape.srt
013. Blackbox optimization drawbacks.mp4
013. Blackbox optimization drawbacks.srt
005.Striving for reward
014. Reward design.mp4
014. Reward design.srt
006.Bellman equations
015. State and Action Value Functions.mp4
015. State and Action Value Functions.srt
016. Measuring Policy Optimality.mp4
016. Measuring Policy Optimality.srt
007.Generalized Policy Iteration
017. Policy evaluation & improvement.mp4
017. Policy evaluation & improvement.srt
018. Policy and value iteration.mp4
018. Policy and value iteration.srt
008.Model-free learning
019. Model-based vs model-free.mp4
019. Model-based vs model-free.srt
020. Monte-Carlo & Temporal Difference; Q-learning.mp4
020. Monte-Carlo & Temporal Difference; Q-learning.srt
021. Exploration vs Exploitation.mp4
021. Exploration vs Exploitation.srt
022. Footnote Monte-Carlo vs Temporal Difference.mp4
022. Footnote Monte-Carlo vs Temporal Difference.srt
009.On-policy vs off-policy
023. Accounting for exploration. Expected Value SARSA..mp4
023. Accounting for exploration. Expected Value SARSA..srt
010.Experience Replay
024. On-policy vs off-policy; Experience replay.mp4
024. On-policy vs off-policy; Experience replay.srt
011.Limitations of Tabular Methods
025. Supervised & Reinforcement Learning.mp4
025. Supervised & Reinforcement Learning.srt
026. Loss functions in value based RL.mp4
026. Loss functions in value based RL.srt
027. Difficulties with Approximate Methods.mp4
027. Difficulties with Approximate Methods.srt
012.Case Study Deep Q-Network
028. DQN bird's eye view.mp4
028. DQN bird's eye view.srt
029. DQN the internals.mp4
029. DQN the internals.srt
013.Honor
030. DQN statistical issues.mp4
030. DQN statistical issues.srt
031. Double Q-learning.mp4
031. Double Q-learning.srt
032. More DQN tricks.mp4
032. More DQN tricks.srt
033. Partial observability.mp4
033. Partial observability.srt
014.Policy-based RL vs Value-based RL
034. Intuition.mp4
034. Intuition.srt
035. All Kinds of Policies.mp4
035. All Kinds of Policies.srt
036. Policy gradient formalism.mp4
036. Policy gradient formalism.srt
037. The log-derivative trick.mp4
037. The log-derivative trick.srt
015.REINFORCE
038. REINFORCE.mp4
038. REINFORCE.srt
016.Actor-critic
039. Advantage actor-critic.mp4
039. Advantage actor-critic.srt
040. Duct tape zone.mp4
040. Duct tape zone.srt
041. Policy-based vs Value-based.mp4
041. Policy-based vs Value-based.srt
042. Case study A3C.mp4
042. Case study A3C.srt
043. A3C case study (2 2).mp4
043. A3C case study (2 2).srt
044. Combining supervised & reinforcement learning.mp4
044. Combining supervised & reinforcement learning.srt
017.Measuting exploration
045. Recap bandits.mp4
045. Recap bandits.srt
046. Regret measuring the quality of exploration.mp4
046. Regret measuring the quality of exploration.srt
047. The message just repeats. 'Regret, Regret, Regret.'.mp4
047. The message just repeats. 'Regret, Regret, Regret.'.srt
018.Uncertainty-based exploration
048. Intuitive explanation.mp4
048. Intuitive explanation.srt
049. Thompson Sampling.mp4
049. Thompson Sampling.srt
050. Optimism in face of uncertainty.mp4
050. Optimism in face of uncertainty.srt
051. UCB-1.mp4
051. UCB-1.srt
052. Bayesian UCB.mp4
052. Bayesian UCB.srt
019.Planning with Monte Carlo Tree Search
053. Introduction to planning.mp4
053. Introduction to planning.srt
054. Monte Carlo Tree Search.mp4
054. Monte Carlo Tree Search.srt
[FreeCoursesOnline.Me].url
[FreeTutorials.Us].url
[FTU Forum].url
tracker
leech seedsTorrent description
Feel free to post any comments about this torrent, including links to Subtitle, samples, screenshots, or any other relevant information, Watch [FreeCoursesOnline Me] Coursera - Practical Reinforcement Learning Online Free Full Movies Like 123Movies, Putlockers, Fmovies, Netflix or Download Direct via Magnet Link in Torrent Details.
related torrents
Torrent name
health leech seeds Size








