This presentation was recorded at GOTOpia November 2020. #GOTOcon #GOTOpia http://gotopia.eu Phil Winder - Author of "Reinforcement Learning" & CEO of Winder Research ABSTRACT Reinforcement learning (RL) is lined up to become the hottest new artificial intelligence paradigm in the next few years. Building upon machine learning, reinforcement learning has the potential to automate strategic-level thinking in industry. In this presentation I present a code-driven introduction to RL, where you will explore a fundamental framework called the Markov decision process (MDP) and learn how to build an RL algorithm to solve it. First I show you how to create a simple “GridWorld” simulation of the MDP, from the ground up, to help demonstrate why and how RL works. Then I derive a simple RL algorithm that’s capable of solving your simulation. Finally I will provide actionable next steps to show you how to take this learning and apply it to industry. This presentation includes a Jupyter notebook that you can tinker with during the presentation. Full instructions will be provided. Although this presentation is suitable for beginners [...] TIMECODES 00:00 Intro 02:53 Agenda 03:08 What is Reinforcement Learning (RL)? 08:52 Coding the MDP 10:35 Coding the RL solution 29:50 Next steps 31:53 Outro Download slides and read the full abstract here: https://gotopia.eu/november-2020/sessions/1648/a-code-driven-introduction-to-reinforcement-learning RECOMMENDED BOOK Phil Winder • Reinforcement Learning • https://amzn.to/3t1S1VZ https://twitter.com/GOTOcon https://www.linkedin.com/company/goto- https://www.facebook.com/GOTOConferences #ReinforcementLearning #RL #AI #ML #ArtificialIntelligence #MachineLearning #DataScience #JupyterNotebook #MDP #RLAlgorithm #GridWorld Looking for a unique learning experience? Attend the next GOTO conference near you! Get your ticket at https://gotopia.tech SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily. https://www.youtube.com/user/GotoConferences/?sub_confirmation=1
Get notified about new features and conference additions.