Deep Reinforcement Learning in Python Tutorial – A Course on How to Implement Deep Learning Papers

///Deep Reinforcement Learning in Python Tutorial – A Course on How to Implement Deep Learning Papers

Deep Reinforcement Learning in Python Tutorial – A Course on How to Implement Deep Learning Papers

FavoriteLoadingAdd to favorites

In this intermediate deep learning tutorial, you will learn how to go from reading a paper on deep deterministic policy gradients to implementing the concepts in Tensorflow. This process can be applied to any deep learning paper, not just deep reinforcement learning.

In the second part, you will learn how to code a deep deterministic policy gradient (DDPG) agent using Python and PyTorch, to beat the continuous lunar lander environment (a classic machine learning problem).

DDPG combines the best of Deep Q Learning and Actor Critic Methods into an algorithm that can solve environments with continuous action spaces. We will have an actor network that learns the (deterministic) policy, coupled with a critic network to learn the action-value functions. We will make use of a replay buffer to maximize sample efficiency, as well as target networks to assist in algorithm convergence and stability.

🎥 Course created by Phil Tabor. Check out his YouTube channel:

⭐️ Course Contents ⭐️
⌨️ (0:00:00) Introduction
⌨️ (0:04:58) How to Implement Deep Learning Papers
⌨️ (1:59:00) Deep Deterministic Policy Gradients are Easy in Pytorch

Learn to code for free and get a developer job:

Read hundreds of articles on programming:

And subscribe for new videos on technology every day:

source

By |2020-04-24T03:13:01+00:00April 24th, 2020|Python Video Tutorials|24 Comments

24 Comments

  1. TH April 24, 2020 at 3:13 am - Reply

    Why not use an IDE to see typos before running it?

  2. Where did you get your brain?

  3. Hossein Haeri April 24, 2020 at 3:13 am - Reply

    At 43:01, you say: " i is each element of that minibatch transitions" which is wrong. i is just the index of the reply memory, i.e. state i+1 follows after state i.
    And thanks for your great explanation. helped me a lot.

  4. Debayon Dhar Chowdhury April 24, 2020 at 3:13 am - Reply

    Where is the code of this video?

  5. masoud masoumi moghaddam April 24, 2020 at 3:13 am - Reply

    This is actually a video tutorial with so much academic quality.
    I am really amazed by this video and ability to implement a paper in this pace.
    Would plz keep up your good job?
    Thanks bro.

  6. Fktu diablo April 24, 2020 at 3:13 am - Reply

    2:45:50 Michael Jackson still alive guys

  7. Liu daniel April 24, 2020 at 3:13 am - Reply

    Hi, i have run the code, but it did not converge at all. So I waana to know your hyperparameter's setting. Thanks a lot =-=

  8. Gabriel Sinca April 24, 2020 at 3:13 am - Reply

    Python is bloat.

  9. vandan gorade April 24, 2020 at 3:13 am - Reply

    Please make more videos on implementing research papers on your channel😃

  10. Hejar Shahabi April 24, 2020 at 3:13 am - Reply

    I really like to learn python, and I have a question what is this video about? cause I didn't get anything

  11. Mort Kebab April 24, 2020 at 3:13 am - Reply

    Excellent camera quality – the only problem is that the programmed color changes of the PC lights cause the color adjustment of the camera to change.

  12. Omar ElKhatib April 24, 2020 at 3:13 am - Reply

    thanks please more like those read scientific papers .

  13. Boris the Blade April 24, 2020 at 3:13 am - Reply

    Phil youre a fucking legend

  14. Dev Isle April 24, 2020 at 3:13 am - Reply

    Awesome! Looking to learn more and post on my channel.

  15. Gunesh Shanbhag April 24, 2020 at 3:13 am - Reply

    Its very advanced for me i guess (still watched watched for 20mins) … Hope to get some advice from phil for beginners… To really reach to a level of implementing papers….any advice on learning road path would be helpful. Have subscribed to your channel also.Thanks Phil. 🙂

  16. Henry AI Labs April 24, 2020 at 3:13 am - Reply

    Thank you! This is an incredible Reinforcement Learning tutorial!

  17. No Reason Channel April 24, 2020 at 3:13 am - Reply

    Reminded of Sherlock's assistant.

  18. Deepak S.m. April 24, 2020 at 3:13 am - Reply

    Thank u for this video 😀

    Are u a brother of Bucky Roberts (thenewboston) ?

  19. It Works Channel April 24, 2020 at 3:13 am - Reply

    Bunch of thanks 🙏

  20. Alvin Kuruvilla April 24, 2020 at 3:13 am - Reply

    This looks awesome
    But I have a question, do you think this could be applied to website fingerprinting, where an ai is able to figure out the port and server information as you visit a website to make sure it's secure?

  21. Hüseyin Yıldız April 24, 2020 at 3:13 am - Reply

    Thanks soo much you Bro 😉 New Magicians in Future They are Coders,Programmers,Engineers and All Designers.

  22. interesting..!!

  23. xo xo April 24, 2020 at 3:13 am - Reply

    Awesome! This is what I was waiting for. Super thank you!
    ( ö )/

  24. Arunarka Mukhopadhyay April 24, 2020 at 3:13 am - Reply

    Thank you!!

Leave A Comment

*