102 saves · From 2018 · June 24, 2018 note: If you want to cite an example from the post, please cite the paper which that example came from. If you want to cite the post as a whole, you can use the following BibTeX:
2 min read · 25 saves · 2019-09-12 · Andrea Manero-Bastin on September 9, 2019 at 12:30am
This article was written by Steeve Huang. Reinforcement Learning (RL) refers to a kind of Machine Learning method in which the agent receives a delayed rewar…
7 min read · 60 saves · From 2018 · We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.