10+ Best Articles on Reinforcement Learning
The most useful articles on reinforcement learning from around the web—beginners to advanced—curated by thought leaders and our community. We focus on timeless pieces and update the list whenever we discover new, must-read articles or videos—make sure to bookmark and revisit this page.
Top 5 Reinforcement Learning Articles
At a glance: these are the articles that have been most read, shared, and saved on reinforcement learning by Refind users in 2023 so far.
- Is DeepMind’s new reinforcement learning system a step toward general AI?
- Discovering faster matrix multiplication algorithms with reinforcement learning
- DeepMind scientists: Reinforcement learning is enough for general AI
- RLHF: Reinforcement Learning from Human Feedback
- What is reinforcement learning? How AI trains itself
What is ...?
New to #reinforcement learning? These articles make an excellent introduction.
What is reinforcement learning? How AI trains itself
Reinforcement learning is the subset of ML by which an algorithm can be programmed to respond to complex environments for optimal results.
Introduction to Various Reinforcement Learning Algorithms
This article was written by Steeve Huang. Reinforcement Learning (RL) refers to a kind of Machine Learning method in which the agent receives a delayed rewar…
Trending
These links are currently making the rounds on reinforcement learning on Refind.
RLHF: Reinforcement Learning from Human Feedback
A narrative that is often glossed over in the demo frenzy is about the incredible technical creativity that went into making models like ChatGPT work. One such cool idea is RLHF: incorporating…
Short Articles
Short on time? Check out these useful short articles on reinforcement learning—all under 10 minutes.
Is DeepMind’s new reinforcement learning system a step toward general AI?
DeepMind has released a new paper that shows impressive advances in reinforcement learning. How far does it bring us toward general AI?
«The key advantage of reinforcement learning is its ability to develop behavior by taking actions and getting feedback, similar to the way humans and animals learn by interacting with their environment»
DeepMind scientists: Reinforcement learning is enough for general AI
In a new paper, scientists at DeepMind suggest that reward maximization and reinforcement learning are enough to develop artificial general intelligence.
«intelligence and its associated abilities will emerge not from formulating and solving complicated problems but by sticking to a simple but powerful principle: reward maximization.»
Introducing Google Research Football: A Novel Reinforcement Learning Environment
Posted by Karol Kurach, Research Lead and Olivier Bachem, Research Scientist, Google Research, Zürich The goal of reinforcement learning...
Reinforcement Learning with Prediction-Based Rewards
We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.
Google releases open source reinforcement learning framework for training AI models
Google's releasing a reinforcement learning framework that makes it easier to train AI models with cutting-edge techniques.
Long Articles
These are some of the most-read long-form articles on reinforcement learning.
Discovering faster matrix multiplication algorithms with reinforcement learning
A reinforcement learning approach based on AlphaZero is used to discover efficient and provably correct algorithms for matrix multiplication, finding faster algorithms for a variety of matrix sizes.
Reinforcement Learning from scratch
Inspired by a great tutorial at O’Reilly AI
Reinforcement learning’s foundational flaw
By definition, learning from scratch is just about the least sample-efficient approach there can be.
Deep Reinforcement Learning Doesn't Work Yet
June 24, 2018 note: If you want to cite an example from the post, please cite the paper which that example came from. If you want to cite the post as a whole, you can use the following BibTeX:
Reinforcement Learning: A Deep Dive
This is a deep dive into deep reinforcement learning. We will tackle a concrete problem with modern libraries such as TensorFlow, TensorBoard, Keras, and OpenAI Gym. You will learn how to implement…
Publications
We monitor hundreds of publications, blogs, newsletters, and news sources in Reinforcement Learning, including:
Toptal
Toptal is a network of the world’s top talent in business, design, and technology that enables companies to scale their teams, on demand.
Data Science Central
Part of the DSC Community and TechTarget, our focus is on data science, ML, AI, deep learning, dataviz, Hadoop, IoT and BI.
VentureBeat
Obsessed with covering transformative technology.
Google AI
Google AI is focused on bringing the benefits of AI to everyone. In conducting and applying our research, we advance the state-of-the-art in many domains.
nature
Research, News, and Commentary from Nature, the international journal of science. For daily science news, get Nature Briefing: http://go.nature.com/naturebriefing
What is Refind?
Every day Refind picks the most relevant links from around the web for you. Picking only a handful of links means focusing on what’s relevant and useful. We favor timeless pieces—links with long shelf-lives, articles that are still relevant one month, one year, or even ten years from now. These lists of the best resources on any topic are the result of years of careful curation.
How does Refind curate?
It’s a mix of human and algorithmic curation, following a number of steps:
- We monitor 10k+ sources and 1k+ thought leaders on hundreds of topics—publications, blogs, news sites, newsletters, Substack, Medium, Twitter, etc.
- In addition, our users save links from around the web using our Save buttons and our extensions.
- Our algorithm processes 100k+ new links every day and uses external signals to find the most relevant ones, focusing on timeless pieces.
- Our community of active users gets the most relevant links every day, tailored to their interests. They provide feedback via implicit and explicit signals: open, read, listen, share, mark as read, read later, «More/less like this», etc.
- Our algorithm uses these internal signals to refine the selection.
- In addition, we have expert curators who manually curate niche topics.
The result: lists of the best and most useful articles on hundreds of topics.
How does Refind detect «timeless» pieces?
We focus on pieces with long shelf-lives—not news. We determine «timelessness» via a number of metrics, for example, the consumption pattern of links over time.
How many sources does Refind monitor?
We monitor 10k+ content sources on hundreds of topics—publications, blogs, news sites, newsletters, Substack, Medium, Twitter, etc.
Which sources does Refind monitor on reinforcement learning?
We monitor hundreds of sources on reinforcement learning, including Toptal, Data Science Central, VentureBeat, Google AI, nature, and many more.
Can I submit a link?
Indirectly, by using Refind and saving links from outside (e.g., via our extensions).
How can I report a problem?
When you’re logged-in, you can flag any link via the «More» (...) menu. You can also report problems via email to hello@refind.com
Who uses Refind?
200k+ smart people start their day with Refind. To learn something new. To get inspired. To move forward. Our apps have a 4.9/5 rating.
Is Refind free?
Yes, it’s free!
How can I sign up?
Head over to our homepage and sign up by email or with your Twitter or Google account.