10+ Best Articles on Reinforcement Learning

The most useful articles on reinforcement learning from around the web, curated by thought leaders and our community.

Refind focuses on timeless pieces and updates the list whenever new, must-read articles or videos are discovered.

What is reinforcement learning?

New to reinforcement learning? These articles make an excellent introduction.

What is reinforcement learning? How AI trains itself

VentureBeat

7 min

Reinforcement learning is the subset of ML by which an algorithm can be programmed to respond to complex environments for optimal results.

Introduction to Various Reinforcement Learning Algorithms

Data Science Central

2 min

This article was written by Steeve Huang. Reinforcement Learning (RL) refers to a kind of Machine Learning method in which the agent receives a delayed rewar…

Short Articles

Short on time? Check out these useful short articles on reinforcement learning—all under 10 minutes.

Is DeepMind’s new reinforcement learning system a step toward general AI?

bdtechtalks.com

8 min

DeepMind has released a new paper that shows impressive advances in reinforcement learning. How far does it bring us toward general AI?

«The key advantage of reinforcement learning is its ability to develop behavior by taking actions and getting feedback, similar to the way humans and animals learn by interacting with their environment»

DeepMind scientists: Reinforcement learning is enough for general AI

bdtechtalks.com

8 min

In a new paper, scientists at DeepMind suggest that reward maximization and reinforcement learning are enough to develop artificial general intelligence.

«intelligence and its associated abilities will emerge not from formulating and solving complicated problems but by sticking to a simple but powerful principle: reward maximization.»

Introducing Google Research Football: A Novel Reinforcement Learning Environment

Google AI

4 min

Posted by Karol Kurach, Research Lead, and Olivier Bachem, Research Scientist, Google Research, Zürich. The goal of reinforcement learning...

Reinforcement Learning with Prediction-Based Rewards

OpenAI

8 min

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.

Google releases open source reinforcement learning framework for training AI models

VentureBeat

2 min

Google's releasing a reinforcement learning framework that makes it easier to train AI models with cutting-edge techniques.

Long Articles

These are some of the most-read long-form articles on reinforcement learning.

Discovering faster matrix multiplication algorithms with reinforcement learning

Nature

20+ min

Paywall possible

A reinforcement learning approach based on AlphaZero is used to discover efficient and provably correct algorithms for matrix multiplication, finding faster algorithms for a variety of matrix sizes.

Faster sorting algorithms discovered using deep reinforcement learning

Nature

20+ min

Paywall possible

Artificial intelligence goes beyond the current state of the art by discovering unknown, faster sorting algorithms as a single-player game using a deep reinforcement learning agent. These algorithms…

RLHF: Reinforcement Learning from Human Feedback

huyenchip.com

18+ min

A narrative that is often glossed over in the demo frenzy is about the incredible technical creativity that went into making models like ChatGPT work. One such cool idea is RLHF: incorporating…

Reinforcement Learning 101

Towards Data Science

10 min

Paywall possible

Learn the essentials of Reinforcement Learning!

Reinforcement learning’s foundational flaw

The Gradient

15+ min

By definition, learning from scratch is just about the least sample-efficient approach there can be.

What is Refind?

Every day Refind picks the most relevant links from around the web for you. Picking only a handful of links means focusing on what’s relevant and useful.

How does Refind curate?

It’s a mix of human and algorithmic curation, following a number of steps:

1. We monitor 10k+ sources and 1k+ thought leaders on hundreds of topics: publications, blogs, news sites, newsletters, Substack, Medium, Twitter, etc.
2. In addition, our users save links from around the web using our Save buttons and our extensions.
3. Our algorithm processes 100k+ new links every day and uses external signals to find the most relevant ones, focusing on timeless pieces.
4. Our community of active users gets the most relevant links every day, tailored to their interests. They provide feedback via implicit and explicit signals: open, read, listen, share, mark as read, read later, «More/less like this», etc.
5. Our algorithm uses these internal signals to refine the selection.
6. In addition, we have expert curators who manually curate niche topics.

The result: lists of the best and most useful articles on hundreds of topics.

How does Refind detect «timeless» pieces?

We focus on pieces with long shelf-lives—not news. We determine «timelessness» via a number of metrics, for example, the consumption pattern of links over time.
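
As a rough illustration only (not Refind's actual method, which is not published here), one way to read «consumption pattern over time» is to ask what share of a link's reads happen long after it was published. The function name `late_read_share`, the 30-day window, and the sample dates below are all assumptions made for this sketch.

```python
from datetime import date


def late_read_share(published: date, read_dates: list[date], window_days: int = 30) -> float:
    """Return the fraction of reads that occur more than `window_days` after publication.

    Hypothetical metric: a high share suggests the piece keeps being read
    well after its release, which is one possible sign of a long shelf-life.
    """
    if not read_dates:
        return 0.0
    late = sum(1 for d in read_dates if (d - published).days > window_days)
    return late / len(read_dates)


# Made-up example data: a news-like link read mostly in its first days...
news_like = late_read_share(
    date(2023, 1, 1),
    [date(2023, 1, 1), date(2023, 1, 2), date(2023, 1, 3), date(2023, 2, 15)],
)

# ...and an evergreen-like link that is still read months later.
evergreen_like = late_read_share(
    date(2023, 1, 1),
    [date(2023, 1, 2), date(2023, 3, 1), date(2023, 6, 1), date(2023, 9, 1)],
)

print(news_like, evergreen_like)
```

In this toy data the evergreen-like link scores higher than the news-like one. A real system would presumably combine several such metrics rather than rely on a single threshold.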

How many sources does Refind monitor?

We monitor 10k+ content sources on hundreds of topics—publications, blogs, news sites, newsletters, Substack, Medium, Twitter, etc.

Can I submit a link?

Indirectly, by using Refind and saving links from outside (e.g., via our extensions).

How can I report a problem?

When you’re logged in, you can flag any link via the «More» (...) menu. You can also report problems via email to hello@refind.com.

Who uses Refind?

450k+ smart people start their day with Refind. To learn something new. To get inspired. To move forward. Our apps have a 4.9/5 rating.

Is Refind free?

Yes, it’s free!

How can I sign up?

Head over to our homepage (https://refind.com) and sign up by email or with your Twitter or Google account.

Keep Learning

Get the big picture on your favorite topics.