github.com
aaronsnoswell/unimodal-irl: Algorithms for Uni-Modal Inverse Reinforcement Learning
· Algorithms for Uni-Modal Inverse Reinforcement Learning - GitHub - aaronsnoswell/unimodal-irl: Algorithms for Uni-Modal Inverse Reinforcement Learning · Shared by 4, including Matt Shaffer
github.com
kngwyu/Rainy: Deep RL agents with PyTorch
· :umbrella: Deep RL agents with PyTorch:umbrella: . Contribute to kngwyu/Rainy development by creating an account on GitHub. · Shared by 4, including Matt Shaffer
github.com
GitHub - EPFL-VILAB/palmer
· PALMER: Perception-Action Loop with Memory for Long-Horizon Planning, NeurIPS 2022 - GitHub - EPFL-VILAB/palmer: PALMER: Perception-Action Loop with Memory for Long-Horizon Planning, NeurIPS 2022 · Shared by 4, including Matt Shaffer
github.com
utiasDSL/safe-control-gym: PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
· PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL - GitHub - utiasDSL/safe-control-gym: PyBullet CartPole and Quadrotor environme... · Shared by 5, including Matt Shaffer
github.com
GitHub - tajwarfahim/proactive_interventions
· Codebase for NeurIPS 2022 paper, "When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning" - GitHub - tajwarfahim/proactive_interventions: Codebase for NeurIPS ... · Shared by 5, including Matt Shaffer
github.com
tengyangxie/garage: A toolkit for reproducible reinforcement learning research.
· A toolkit for reproducible reinforcement learning research. - GitHub - tengyangxie/garage: A toolkit for reproducible reinforcement learning research. · Shared by 5, including Matt Shaffer
github.com
hari-sikchi/AWAC: Advantage weighted Actor Critic for Offline RL
· Advantage weighted Actor Critic for Offline RL. Contribute to hari-sikchi/AWAC development by creating an account on GitHub. · Shared by 4, including Matt Shaffer
github.com
lucidrains/PaLM-rlhf-pytorch: Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
· Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM - GitHub - lucidrains/PaLM-rlhf-pytorch: Implementation of RLHF ... · Shared by 45, including Chris Heilmann codepo8@toot.cafe, AGI.Eth | MTL.Eth, Sebastian Raschka, Matt Shaffer, Nico Müller 🇺🇦, Helmi
github.com
openrlbenchmark/openrlbenchmark
· Contribute to openrlbenchmark/openrlbenchmark development by creating an account on GitHub. · Shared by 4, including Matt Shaffer
github.com
GitHub - zwbgood6/deform
· ICRA 2021 paper code: Deformable Linear Object Prediction Using Locally Latent Linear Dynamics. - GitHub - zwbgood6/deform: ICRA 2021 paper code: Deformable Linear Object Prediction Using Locally L... · Shared by 5, including Matt Shaffer
github.com
lindermanlab/S5
· Contribute to lindermanlab/S5 development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
lvwerra/trl: Train transformer language models with reinforcement learning.
· Train transformer language models with reinforcement learning. - GitHub - lvwerra/trl: Train transformer language models with reinforcement learning. · Shared by 13, including Matt Shaffer, Nico Müller 🇺🇦, Brian Roemmele
github.com
polixir/causal-mbrl: Toolkit of Causal Model-based Reinforcement Learning.
· Toolkit of Causal Model-based Reinforcement Learning. - GitHub - polixir/causal-mbrl: Toolkit of Causal Model-based Reinforcement Learning. · Shared by 6, including Matt Shaffer
github.com
facebookresearch/RRL: Code for RRL (https://sites.google.com/view/abstractions4rl)
· Code for RRL (https://sites.google.com/view/abstractions4rl) - GitHub - facebookresearch/RRL: Code for RRL (https://sites.google.com/view/abstractions4rl) · Shared by 6, including Matt Shaffer
github.com
GitHub - deepmind/mujoco_menagerie: A collection of high-quality models for the MuJoCo physics engine, curated by DeepMind.
· A collection of high-quality models for the MuJoCo physics engine, curated by DeepMind. - GitHub - deepmind/mujoco_menagerie: A collection of high-quality models for the MuJoCo physics engine, cura... · Shared by 57, including AGI.Eth | MTL.Eth, Matt Shaffer, Nando de Freitas 🏳️🌈, Mike Tamir, PhD
github.com
albertwilcox/mcac: Author implementation of Monte Carlo Augmented Actor Critic in PyTorch
· Author implementation of Monte Carlo Augmented Actor Critic in PyTorch - GitHub - albertwilcox/mcac: Author implementation of Monte Carlo Augmented Actor Critic in PyTorch · Shared by 5, including Matt Shaffer