Code · Reinforcement Learning

You landed on page 2.
Start from the top?
github.com
alex-petrenko/sample-factory: High throughput asynchronous reinforcement learningNov 2022 · High throughput asynchronous reinforcement learning - GitHub - alex-petrenko/sample-factory: High throughput asynchronous reinforcement learning · Shared by 4, including Matt Shaffer
github.com
dmksjfl/MCQ: Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)Nov 2022 · Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) - GitHub - dmksjfl/MCQ: Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS... · Shared by 5, including Matt Shaffer
github.com
GitHub - lmzintgraf/varibad: Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-LearningNov 2022 · Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) - GitHub - lmzintgraf/varibad: Implementation of VariBAD: A Very Good Method... · Shared by 5, including Matt Shaffer
github.com
GitHub - Farama-Foundation/Kabuki: Gymnasium for offline reinforcement learningOct 2022 · Gymnasium for offline reinforcement learning. Contribute to Farama-Foundation/Kabuki development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
GitHub - yangchen1997/Multi-Agent-Reinforcement-Learning: PyTorch implements multi-agent reinforcement learning algorithms, including Qmix, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise…Oct 2022 · PyTorch implements multi-agent reinforcement learning algorithms, including Qmix, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG. - GitHub - yang... · Shared by 5, including Matt Shaffer
github.com
iassael/learning-to-communicate: Learning to Communicate with Deep Multi-Agent Reinforcement LearningOct 2022 · Learning to Communicate with Deep Multi-Agent Reinforcement Learning - GitHub - iassael/learning-to-communicate: Learning to Communicate with Deep Multi-Agent Reinforcement Learning · Shared by 4, including Matt Shaffer
github.com
YanjieZe/rl3d: Code for "Visual Reinforcement Learning with Self-Supervised 3D Representations"Oct 2022 · Code for "Visual Reinforcement Learning with Self-Supervised 3D Representations" - GitHub - YanjieZe/rl3d: Code for "Visual Reinforcement Learning with Self-Supervised 3D Representat... · Shared by 5, including Matt Shaffer
github.com
HaozhiQi/hora: In-Hand Object Rotation via Rapid Motor AdaptationOct 2022 · In-Hand Object Rotation via Rapid Motor Adaptation - GitHub - HaozhiQi/hora: In-Hand Object Rotation via Rapid Motor Adaptation · Shared by 5, including Matt Shaffer
github.com
webЯcade: Feed-driven gamingFrom 2022 · Feed-driven gaming. Contribute to webrcade/webrcade development by creating an account on GitHub. · Shared by 8, including Matt Shaffer
github.com
kevinzakka/obj2mjcf: A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive formatOct 2022 · A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive format - GitHub - kevinzakka/obj2mjcf: A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive format · Shared by 5, including Matt Shaffer
github.com
GitHub - google-research/reincarnating_rl: Open source code for reusing prior computational work in RL.Sep 2022 · Open source code for reusing prior computational work in RL. - GitHub - google-research/reincarnating_rl: Open source code for reusing prior computational work in RL. · Shared by 6, including Matt Shaffer
github.com
GitHub - younggyoseo/apvSep 2022 · Contribute to younggyoseo/apv development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
reiniscimurs/DRL-robot-navigation: Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3)…Aug 2022 · Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random g... · Shared by 5, including Matt Shaffer
github.com
Open Dynamic Robot InitiativeAug 2022 · Open-source hardware and software for torque-controlled robots - Open Dynamic Robot Initiative · Shared by 4, including Matt Shaffer
github.com
TakuyaHiraoka/Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning: Source files to replicate experiments in my ICLR 2022 paper.Aug 2022 · Source files to replicate experiments in my ICLR 2022 paper. - GitHub - TakuyaHiraoka/Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning: Source files to replicate experiments in my IC... · Shared by 5, including Matt Shaffer
github.com
liuzuxin/cvpo-safe-rl: Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)Aug 2022 · Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) - GitHub - liuzuxin/cvpo-safe-rl: Code for "Constrained Variational Policy Optimiza... · Shared by 5, including Matt Shaffer
Code · Reinforcement Learning

Share Code · Reinforcement Learning