github.com
alex-petrenko/sample-factory: High throughput asynchronous reinforcement learning
· High throughput asynchronous reinforcement learning - GitHub - alex-petrenko/sample-factory: High throughput asynchronous reinforcement learning · Shared by 4, including Matt Shaffer
github.com
dmksjfl/MCQ: Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
· Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) - GitHub - dmksjfl/MCQ: Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS... · Shared by 5, including Matt Shaffer
github.com
GitHub - lmzintgraf/varibad: Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
· Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) - GitHub - lmzintgraf/varibad: Implementation of VariBAD: A Very Good Method... · Shared by 5, including Matt Shaffer
github.com
GitHub - Farama-Foundation/Kabuki: Gymnasium for offline reinforcement learning
· Gymnasium for offline reinforcement learning. Contribute to Farama-Foundation/Kabuki development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
GitHub - yangchen1997/Multi-Agent-Reinforcement-Learning: PyTorch implements multi-agent reinforcement learning algorithms, including Qmix, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise…
· PyTorch implements multi-agent reinforcement learning algorithms, including Qmix, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG. - GitHub - yang... · Shared by 5, including Matt Shaffer
github.com
iassael/learning-to-communicate: Learning to Communicate with Deep Multi-Agent Reinforcement Learning
· Learning to Communicate with Deep Multi-Agent Reinforcement Learning - GitHub - iassael/learning-to-communicate: Learning to Communicate with Deep Multi-Agent Reinforcement Learning · Shared by 4, including Matt Shaffer
github.com
YanjieZe/rl3d: Code for "Visual Reinforcement Learning with Self-Supervised 3D Representations"
· Code for "Visual Reinforcement Learning with Self-Supervised 3D Representations" - GitHub - YanjieZe/rl3d: Code for "Visual Reinforcement Learning with Self-Supervised 3D Representat... · Shared by 5, including Matt Shaffer
github.com
HaozhiQi/hora: In-Hand Object Rotation via Rapid Motor Adaptation
· In-Hand Object Rotation via Rapid Motor Adaptation - GitHub - HaozhiQi/hora: In-Hand Object Rotation via Rapid Motor Adaptation · Shared by 5, including Matt Shaffer
github.com
webЯcade: Feed-driven gaming
· Feed-driven gaming. Contribute to webrcade/webrcade development by creating an account on GitHub. · Shared by 8, including Matt Shaffer
github.com
kevinzakka/obj2mjcf: A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive format
· A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive format - GitHub - kevinzakka/obj2mjcf: A CLI for processing composite Wavefront OBJ files into a MuJoCo-conducive format · Shared by 5, including Matt Shaffer
github.com
GitHub - google-research/reincarnating_rl: Open source code for reusing prior computational work in RL.
· Open source code for reusing prior computational work in RL. - GitHub - google-research/reincarnating_rl: Open source code for reusing prior computational work in RL. · Shared by 6, including Matt Shaffer
github.com
GitHub - younggyoseo/apv
· Contribute to younggyoseo/apv development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
reiniscimurs/DRL-robot-navigation: Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3)…
· Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random g... · Shared by 5, including Matt Shaffer
github.com
Open Dynamic Robot Initiative
· Open-source hardware and software for torque-controlled robots - Open Dynamic Robot Initiative · Shared by 4, including Matt Shaffer
github.com
TakuyaHiraoka/Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning: Source files to replicate experiments in my ICLR 2022 paper.
· Source files to replicate experiments in my ICLR 2022 paper. - GitHub - TakuyaHiraoka/Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning: Source files to replicate experiments in my IC... · Shared by 5, including Matt Shaffer
github.com
liuzuxin/cvpo-safe-rl: Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)
· Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) - GitHub - liuzuxin/cvpo-safe-rl: Code for "Constrained Variational Policy Optimiza... · Shared by 5, including Matt Shaffer