ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Support
Quality
Security
License
Reuse
An open source robotics benchmark for meta- and multi-task reinforcement learning
Support
Quality
Security
License
Reuse
Implementations of selected inverse reinforcement learning algorithms.
Support
Quality
Security
License
Reuse
This is the official implementation of Multi-Agent PPO (MAPPO).
Support
Quality
Security
License
Reuse
Library for Model Based RL
Support
Quality
Security
License
Reuse
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Support
Quality
Security
License
Reuse
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
Support
Quality
Security
License
Reuse
Clean PyTorch implementations of imitation and reward learning algorithms
Support
Quality
Security
License
Reuse
Deep Reinforcement Learning with pytorch & visdom
Support
Quality
Security
License
Reuse
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Support
Quality
Security
License
Reuse
Reinforcement learning environments with musculoskeletal models
Support
Quality
Security
License
Reuse
Gibson Environments: Real-World Perception for Embodied Agents
Support
Quality
Security
License
Reuse
Code for "Learning to summarize from human feedback"
Support
Quality
Security
License
Reuse
H
Hands-On-Reinforcement-Learning-With-Pythonby sudharsan13296
Jupyter Notebook 
767
Version:Current
License: No License (No License)
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Support
Quality
Security
License
Reuse
Some basic examples of playing with RL
Support
Quality
Security
License
Reuse
Scalable Multi-Agent RL Training School for Autonomous Driving
Support
Quality
Security
License
Reuse
Gym for multi-agent reinforcement learning
Support
Quality
Security
License
Reuse
Mastering Atari with Discrete World Models
Support
Quality
Security
License
Reuse
Code for the paper "Emergent Complexity via Multi-agent Competition"
Support
Quality
Security
License
Reuse
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Support
Quality
Security
License
Reuse
Deep Reinforcement Learning For Sequence to Sequence Models
Support
Quality
Security
License
Reuse
m
multi-task-learning-exampleby yaringal
Jupyter Notebook 
745
Version:Current
License: Permissive (MIT)
A multi-task learning example for the paper https://arxiv.org/abs/1705.07115
Support
Quality
Security
License
Reuse
P
Popular-RL-Algorithmsby quantumiracle
Jupyter Notebook 
744
Version:Current
License: Permissive (Apache-2.0)
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Support
Quality
Security
License
Reuse
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
Support
Quality
Security
License
Reuse
PlayGround: AI Research into Multi-Agent Learning.
Support
Quality
Security
License
Reuse
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
Support
Quality
Security
License
Reuse
A PyTorch Platform for Distributed RL
Support
Quality
Security
License
Reuse
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Support
Quality
Security
License
Reuse
D
David-Silver-Reinforcement-learningby dalmia
Jupyter Notebook 
710
Version:Current
License: Permissive (MIT)
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
Support
Quality
Security
License
Reuse
A fast and differentiable model predictive control (MPC) solver for PyTorch.
Support
Quality
Security
License
Reuse
Python library for Reinforcement Learning.
Support
Quality
Security
License
Reuse
Build agents which are controlled by LLMs
Support
Quality
Security
License
Reuse
H
Hands-On-Reinforcement-Learning-with-Pythonby PacktPublishing
Jupyter Notebook 
674
Version:Current
License: Permissive (MIT)
Hands-On Reinforcement Learning with Python, published by Packt
Support
Quality
Security
License
Reuse
Environment for reinforcement-learning algorithmic trading models
Support
Quality
Security
License
Reuse
P
PythonLinearNonlinearControlby Shunichi09
Python 
656
Version:Current
License: No License (No License)
PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.
Support
Quality
Security
License
Reuse
Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind
Support
Quality
Security
License
Reuse
BabyAGI: an Autonomous and Self-Improving agent, or BASI
Support
Quality
Security
License
Reuse
A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
Support
Quality
Security
License
Reuse
Python implementations of contextual bandits algorithms
Support
Quality
Security
License
Reuse
Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"
Support
Quality
Security
License
Reuse
Decision Intelligence Platform for Autonomous Driving simulation.
Support
Quality
Security
License
Reuse
Hearthstone simulator using C++ with some reinforcement learning
Support
Quality
Security
License
Reuse
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Support
Quality
Security
License
Reuse
Python library to decode StarCraft II replay protocols
Support
Quality
Security
License
Reuse
Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
Support
Quality
Security
License
Reuse
A PyTorch library for building deep reinforcement learning agents.
Support
Quality
Security
License
Reuse
PyTorch implementation of soft actor critic
Support
Quality
Security
License
Reuse
r
reinforcement-learning-algorithmsby TianhongDai
Python 
580
Version:Current
License: No License (No License)
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Support
Quality
Security
License
Reuse
High throughput synchronous and asynchronous reinforcement learning
Support
Quality
Security
License
Reuse
Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"
Support
Quality
Security
License
Reuse
c
chatarenaby chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Python
818
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
m
metaworldby rlworkgroup
An open source robotics benchmark for meta- and multi-task reinforcement learning
Python
816
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
I
Inverse-Reinforcement-Learningby MatthewJA
Implementations of selected inverse reinforcement learning algorithms.
Python
808
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
o
on-policyby marlbenchmark
This is the official implementation of Multi-Agent PPO (MAPPO).
Python
805
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
mbrl-libby facebookresearch
Library for Model Based RL
Python
805
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
S
Super-mario-bros-A3C-pytorchby uvipen
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Python
801
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
R
Reinforceby qqiang00
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
Jupyter Notebook
796
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
i
imitationby HumanCompatibleAI
Clean PyTorch implementations of imitation and reward learning algorithms
Python
785
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
pytorch-rlby jingweiz
Deep Reinforcement Learning with pytorch & visdom
Python
784
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
transfuserby autonomousvision
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Python
784
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
o
osim-rlby stanfordnmbl
Reinforcement learning environments with musculoskeletal models
Python
778
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
G
GibsonEnvby StanfordVL
Gibson Environments: Real-World Perception for Embodied Agents
C
776
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
summarize-from-feedbackby openai
Code for "Learning to summarize from human feedback"
Python
775
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
H
Hands-On-Reinforcement-Learning-With-Pythonby sudharsan13296
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Jupyter Notebook
767
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
R
RLexampleby cuhkrlcourse
Some basic examples of playing with RL
Python
766
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
S
SMARTSby huawei-noah
Scalable Multi-Agent RL Training School for Autonomous Driving
Python
759
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
P
PettingZooby PettingZoo-Team
Gym for multi-agent reinforcement learning
Python
757
Updated: 4 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
d
dreamerv2by danijar
Mastering Atari with Discrete World Models
Python
755
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
multiagent-competitionby openai
Code for the paper "Emergent Complexity via Multi-agent Competition"
Python
746
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
v
visual-pushing-graspingby andyzeng
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Python
745
Updated: 2 y ago
License: Permissive (BSD-2-Clause)
Support
Quality
Security
License
Reuse
R
RLSeq2Seqby yaserkl
Deep Reinforcement Learning For Sequence to Sequence Models
Python
745
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
multi-task-learning-exampleby yaringal
A multi-task learning example for the paper https://arxiv.org/abs/1705.07115
Jupyter Notebook
745
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
P
Popular-RL-Algorithmsby quantumiracle
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Jupyter Notebook
744
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
p
pytorch-maml-rlby tristandeleu
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
Python
722
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
playgroundby MultiAgentLearning
PlayGround: AI Research into Multi-Agent Learning.
Python
722
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
m
maroby microsoft
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
Python
716
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
torchbeastby facebookresearch
A PyTorch Platform for Distributed RL
Python
714
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
l
large-scale-curiosityby openai
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Python
712
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
D
David-Silver-Reinforcement-learningby dalmia
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
Jupyter Notebook
710
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
mpc.pytorchby locuslab
A fast and differentiable model predictive control (MPC) solver for PyTorch.
Python
693
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
mushroom-rlby MushroomRL
Python library for Reinforcement Learning.
Python
685
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
llm_agentsby mpaepper
Build agents which are controlled by LLMs
Python
680
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
H
Hands-On-Reinforcement-Learning-with-Pythonby PacktPublishing
Hands-On Reinforcement Learning with Python, published by Packt
Jupyter Notebook
674
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
gym-tradingby hackthemarket
Environment for reinforcement-learning algorithmic trading models
Jupyter Notebook
662
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
P
PythonLinearNonlinearControlby Shunichi09
PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.
Python
656
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
R
Replicating-DeepMindby kristjankorjus
Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind
C++
653
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
b
babyagi-asiby oliveirabruno01
BabyAGI: an Autonomous and Self-Improving agent, or BASI
Python
650
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
C
CityFlowby cityflow-project
A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
C++
646
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
c
contextualbanditsby david-cortes
Python implementations of contextual bandits algorithms
Python
636
Updated: 2 y ago
License: Permissive (BSD-2-Clause)
Support
Quality
Security
License
Reuse
N
Neural-SLAMby devendrachaplot
Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"
Python
628
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
D
DI-driveby opendilab
Decision Intelligence Platform for Autonomous Driving simulation.
Python
625
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
R
RosettaStoneby utilForever
Hearthstone simulator using C++ with some reinforcement learning
C++
616
Updated: 2 y ago
License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
s
slimevolleygymby hardmaru
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Python
612
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
s2protocolby Blizzard
Python library to decode StarCraft II replay protocols
Python
600
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
pybullet-gymby benelot
Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
Python
600
Updated: 4 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
a
autonomous-learning-libraryby cpnota
A PyTorch library for building deep reinforcement learning agents.
Python
594
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
pytorch-soft-actor-criticby pranz24
PyTorch implementation of soft actor critic
Python
586
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
r
reinforcement-learning-algorithmsby TianhongDai
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Python
580
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
s
sample-factoryby alex-petrenko
High throughput synchronous and asynchronous reinforcement learning
Python
574
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
f
fast_abs_rlby ChenRocks
Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"
Python
574
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse