chatarena by chatarena
ChatArena (or Chat Arena) provides Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Python 818 Updated: 2 y ago License: Permissive (Apache-2.0)
metaworld by rlworkgroup
An open source robotics benchmark for meta- and multi-task reinforcement learning
Python 816 Updated: 2 y ago License: Permissive (MIT)
Inverse-Reinforcement-Learning by MatthewJA
Implementations of selected inverse reinforcement learning algorithms.
Python 808 Updated: 2 y ago License: Permissive (MIT)
on-policy by marlbenchmark
This is the official implementation of Multi-Agent PPO (MAPPO).
Python 805 Updated: 2 y ago License: Permissive (MIT)
mbrl-lib by facebookresearch
Library for Model Based RL
Python 805 Updated: 2 y ago License: Permissive (MIT)
Super-mario-bros-A3C-pytorch by uvipen
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Python 801 Updated: 4 y ago License: Permissive (MIT)
Reinforce by qqiang00
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
Jupyter Notebook 796 Updated: 2 y ago License: No License (No License)
imitation by HumanCompatibleAI
Clean PyTorch implementations of imitation and reward learning algorithms
Python 785 Updated: 2 y ago License: Permissive (MIT)
pytorch-rl by jingweiz
Deep Reinforcement Learning with PyTorch & visdom
Python 784 Updated: 2 y ago License: Permissive (MIT)
transfuser by autonomousvision
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Python 784 Updated: 2 y ago License: Permissive (MIT)
osim-rl by stanfordnmbl
Reinforcement learning environments with musculoskeletal models
Python 778 Updated: 4 y ago License: Permissive (MIT)
GibsonEnv by StanfordVL
Gibson Environments: Real-World Perception for Embodied Agents
C 776 Updated: 2 y ago License: Permissive (MIT)
summarize-from-feedback by openai
Code for "Learning to summarize from human feedback"
Python 775 Updated: 2 y ago License: Proprietary (Proprietary)
Hands-On-Reinforcement-Learning-With-Python by sudharsan13296
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Jupyter Notebook 767 Updated: 2 y ago License: No License (No License)
RLexample by cuhkrlcourse
Some basic examples of playing with RL
Python 766 Updated: 3 y ago License: No License (No License)
SMARTS by huawei-noah
Scalable Multi-Agent RL Training School for Autonomous Driving
Python 759 Updated: 2 y ago License: Permissive (MIT)
PettingZoo by PettingZoo-Team
Gym for multi-agent reinforcement learning
Python 757 Updated: 3 y ago License: Proprietary (Proprietary)
dreamerv2 by danijar
Mastering Atari with Discrete World Models
Python 755 Updated: 2 y ago License: Permissive (MIT)
multiagent-competition by openai
Code for the paper "Emergent Complexity via Multi-agent Competition"
Python 746 Updated: 2 y ago License: No License (No License)
visual-pushing-grasping by andyzeng
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Python 745 Updated: 2 y ago License: Permissive (BSD-2-Clause)
RLSeq2Seq by yaserkl
Deep Reinforcement Learning For Sequence to Sequence Models
Python 745 Updated: 2 y ago License: Permissive (MIT)
multi-task-learning-example by yaringal
A multi-task learning example for the paper https://arxiv.org/abs/1705.07115
Jupyter Notebook 745 Updated: 2 y ago License: Permissive (MIT)
Popular-RL-Algorithms by quantumiracle
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Jupyter Notebook 744 Updated: 2 y ago License: Permissive (Apache-2.0)
pytorch-maml-rl by tristandeleu
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
Python 722 Updated: 2 y ago License: Permissive (MIT)
playground by MultiAgentLearning
PlayGround: AI Research into Multi-Agent Learning.
Python 722 Updated: 2 y ago License: Permissive (Apache-2.0)
maro by microsoft
The Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
Python 716 Updated: 2 y ago License: Permissive (MIT)
torchbeast by facebookresearch
A PyTorch Platform for Distributed RL
Python 714 Updated: 2 y ago License: Permissive (Apache-2.0)
large-scale-curiosity by openai
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Python 712 Updated: 4 y ago License: No License (No License)
David-Silver-Reinforcement-learning by dalmia
Notes for the Reinforcement Learning course by David Silver, along with implementations of various algorithms.
Jupyter Notebook 710 Updated: 2 y ago License: Permissive (MIT)
mpc.pytorch by locuslab
A fast and differentiable model predictive control (MPC) solver for PyTorch.
Python 693 Updated: 2 y ago License: Permissive (MIT)
mushroom-rl by MushroomRL
Python library for Reinforcement Learning.
Python 685 Updated: 2 y ago License: Permissive (MIT)
llm_agents by mpaepper
Build agents which are controlled by LLMs
Python 680 Updated: 2 y ago License: Permissive (MIT)
Hands-On-Reinforcement-Learning-with-Python by PacktPublishing
Hands-On Reinforcement Learning with Python, published by Packt
Jupyter Notebook 674 Updated: 2 y ago License: Permissive (MIT)
gym-trading by hackthemarket
Environment for reinforcement-learning algorithmic trading models
Jupyter Notebook 662 Updated: 2 y ago License: Permissive (MIT)
PythonLinearNonlinearControl by Shunichi09
PythonLinearNonLinearControl is a library implementing linear and nonlinear control theories in Python.
Python 656 Updated: 2 y ago License: No License (No License)
Replicating-DeepMind by kristjankorjus
Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind
C++ 653 Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
babyagi-asi by oliveirabruno01
BabyAGI: an Autonomous and Self-Improving agent, or BASI
Python 650 Updated: 2 y ago License: Permissive (MIT)
CityFlow by cityflow-project
A Multi-Agent Reinforcement Learning Environment for Large-Scale City Traffic Scenarios
C++ 646 Updated: 2 y ago License: Permissive (Apache-2.0)
contextualbandits by david-cortes
Python implementations of contextual bandits algorithms
Python 636 Updated: 2 y ago License: Permissive (BSD-2-Clause)
Neural-SLAM by devendrachaplot
PyTorch code for the ICLR-20 paper "Learning to Explore using Active Neural SLAM"
Python 628 Updated: 2 y ago License: Permissive (MIT)
DI-drive by opendilab
Decision Intelligence Platform for Autonomous Driving simulation.
Python 625 Updated: 2 y ago License: Permissive (Apache-2.0)
RosettaStone by utilForever
Hearthstone simulator using C++ with some reinforcement learning
C++ 616 Updated: 2 y ago License: Strong Copyleft (AGPL-3.0)
slimevolleygym by hardmaru
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Python 612 Updated: 2 y ago License: Permissive (Apache-2.0)
s2protocol by Blizzard
Python library to decode StarCraft II replay protocols
Python 600 Updated: 2 y ago License: Permissive (MIT)
pybullet-gym by benelot
Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
Python 600 Updated: 4 y ago License: Proprietary (Proprietary)
autonomous-learning-library by cpnota
A PyTorch library for building deep reinforcement learning agents.
Python 594 Updated: 2 y ago License: Permissive (MIT)
pytorch-soft-actor-critic by pranz24
PyTorch implementation of soft actor critic
Python 586 Updated: 2 y ago License: Permissive (MIT)
reinforcement-learning-algorithms by TianhongDai
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
Python 580 Updated: 2 y ago License: No License (No License)
sample-factory by alex-petrenko
High throughput synchronous and asynchronous reinforcement learning
Python 574 Updated: 2 y ago License: Permissive (MIT)
fast_abs_rl by ChenRocks
Code for the ACL 2018 paper "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting" by Chen and Bansal
Python 574 Updated: 4 y ago License: Permissive (MIT)