Reinforcement Learning Libraries - Page 3

chatarenaby chatarena

Python 818 Version:Current
License: Permissive (Apache-2.0)

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Support

Quality

Security

License

Reuse

metaworldby rlworkgroup

Python 816 Version:Current
License: Permissive (MIT)

An open source robotics benchmark for meta- and multi-task reinforcement learning

Support

Quality

Security

License

Reuse

Inverse-Reinforcement-Learningby MatthewJA

Python 808 Version:Current
License: Permissive (MIT)

Implementations of selected inverse reinforcement learning algorithms.

Support

Quality

Security

License

Reuse

on-policyby marlbenchmark

Python 805 Version:Current
License: Permissive (MIT)

This is the official implementation of Multi-Agent PPO (MAPPO).

Support

Quality

Security

License

Reuse

mbrl-libby facebookresearch

Python 805 Version:Current
License: Permissive (MIT)

Library for Model Based RL

Support

Quality

Security

License

Reuse

Super-mario-bros-A3C-pytorchby uvipen

Python 801 Version:Current
License: Permissive (MIT)

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Support

Quality

Security

License

Reuse

Reinforceby qqiang00

Jupyter Notebook 796 Version:Current
License: No License (No License)

Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments

Support

Quality

Security

License

Reuse

imitationby HumanCompatibleAI

Python 785 Version:Current
License: Permissive (MIT)

Clean PyTorch implementations of imitation and reward learning algorithms

Support

Quality

Security

License

Reuse

pytorch-rlby jingweiz

Python 784 Version:Current
License: Permissive (MIT)

Deep Reinforcement Learning with pytorch & visdom

Support

Quality

Security

License

Reuse

transfuserby autonomousvision

Python 784 Version:Current
License: Permissive (MIT)

[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Support

Quality

Security

License

Reuse

osim-rlby stanfordnmbl

Python 778 Version:Current
License: Permissive (MIT)

Reinforcement learning environments with musculoskeletal models

Support

Quality

Security

License

Reuse

GibsonEnvby StanfordVL

C 776 Version:Current
License: Permissive (MIT)

Gibson Environments: Real-World Perception for Embodied Agents

Support

Quality

Security

License

Reuse

summarize-from-feedbackby openai

Python 775 Version:Current
License: Proprietary (Proprietary)

Code for "Learning to summarize from human feedback"

Support

Quality

Security

License

Reuse

Hands-On-Reinforcement-Learning-With-Pythonby sudharsan13296

Jupyter Notebook 767 Version:Current
License: No License (No License)

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Support

Quality

Security

License

Reuse

RLexampleby cuhkrlcourse

Python 766 Version:Current
License: No License (No License)

Some basic examples of playing with RL

Support

Quality

Security

License

Reuse

SMARTSby huawei-noah

Python 759 Version:Current
License: Permissive (MIT)

Scalable Multi-Agent RL Training School for Autonomous Driving

Support

Quality

Security

License

Reuse

PettingZooby PettingZoo-Team

Python 757 Version:Current
License: Proprietary (Proprietary)

Gym for multi-agent reinforcement learning

Support

Quality

Security

License

Reuse

dreamerv2by danijar

Python 755 Version:Current
License: Permissive (MIT)

Mastering Atari with Discrete World Models

Support

Quality

Security

License

Reuse

multiagent-competitionby openai

Python 746 Version:Current
License: No License (No License)

Code for the paper "Emergent Complexity via Multi-agent Competition"

Support

Quality

Security

License

Reuse

visual-pushing-graspingby andyzeng

Python 745 Version:Current
License: Permissive (BSD-2-Clause)

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

Support

Quality

Security

License

Reuse

RLSeq2Seqby yaserkl

Python 745 Version:Current
License: Permissive (MIT)

Deep Reinforcement Learning For Sequence to Sequence Models

Support

Quality

Security

License

Reuse

multi-task-learning-exampleby yaringal

Jupyter Notebook 745 Version:Current
License: Permissive (MIT)

A multi-task learning example for the paper https://arxiv.org/abs/1705.07115

Support

Quality

Security

License

Reuse

Popular-RL-Algorithmsby quantumiracle

Jupyter Notebook 744 Version:Current
License: Permissive (Apache-2.0)

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Support

Quality

Security

License

Reuse

pytorch-maml-rlby tristandeleu

Python 722 Version:Current
License: Permissive (MIT)

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

Support

Quality

Security

License

Reuse

playgroundby MultiAgentLearning

Python 722 Version:Current
License: Permissive (Apache-2.0)

PlayGround: AI Research into Multi-Agent Learning.

Support

Quality

Security

License

Reuse

maroby microsoft

Python 716 Version:Current
License: Permissive (MIT)

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Support

Quality

Security

License

Reuse

torchbeastby facebookresearch

Python 714 Version:Current
License: Permissive (Apache-2.0)

A PyTorch Platform for Distributed RL

Support

Quality

Security

License

Reuse

large-scale-curiosityby openai

Python 712 Version:Current
License: No License (No License)

Code for the paper "Large-Scale Study of Curiosity-Driven Learning"

Support

Quality

Security

License

Reuse

David-Silver-Reinforcement-learningby dalmia

Jupyter Notebook 710 Version:Current
License: Permissive (MIT)

Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

Support

Quality

Security

License

Reuse

Python 693 Version:Current
License: Permissive (MIT)

A fast and differentiable model predictive control (MPC) solver for PyTorch.

Support

Quality

Security

License

Reuse

mushroom-rlby MushroomRL

Python 685 Version:Current
License: Permissive (MIT)

Python library for Reinforcement Learning.

Support

Quality

Security

License

Reuse

llm_agentsby mpaepper

Python 680 Version:Current
License: Permissive (MIT)

Build agents which are controlled by LLMs

Support

Quality

Security

License

Reuse

Hands-On-Reinforcement-Learning-with-Pythonby PacktPublishing

Jupyter Notebook 674 Version:Current
License: Permissive (MIT)

Hands-On Reinforcement Learning with Python, published by Packt

Support

Quality

Security

License

Reuse

gym-tradingby hackthemarket

Jupyter Notebook 662 Version:Current
License: Permissive (MIT)

Environment for reinforcement-learning algorithmic trading models

Support

Quality

Security

License

Reuse

PythonLinearNonlinearControlby Shunichi09

Python 656 Version:Current
License: No License (No License)

PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.

Support

Quality

Security

License

Reuse

Replicating-DeepMindby kristjankorjus

C++ 653 Version:Current
License: Strong Copyleft (GPL-3.0)

Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind

Support

Quality

Security

License

Reuse

babyagi-asiby oliveirabruno01

Python 650 Version:Current
License: Permissive (MIT)

BabyAGI: an Autonomous and Self-Improving agent, or BASI

Support

Quality

Security

License

Reuse

CityFlowby cityflow-project

C++ 646 Version:Current
License: Permissive (Apache-2.0)

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Support

Quality

Security

License

Reuse

contextualbanditsby david-cortes

Python 636 Version:Current
License: Permissive (BSD-2-Clause)

Python implementations of contextual bandits algorithms

Support

Quality

Security

License

Reuse

Neural-SLAMby devendrachaplot

Python 628 Version:Current
License: Permissive (MIT)

Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"

Support

Quality

Security

License

Reuse

DI-driveby opendilab

Python 625 Version:Current
License: Permissive (Apache-2.0)

Decision Intelligence Platform for Autonomous Driving simulation.

Support

Quality

Security

License

Reuse

RosettaStoneby utilForever

C++ 616 Version:Current
License: Strong Copyleft (AGPL-3.0)

Hearthstone simulator using C++ with some reinforcement learning

Support

Quality

Security

License

Reuse

Python 612 Version:Current
License: Permissive (Apache-2.0)

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Support

Quality

Security

License

Reuse

s2protocolby Blizzard

Python 600 Version:Current
License: Permissive (MIT)

Python library to decode StarCraft II replay protocols

Support

Quality

Security

License

Reuse

Python 600 Version:Current
License: Proprietary (Proprietary)

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Support

Quality

Security

License

Reuse

autonomous-learning-libraryby cpnota

Python 594 Version:Current
License: Permissive (MIT)

A PyTorch library for building deep reinforcement learning agents.

Support

Quality

Security

License

Reuse

pytorch-soft-actor-criticby pranz24

Python 586 Version:Current
License: Permissive (MIT)

PyTorch implementation of soft actor critic

Support

Quality

Security

License

Reuse

reinforcement-learning-algorithmsby TianhongDai

Python 580 Version:Current
License: No License (No License)

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

Support

Quality

Security

License

Reuse

sample-factoryby alex-petrenko

Python 574 Version:Current
License: Permissive (MIT)

High throughput synchronous and asynchronous reinforcement learning

Support

Quality

Security

License

Reuse

fast_abs_rlby ChenRocks

Python 574 Version:Current
License: Permissive (MIT)

Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"

Support

Quality

Security

License

Reuse

chatarenaby chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python

818

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

metaworldby rlworkgroup

An open source robotics benchmark for meta- and multi-task reinforcement learning

Python

816

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Inverse-Reinforcement-Learningby MatthewJA

Implementations of selected inverse reinforcement learning algorithms.

Python

808

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

on-policyby marlbenchmark

This is the official implementation of Multi-Agent PPO (MAPPO).

Python

805

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mbrl-libby facebookresearch

Library for Model Based RL

Python

805

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Super-mario-bros-A3C-pytorchby uvipen

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Python

801

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Reinforceby qqiang00

Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments

Jupyter Notebook

796

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

imitationby HumanCompatibleAI

Clean PyTorch implementations of imitation and reward learning algorithms

Python

785

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pytorch-rlby jingweiz

Deep Reinforcement Learning with pytorch & visdom

Python

784

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

transfuserby autonomousvision

[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Python

784

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

osim-rlby stanfordnmbl

Reinforcement learning environments with musculoskeletal models

Python

778

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

GibsonEnvby StanfordVL

Gibson Environments: Real-World Perception for Embodied Agents

776

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

summarize-from-feedbackby openai

Code for "Learning to summarize from human feedback"

Python

775

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Hands-On-Reinforcement-Learning-With-Pythonby sudharsan13296

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Jupyter Notebook

767

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

RLexampleby cuhkrlcourse

Some basic examples of playing with RL

Python

766

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

SMARTSby huawei-noah

Scalable Multi-Agent RL Training School for Autonomous Driving

Python

759

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PettingZooby PettingZoo-Team

Gym for multi-agent reinforcement learning

Python

757

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

dreamerv2by danijar

Mastering Atari with Discrete World Models

Python

755

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

multiagent-competitionby openai

Code for the paper "Emergent Complexity via Multi-agent Competition"

Python

746

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

visual-pushing-graspingby andyzeng

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

Python

745

Updated: 2 y ago

License: Permissive (BSD-2-Clause)

Support

Quality

Security

License

Reuse

RLSeq2Seqby yaserkl

Deep Reinforcement Learning For Sequence to Sequence Models

Python

745

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

multi-task-learning-exampleby yaringal

A multi-task learning example for the paper https://arxiv.org/abs/1705.07115

Jupyter Notebook

745

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Popular-RL-Algorithmsby quantumiracle

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook

744

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

pytorch-maml-rlby tristandeleu

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

Python

722

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

playgroundby MultiAgentLearning

PlayGround: AI Research into Multi-Agent Learning.

Python

722

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

maroby microsoft

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Python

716

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

torchbeastby facebookresearch

A PyTorch Platform for Distributed RL

Python

714

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

large-scale-curiosityby openai

Code for the paper "Large-Scale Study of Curiosity-Driven Learning"

Python

712

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

David-Silver-Reinforcement-learningby dalmia

Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

Jupyter Notebook

710

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mpc.pytorchby locuslab

A fast and differentiable model predictive control (MPC) solver for PyTorch.

Python

693

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mushroom-rlby MushroomRL

Python library for Reinforcement Learning.

Python

685

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

llm_agentsby mpaepper

Build agents which are controlled by LLMs

Python

680

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Hands-On-Reinforcement-Learning-with-Pythonby PacktPublishing

Hands-On Reinforcement Learning with Python, published by Packt

Jupyter Notebook

674

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

gym-tradingby hackthemarket

Environment for reinforcement-learning algorithmic trading models

Jupyter Notebook

662

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PythonLinearNonlinearControlby Shunichi09

PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.

Python

656

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Replicating-DeepMindby kristjankorjus

Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind

C++

653

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

babyagi-asiby oliveirabruno01

BabyAGI: an Autonomous and Self-Improving agent, or BASI

Python

650

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

CityFlowby cityflow-project

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

C++

646

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

contextualbanditsby david-cortes

Python implementations of contextual bandits algorithms

Python

636

Updated: 2 y ago

License: Permissive (BSD-2-Clause)

Support

Quality

Security

License

Reuse

Neural-SLAMby devendrachaplot

Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"

Python

628

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

DI-driveby opendilab

Decision Intelligence Platform for Autonomous Driving simulation.

Python

625

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

RosettaStoneby utilForever

Hearthstone simulator using C++ with some reinforcement learning

C++

616

Updated: 2 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

slimevolleygymby hardmaru

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Python

612

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

s2protocolby Blizzard

Python library to decode StarCraft II replay protocols

Python

600

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pybullet-gymby benelot

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Python

600

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

autonomous-learning-libraryby cpnota

A PyTorch library for building deep reinforcement learning agents.

Python

594

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pytorch-soft-actor-criticby pranz24

PyTorch implementation of soft actor critic

Python

586

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

reinforcement-learning-algorithmsby TianhongDai

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

Python

580

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sample-factoryby alex-petrenko

High throughput synchronous and asynchronous reinforcement learning

Python

574

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

fast_abs_rlby ChenRocks

Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"

Python

574

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Reinforcement Learning Libraries - Page 3

chatarenaby chatarena

Python 818 Version:Current License: Permissive (Apache-2.0)

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

metaworldby rlworkgroup

Python 816 Version:Current License: Permissive (MIT)

An open source robotics benchmark for meta- and multi-task reinforcement learning

Inverse-Reinforcement-Learningby MatthewJA

Python 808 Version:Current License: Permissive (MIT)

Implementations of selected inverse reinforcement learning algorithms.

on-policyby marlbenchmark

Python 805 Version:Current License: Permissive (MIT)

This is the official implementation of Multi-Agent PPO (MAPPO).

mbrl-libby facebookresearch

Python 805 Version:Current License: Permissive (MIT)

Library for Model Based RL

Super-mario-bros-A3C-pytorchby uvipen

Python 801 Version:Current License: Permissive (MIT)

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Reinforceby qqiang00

Jupyter Notebook 796 Version:Current License: No License (No License)

Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments

imitationby HumanCompatibleAI

Python 785 Version:Current License: Permissive (MIT)

Clean PyTorch implementations of imitation and reward learning algorithms

pytorch-rlby jingweiz

Python 784 Version:Current License: Permissive (MIT)

Deep Reinforcement Learning with pytorch & visdom

transfuserby autonomousvision

Python 784 Version:Current License: Permissive (MIT)

[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

osim-rlby stanfordnmbl

Python 778 Version:Current License: Permissive (MIT)

Reinforcement learning environments with musculoskeletal models

GibsonEnvby StanfordVL

C 776 Version:Current License: Permissive (MIT)

Gibson Environments: Real-World Perception for Embodied Agents

summarize-from-feedbackby openai

Python 775 Version:Current License: Proprietary (Proprietary)

Code for "Learning to summarize from human feedback"

Hands-On-Reinforcement-Learning-With-Pythonby sudharsan13296

Jupyter Notebook 767 Version:Current License: No License (No License)

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

RLexampleby cuhkrlcourse

Python 766 Version:Current License: No License (No License)

Some basic examples of playing with RL

SMARTSby huawei-noah

Python 759 Version:Current License: Permissive (MIT)

Scalable Multi-Agent RL Training School for Autonomous Driving

PettingZooby PettingZoo-Team

Python 757 Version:Current License: Proprietary (Proprietary)

Gym for multi-agent reinforcement learning

dreamerv2by danijar

Python 755 Version:Current License: Permissive (MIT)

Mastering Atari with Discrete World Models

multiagent-competitionby openai

Python 746 Version:Current License: No License (No License)

Code for the paper "Emergent Complexity via Multi-agent Competition"

visual-pushing-graspingby andyzeng

Python 745 Version:Current License: Permissive (BSD-2-Clause)

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

RLSeq2Seqby yaserkl

Python 745 Version:Current License: Permissive (MIT)

Deep Reinforcement Learning For Sequence to Sequence Models

multi-task-learning-exampleby yaringal

Jupyter Notebook 745 Version:Current License: Permissive (MIT)

A multi-task learning example for the paper https://arxiv.org/abs/1705.07115

Popular-RL-Algorithmsby quantumiracle

Jupyter Notebook 744 Version:Current License: Permissive (Apache-2.0)

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

pytorch-maml-rlby tristandeleu

Python 722 Version:Current License: Permissive (MIT)

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

playgroundby MultiAgentLearning

Python 722 Version:Current License: Permissive (Apache-2.0)

PlayGround: AI Research into Multi-Agent Learning.

maroby microsoft

Python 716 Version:Current License: Permissive (MIT)

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

torchbeastby facebookresearch

Python 818 Version:Current
License: Permissive (Apache-2.0)

Python 816 Version:Current
License: Permissive (MIT)

Python 808 Version:Current
License: Permissive (MIT)

Python 805 Version:Current
License: Permissive (MIT)

Python 805 Version:Current
License: Permissive (MIT)

Python 801 Version:Current
License: Permissive (MIT)

Jupyter Notebook 796 Version:Current
License: No License (No License)

Python 785 Version:Current
License: Permissive (MIT)

Python 784 Version:Current
License: Permissive (MIT)

Python 784 Version:Current
License: Permissive (MIT)

Python 778 Version:Current
License: Permissive (MIT)

C 776 Version:Current
License: Permissive (MIT)

Python 775 Version:Current
License: Proprietary (Proprietary)

Jupyter Notebook 767 Version:Current
License: No License (No License)

Python 766 Version:Current
License: No License (No License)

Python 759 Version:Current
License: Permissive (MIT)

Python 757 Version:Current
License: Proprietary (Proprietary)

Python 755 Version:Current
License: Permissive (MIT)

Python 746 Version:Current
License: No License (No License)

Python 745 Version:Current
License: Permissive (BSD-2-Clause)

Python 745 Version:Current
License: Permissive (MIT)

Jupyter Notebook 745 Version:Current
License: Permissive (MIT)

Jupyter Notebook 744 Version:Current
License: Permissive (Apache-2.0)

Python 722 Version:Current
License: Permissive (MIT)

Python 722 Version:Current
License: Permissive (Apache-2.0)

Python 716 Version:Current
License: Permissive (MIT)

Python 714 Version:Current
License: Permissive (Apache-2.0)

Python 712 Version:Current
License: No License (No License)

Jupyter Notebook 710 Version:Current
License: Permissive (MIT)

Python 693 Version:Current
License: Permissive (MIT)

Python 685 Version:Current
License: Permissive (MIT)

Python 680 Version:Current
License: Permissive (MIT)

Jupyter Notebook 674 Version:Current
License: Permissive (MIT)

Jupyter Notebook 662 Version:Current
License: Permissive (MIT)

Python 656 Version:Current
License: No License (No License)

C++ 653 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 650 Version:Current
License: Permissive (MIT)

C++ 646 Version:Current
License: Permissive (Apache-2.0)

Python 636 Version:Current
License: Permissive (BSD-2-Clause)

Python 628 Version:Current
License: Permissive (MIT)

Python 625 Version:Current
License: Permissive (Apache-2.0)

C++ 616 Version:Current
License: Strong Copyleft (AGPL-3.0)

Python 612 Version:Current
License: Permissive (Apache-2.0)

Python 600 Version:Current
License: Permissive (MIT)

Python 600 Version:Current
License: Proprietary (Proprietary)

Python 594 Version:Current
License: Permissive (MIT)

Python 586 Version:Current
License: Permissive (MIT)

Python 580 Version:Current
License: No License (No License)

Python 574 Version:Current
License: Permissive (MIT)

Python 574 Version:Current
License: Permissive (MIT)