Reinforcement Learning Libraries - Page 2

PGPortfolioby ZhengyaoJiang

Python 1588 Version:Current
License: Strong Copyleft (GPL-3.0)

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Support

Quality

Security

License

Reuse

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutionsby LyWangPX

Jupyter Notebook 1585 Version:Current
License: Permissive (MIT)

Solutions of Reinforcement Learning, An Introduction

Support

Quality

Security

License

Reuse

gym-minigridby Farama-Foundation

Python 1563 Version:Current
License: Permissive (Apache-2.0)

Minimalistic gridworld package for OpenAI Gym

Support

Quality

Security

License

Reuse

rlkitby vitchyr

Python 1560 Version:Current
License: Permissive (MIT)

Collection of reinforcement learning algorithms

Support

Quality

Security

License

Reuse

evolution-strategies-starterby openai

Python 1505 Version:Current
License: Permissive (MIT)

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Support

Quality

Security

License

Reuse

snakeby chuyangliu

Python 1495 Version:Current
License: Permissive (Apache-2.0)

Artificial intelligence for the Snake game.

Support

Quality

Security

License

Reuse

pymarlby oxwhirl

Python 1471 Version:Current
License: Permissive (Apache-2.0)

Python Multi-Agent Reinforcement Learning framework

Support

Quality

Security

License

Reuse

Python 1458 Version:Current
License: Permissive (MIT)

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Support

Quality

Security

License

Reuse

multi-agent-emergence-environmentsby openai

Python 1454 Version:Current
License: Permissive (MIT)

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Support

Quality

Security

License

Reuse

Rainbowby Kaixhin

Python 1409 Version:Current
License: Permissive (MIT)

Rainbow: Combining Improvements in Deep Reinforcement Learning

Support

Quality

Security

License

Reuse

end-to-end-negotiatorby facebookresearch

Python 1364 Version:Current
License: Proprietary (Proprietary)

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Support

Quality

Security

License

Reuse

TD3by sfujim

Python 1355 Version:Current
License: Permissive (MIT)

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Support

Quality

Security

License

Reuse

Python 1354 Version:Current
License: Proprietary (Proprietary)

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Support

Quality

Security

License

Reuse

Python 1326 Version:Current
License: Permissive (MIT)

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Support

Quality

Security

License

Reuse

leedeeprl-notesby datawhalechina

Python 1323 Version:Current
License: Proprietary (Proprietary)

李宏毅《深度强化学习》笔记，在线阅读地址：https://datawhalechina.github.io/leedeeprl-notes/

Support

Quality

Security

License

Reuse

maddpgby openai

Python 1275 Version:Current
License: Permissive (MIT)

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Support

Quality

Security

License

Reuse

HTML 1272 Version:Current
License: Permissive (MIT)

For deep RL and the future of AI.

Support

Quality

Security

License

Reuse

HTML 1249 Version:Current
License: No License (No License)

Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)

Support

Quality

Security

License

Reuse

Python 1164 Version:Current
License: Permissive (Apache-2.0)

Minimalistic gridworld package for OpenAI Gym

Support

Quality

Security

License

Reuse

SLM-Labby kengz

Python 1145 Version:Current
License: Permissive (MIT)

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Support

Quality

Security

License

Reuse

habitat-labby facebookresearch

Python 1108 Version:Current
License: Permissive (MIT)

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Support

Quality

Security

License

Reuse

ai-legionby eumemic

TypeScript 1096 Version:Current
License: Permissive (MIT)

An LLM-powered autonomous agent platform

Support

Quality

Security

License

Reuse

RLexampleby ucla-rlcourse

Python 1086 Version:Current
License: No License (No License)

Some basic examples of playing with RL

Support

Quality

Security

License

Reuse

StableDiffusion-CheatSheetby SupaGruen

HTML 1085 Version:Current
License: Permissive (MIT)

A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.

Support

Quality

Security

License

Reuse

pytorch-a3cby ikostrikov

Python 1076 Version:Current
License: Permissive (MIT)

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Support

Quality

Security

License

Reuse

PPO-PyTorchby nikhilbarhate99

Python 1067 Version:Current
License: Permissive (MIT)

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Support

Quality

Security

License

Reuse

Python 1059 Version:Current
License: Permissive (MIT)

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Support

Quality

Security

License

Reuse

TextWorldby microsoft

Jupyter Notebook 1044 Version:Current
License: Proprietary (Proprietary)

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Support

Quality

Security

License

Reuse

pfrlby pfnet

Python 1037 Version:Current
License: Permissive (MIT)

PFRL: a PyTorch-based deep reinforcement learning library

Support

Quality

Security

License

Reuse

Reinforcement-Learning-Notebooksby Pulkit-Khandelwal

Jupyter Notebook 1037 Version:Current
License: No License (No License)

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Support

Quality

Security

License

Reuse

softlearningby rail-berkeley

Python 1035 Version:Current
License: Proprietary (Proprietary)

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Support

Quality

Security

License

Reuse

rlaxby deepmind

Python 1025 Version:Current
License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ai-economistby salesforce

Python 1001 Version:Current
License: Permissive (BSD-3-Clause)

Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).

Support

Quality

Security

License

Reuse

Jupyter Notebook 978 Version:Current
License: No License (No License)

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Support

Quality

Security

License

Reuse

basic_reinforcement_learningby vmayoral

Jupyter Notebook 969 Version:Current
License: Strong Copyleft (GPL-3.0)

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Support

Quality

Security

License

Reuse

batch-ppoby google-research

Python 949 Version:Current
License: Permissive (Apache-2.0)

Efficient Batched Reinforcement Learning in TensorFlow

Support

Quality

Security

License

Reuse

FoxDotby Qirky

Python 935 Version:Current
License: Proprietary (Proprietary)

Python driven environment for Live Coding

Support

Quality

Security

License

Reuse

IsaacGymEnvsby NVIDIA-Omniverse

Python 919 Version:Current
License: Proprietary (Proprietary)

Isaac Gym Reinforcement Learning Environments

Support

Quality

Security

License

Reuse

Python 911 Version:Current
License: Permissive (MIT)

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Support

Quality

Security

License

Reuse

ChineseChess-AlphaZeroby NeymarL

Python 909 Version:Current
License: Strong Copyleft (GPL-3.0)

Implement AlphaZero/AlphaGo Zero methods on Chinese chess.

Support

Quality

Security

License

Reuse

rex-gymby nicrusso7

Python 908 Version:Current
License: Permissive (Apache-2.0)

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

Support

Quality

Security

License

Reuse

ElegantRLby AI4Finance-LLC

Python 906 Version:Current
License: Proprietary (Proprietary)

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch. 🔥

Support

Quality

Security

License

Reuse

estoolby hardmaru

Jupyter Notebook 893 Version:Current
License: Proprietary (Proprietary)

Evolution Strategies Tool

Support

Quality

Security

License

Reuse

procgenby openai

C++ 890 Version:Current
License: Permissive (MIT)

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Support

Quality

Security

License

Reuse

Super-mario-bros-PPO-pytorchby uvipen

Python 889 Version:Current
License: Permissive (MIT)

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Support

Quality

Security

License

Reuse

FlappyBirdRLby SarvagyaVaish

JavaScript 877 Version:Current
License: No License (No License)

Flappy Bird hack using Reinforcement Learning

Support

Quality

Security

License

Reuse

gptrpgby dzoba

JavaScript 853 Version:Current
License: No License (No License)

A demo of an GPT-based agent existing in an RPG-like environment

Support

Quality

Security

License

Reuse

PyGame-Learning-Environmentby ntasfi

Python 841 Version:Current
License: Permissive (MIT)

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Support

Quality

Security

License

Reuse

nleby facebookresearch

C 839 Version:Current
License: Proprietary (Proprietary)

The NetHack Learning Environment

Support

Quality

Security

License

Reuse

MetaWorldby Farama-Foundation

Python 822 Version:Current
License: Permissive (MIT)

An open source robotics benchmark for meta- and multi-task reinforcement learning

Support

Quality

Security

License

Reuse

PGPortfolioby ZhengyaoJiang

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Python

1588

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutionsby LyWangPX

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook

1585

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

gym-minigridby Farama-Foundation

Minimalistic gridworld package for OpenAI Gym

Python

1563

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

rlkitby vitchyr

Collection of reinforcement learning algorithms

Python

1560

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

evolution-strategies-starterby openai

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Python

1505

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

snakeby chuyangliu

Artificial intelligence for the Snake game.

Python

1495

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

pymarlby oxwhirl

Python Multi-Agent Reinforcement Learning framework

Python

1471

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

neural-mmoby openai

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Python

1458

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

multi-agent-emergence-environmentsby openai

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python

1454

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Rainbowby Kaixhin

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python

1409

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

end-to-end-negotiatorby facebookresearch

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python

1364

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

TD3by sfujim

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Python

1355

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

noreward-rlby pathak22

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Python

1354

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

rl-baselines3-zooby DLR-RM

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python

1326

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

leedeeprl-notesby datawhalechina

李宏毅《深度强化学习》笔记，在线阅读地址：https://datawhalechina.github.io/leedeeprl-notes/

Python

1323

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

maddpgby openai

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python

1275

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

awesome-deep-rlby tigerneil

For deep RL and the future of AI.

HTML

1272

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

reinforcejsby karpathy

Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)

HTML

1249

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

gym-minigridby maximecb

Minimalistic gridworld package for OpenAI Gym

Python

1164

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

SLM-Labby kengz

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python

1145

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

habitat-labby facebookresearch

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python

1108

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ai-legionby eumemic

An LLM-powered autonomous agent platform

TypeScript

1096

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

RLexampleby ucla-rlcourse

Some basic examples of playing with RL

Python

1086

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

StableDiffusion-CheatSheetby SupaGruen

A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.

HTML

1085

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pytorch-a3cby ikostrikov

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python

1076

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PPO-PyTorchby nikhilbarhate99

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python

1067

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

rl-baselines-zooby araffin

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Python

1059

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

TextWorldby microsoft

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Jupyter Notebook

1044

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

pfrlby pfnet

PFRL: a PyTorch-based deep reinforcement learning library

Python

1037

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Reinforcement-Learning-Notebooksby Pulkit-Khandelwal

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Jupyter Notebook

1037

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

softlearningby rail-berkeley

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python

1035

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

rlaxby deepmind

Python

1025

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ai-economistby salesforce

Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).

Python

1001

Updated: 2 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

DeepRL-Tutorialsby qfettes

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook

978

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

basic_reinforcement_learningby vmayoral

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Jupyter Notebook

969

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

batch-ppoby google-research

Efficient Batched Reinforcement Learning in TensorFlow

Python

949

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

FoxDotby Qirky

Python driven environment for Live Coding

Python

935

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

IsaacGymEnvsby NVIDIA-Omniverse

Isaac Gym Reinforcement Learning Environments

Python

919

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

PyTorch-RLby Khrylx

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python

911

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ChineseChess-AlphaZeroby NeymarL

Implement AlphaZero/AlphaGo Zero methods on Chinese chess.

Python

909

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

rex-gymby nicrusso7

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

Python

908

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ElegantRLby AI4Finance-LLC

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch. 🔥

Python

906

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

estoolby hardmaru

Evolution Strategies Tool

Jupyter Notebook

893

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

procgenby openai

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

C++

890

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Super-mario-bros-PPO-pytorchby uvipen

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Python

889

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

FlappyBirdRLby SarvagyaVaish

Flappy Bird hack using Reinforcement Learning

JavaScript

877

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

gptrpgby dzoba

A demo of an GPT-based agent existing in an RPG-like environment

JavaScript

853

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

PyGame-Learning-Environmentby ntasfi

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python

841

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

nleby facebookresearch

The NetHack Learning Environment

839

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

MetaWorldby Farama-Foundation

An open source robotics benchmark for meta- and multi-task reinforcement learning

Python

822

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Reinforcement Learning Libraries - Page 2

PGPortfolioby ZhengyaoJiang

Python 1588 Version:Current License: Strong Copyleft (GPL-3.0)

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutionsby LyWangPX

Jupyter Notebook 1585 Version:Current License: Permissive (MIT)

Solutions of Reinforcement Learning, An Introduction

gym-minigridby Farama-Foundation

Python 1563 Version:Current License: Permissive (Apache-2.0)

Minimalistic gridworld package for OpenAI Gym

rlkitby vitchyr

Python 1560 Version:Current License: Permissive (MIT)

Collection of reinforcement learning algorithms

evolution-strategies-starterby openai

Python 1505 Version:Current License: Permissive (MIT)

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

snakeby chuyangliu

Python 1495 Version:Current License: Permissive (Apache-2.0)

Artificial intelligence for the Snake game.

pymarlby oxwhirl

Python 1471 Version:Current License: Permissive (Apache-2.0)

Python Multi-Agent Reinforcement Learning framework

neural-mmoby openai

Python 1458 Version:Current License: Permissive (MIT)

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

multi-agent-emergence-environmentsby openai

Python 1454 Version:Current License: Permissive (MIT)

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Rainbowby Kaixhin

Python 1409 Version:Current License: Permissive (MIT)

Rainbow: Combining Improvements in Deep Reinforcement Learning

end-to-end-negotiatorby facebookresearch

Python 1364 Version:Current License: Proprietary (Proprietary)

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

TD3by sfujim

Python 1355 Version:Current License: Permissive (MIT)

Author's PyTorch implementation of TD3 for OpenAI gym tasks

noreward-rlby pathak22

Python 1354 Version:Current License: Proprietary (Proprietary)

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

rl-baselines3-zooby DLR-RM

Python 1326 Version:Current License: Permissive (MIT)

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

leedeeprl-notesby datawhalechina

Python 1323 Version:Current License: Proprietary (Proprietary)

李宏毅《深度强化学习》笔记，在线阅读地址：https://datawhalechina.github.io/leedeeprl-notes/

maddpgby openai

Python 1275 Version:Current License: Permissive (MIT)

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

awesome-deep-rlby tigerneil

HTML 1272 Version:Current License: Permissive (MIT)

For deep RL and the future of AI.

reinforcejsby karpathy

HTML 1249 Version:Current License: No License (No License)

Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)

gym-minigridby maximecb

Python 1164 Version:Current License: Permissive (Apache-2.0)

Minimalistic gridworld package for OpenAI Gym

SLM-Labby kengz

Python 1145 Version:Current License: Permissive (MIT)

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

habitat-labby facebookresearch

Python 1108 Version:Current License: Permissive (MIT)

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

ai-legionby eumemic

TypeScript 1096 Version:Current License: Permissive (MIT)

An LLM-powered autonomous agent platform

RLexampleby ucla-rlcourse

Python 1086 Version:Current License: No License (No License)

Some basic examples of playing with RL

StableDiffusion-CheatSheetby SupaGruen

HTML 1085 Version:Current License: Permissive (MIT)

A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.

pytorch-a3cby ikostrikov

Python 1076 Version:Current License: Permissive (MIT)

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

PPO-PyTorchby nikhilbarhate99

Python 1067 Version:Current License: Permissive (MIT)

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

rl-baselines-zooby araffin

Python 1588 Version:Current
License: Strong Copyleft (GPL-3.0)

Jupyter Notebook 1585 Version:Current
License: Permissive (MIT)

Python 1563 Version:Current
License: Permissive (Apache-2.0)

Python 1560 Version:Current
License: Permissive (MIT)

Python 1505 Version:Current
License: Permissive (MIT)

Python 1495 Version:Current
License: Permissive (Apache-2.0)

Python 1471 Version:Current
License: Permissive (Apache-2.0)

Python 1458 Version:Current
License: Permissive (MIT)

Python 1454 Version:Current
License: Permissive (MIT)

Python 1409 Version:Current
License: Permissive (MIT)

Python 1364 Version:Current
License: Proprietary (Proprietary)

Python 1355 Version:Current
License: Permissive (MIT)

Python 1354 Version:Current
License: Proprietary (Proprietary)

Python 1326 Version:Current
License: Permissive (MIT)

Python 1323 Version:Current
License: Proprietary (Proprietary)

Python 1275 Version:Current
License: Permissive (MIT)

HTML 1272 Version:Current
License: Permissive (MIT)

HTML 1249 Version:Current
License: No License (No License)

Python 1164 Version:Current
License: Permissive (Apache-2.0)

Python 1145 Version:Current
License: Permissive (MIT)

Python 1108 Version:Current
License: Permissive (MIT)

TypeScript 1096 Version:Current
License: Permissive (MIT)

Python 1086 Version:Current
License: No License (No License)

HTML 1085 Version:Current
License: Permissive (MIT)

Python 1076 Version:Current
License: Permissive (MIT)

Python 1067 Version:Current
License: Permissive (MIT)

Python 1059 Version:Current
License: Permissive (MIT)

Jupyter Notebook 1044 Version:Current
License: Proprietary (Proprietary)

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Python 1037 Version:Current
License: Permissive (MIT)

Jupyter Notebook 1037 Version:Current
License: No License (No License)

Python 1035 Version:Current
License: Proprietary (Proprietary)

Python 1025 Version:Current
License: Permissive (Apache-2.0)

Python 1001 Version:Current
License: Permissive (BSD-3-Clause)

Jupyter Notebook 978 Version:Current
License: No License (No License)

Jupyter Notebook 969 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 949 Version:Current
License: Permissive (Apache-2.0)

Python 935 Version:Current
License: Proprietary (Proprietary)

Python 919 Version:Current
License: Proprietary (Proprietary)

Python 911 Version:Current
License: Permissive (MIT)

Python 909 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 908 Version:Current
License: Permissive (Apache-2.0)

Python 906 Version:Current
License: Proprietary (Proprietary)

Jupyter Notebook 893 Version:Current
License: Proprietary (Proprietary)

C++ 890 Version:Current
License: Permissive (MIT)

Python 889 Version:Current
License: Permissive (MIT)

JavaScript 877 Version:Current
License: No License (No License)

JavaScript 853 Version:Current
License: No License (No License)

Python 841 Version:Current
License: Permissive (MIT)

C 839 Version:Current
License: Proprietary (Proprietary)

Python 822 Version:Current
License: Permissive (MIT)