Multi-agent-Q-lambda-learning (Pudn.com)

Category: GPU/Graphics card
Development tool: CMake
File size: 7171 KB
Downloads: 0
Upload date: 2023-02-28 05:15:03
Uploader: sh-1993

Description: Asynchronous parallel multi-agent Q-lambda learning with GPU programming in CUDA

File list:

.DS_Store (6148, 2023-02-28)
CMakeLists.txt (1791, 2023-02-28)
ECE 277 Final Project Slides.pdf (1155686, 2023-02-28)
Extern (0, 2023-02-28)
Extern\glad (0, 2023-02-28)
Extern\glad\include (0, 2023-02-28)
Extern\glad\include\KHR (0, 2023-02-28)
Extern\glad\include\KHR\khrplatform.h (10022, 2023-02-28)
Extern\glad\include\glad (0, 2023-02-28)
Extern\glad\include\glad\glad.h (164734, 2023-02-28)
Extern\glad\src (0, 2023-02-28)
Extern\glad\src\glad.c (82394, 2023-02-28)
Extern\glm (0, 2023-02-28)
Extern\glm\include (0, 2023-02-28)
Extern\glm\include\glm (0, 2023-02-28)
Extern\glm\include\glm\core (0, 2023-02-28)
Extern\glm\include\glm\core\_detail.hpp (9966, 2023-02-28)
Extern\glm\include\glm\core\_fixes.hpp (1859, 2023-02-28)
Extern\glm\include\glm\core\_swizzle.hpp (49712, 2023-02-28)
Extern\glm\include\glm\core\_swizzle_func.hpp (62860, 2023-02-28)
Extern\glm\include\glm\core\_vectorize.hpp (4202, 2023-02-28)
Extern\glm\include\glm\core\dummy.cpp (1701, 2023-02-28)
Extern\glm\include\glm\core\func_common.hpp (19641, 2023-02-28)
Extern\glm\include\glm\core\func_common.inl (26493, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.hpp (6226, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.inl (4189, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.hpp (5896, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.inl (8302, 2023-02-28)
Extern\glm\include\glm\core\func_integer.hpp (9533, 2023-02-28)
Extern\glm\include\glm\core\func_integer.inl (16040, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.hpp (6760, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.inl (16602, 2023-02-28)
Extern\glm\include\glm\core\func_noise.hpp (3984, 2023-02-28)
Extern\glm\include\glm\core\func_noise.inl (13275, 2023-02-28)
Extern\glm\include\glm\core\func_packing.hpp (12383, 2023-02-28)
Extern\glm\include\glm\core\func_packing.inl (6073, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.hpp (9760, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.inl (5852, 2023-02-28)
... ...

# Asynchronous Multi-agent Q(λ) Learning

This repo contains a C++/CUDA implementation of parallel Q(λ) learning, an extension of one-step Q-learning (also referred to as Q(0) learning). It implements Watkins's Q(λ) as described in Sutton and Barto's [reinforcement learning book](http://www.incompleteideas.net/book/the-book-2nd.html). The implementation differs from the textbook version in that it uses the GPU to run the agents in parallel, so that they learn asynchronously.

## Environment

The environment is a 32x32 grid with 128 agents that try to reach a single flag. The grid also contains 96 mines; stepping on a mine ends that agent's attempt.
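The environment described above could be modelled roughly as follows. This is a sketch under stated assumptions, not the repo's code: the README gives only the grid size and the agent, mine, and flag counts, so the four-action move set, the reward values (+1 for the flag, -1 for a mine, 0 otherwise), the random placement, and all names such as `GridWorld` and `spawnAgents` are hypothetical.

```cpp
#include <algorithm>
#include <cstdint>
#include <random>
#include <vector>

constexpr int kGridSize = 32;    // from the README: 32x32 grid
constexpr int kNumAgents = 128;  // from the README
constexpr int kNumMines = 96;    // from the README

enum class Cell : std::uint8_t { Empty, Mine, Flag };
enum class Action : int { Right = 0, Down, Left, Up };  // assumed action set

struct StepResult {
    int nextState;  // flattened cell index y * kGridSize + x
    float reward;   // assumed reward scheme, see lead-in
    bool done;      // true once the agent hits the flag or a mine
};

struct GridWorld {
    std::vector<Cell> cells;

    // Places one flag and kNumMines mines on distinct random cells.
    explicit GridWorld(unsigned seed)
        : cells(kGridSize * kGridSize, Cell::Empty) {
        std::mt19937 rng(seed);
        std::uniform_int_distribution<int> pick(0, kGridSize * kGridSize - 1);
        cells[pick(rng)] = Cell::Flag;
        int placed = 0;
        while (placed < kNumMines) {
            const int c = pick(rng);
            if (cells[c] == Cell::Empty) {
                cells[c] = Cell::Mine;
                ++placed;
            }
        }
    }

    // Random start states for all agents, restricted to empty cells.
    std::vector<int> spawnAgents(unsigned seed) const {
        std::mt19937 rng(seed);
        std::uniform_int_distribution<int> pick(0, kGridSize * kGridSize - 1);
        std::vector<int> starts;
        while (static_cast<int>(starts.size()) < kNumAgents) {
            const int c = pick(rng);
            if (cells[c] == Cell::Empty) starts.push_back(c);
        }
        return starts;
    }

    // Moves one cell in the given direction, clamped at the grid border.
    StepResult step(int state, Action action) const {
        int x = state % kGridSize;
        int y = state / kGridSize;
        switch (action) {
            case Action::Right: x = std::min(x + 1, kGridSize - 1); break;
            case Action::Down:  y = std::min(y + 1, kGridSize - 1); break;
            case Action::Left:  x = std::max(x - 1, 0); break;
            case Action::Up:    y = std::max(y - 1, 0); break;
        }
        const int next = y * kGridSize + x;
        if (cells[next] == Cell::Flag) return {next, 1.0f, true};
        if (cells[next] == Cell::Mine) return {next, -1.0f, true};
        return {next, 0.0f, false};
    }
};
```

Flattening the grid to a single state index keeps the Q-table a plain array of 1024 rows, which is also a convenient layout for a GPU kernel that assigns one thread per agent.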

The goal is to learn a successful path to the flag from every state on the grid.

## Demo

[Q-learning demo](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).

[Q(λ) learning demo and presentation](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).
