Multi-agent-Q-lambda-learning
所属分类:GPU/显卡
开发工具:CMake
文件大小:7171KB
下载次数:0
上传日期:2023-02-28 05:15:03
上 传 者:
sh-1993
说明: CUDA中GPU编程的异步并行多代理Q-lambda学习
(Asynchronous parallel Multi-agent Q-lambda learning with GPU programming in CUDA)
文件列表:
.DS_Store (6148, 2023-02-28)
CMakeLists.txt (1791, 2023-02-28)
ECE 277 Final Project Slides.pdf (1155686, 2023-02-28)
Extern (0, 2023-02-28)
Extern\glad (0, 2023-02-28)
Extern\glad\include (0, 2023-02-28)
Extern\glad\include\KHR (0, 2023-02-28)
Extern\glad\include\KHR\khrplatform.h (10022, 2023-02-28)
Extern\glad\include\glad (0, 2023-02-28)
Extern\glad\include\glad\glad.h (164734, 2023-02-28)
Extern\glad\src (0, 2023-02-28)
Extern\glad\src\glad.c (82394, 2023-02-28)
Extern\glm (0, 2023-02-28)
Extern\glm\include (0, 2023-02-28)
Extern\glm\include\glm (0, 2023-02-28)
Extern\glm\include\glm\core (0, 2023-02-28)
Extern\glm\include\glm\core\_detail.hpp (9966, 2023-02-28)
Extern\glm\include\glm\core\_fixes.hpp (1859, 2023-02-28)
Extern\glm\include\glm\core\_swizzle.hpp (49712, 2023-02-28)
Extern\glm\include\glm\core\_swizzle_func.hpp (62860, 2023-02-28)
Extern\glm\include\glm\core\_vectorize.hpp (4202, 2023-02-28)
Extern\glm\include\glm\core\dummy.cpp (1701, 2023-02-28)
Extern\glm\include\glm\core\func_common.hpp (19641, 2023-02-28)
Extern\glm\include\glm\core\func_common.inl (26493, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.hpp (6226, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.inl (4189, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.hpp (5896, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.inl (8302, 2023-02-28)
Extern\glm\include\glm\core\func_integer.hpp (9533, 2023-02-28)
Extern\glm\include\glm\core\func_integer.inl (16040, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.hpp (6760, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.inl (16602, 2023-02-28)
Extern\glm\include\glm\core\func_noise.hpp (3984, 2023-02-28)
Extern\glm\include\glm\core\func_noise.inl (13275, 2023-02-28)
Extern\glm\include\glm\core\func_packing.hpp (12383, 2023-02-28)
Extern\glm\include\glm\core\func_packing.inl (6073, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.hpp (9760, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.inl (5852, 2023-02-28)
... ...
# Asynchronous Multi-agent Q(λ) Learning
This repo contains a C++/CUDA implementation of parallel Q(λ) Learning, which is an extension of Q-learning (referred to as Q(0) learning), and is an implementation of Watson's Q(λ) Learning found in Sutton and Barto's [reinforcement learning book](http://www.incompleteideas.net/book/the-book-2nd.html). The implementation however is different due to taking advantage of the GPUs to achieve parallelism for asynchronous learning of the algorithm.
## Environment
The environment consists of a 32x32 grid and 128 agents that try to reach one flag. There are also 96 mines that if stepped on by agents will terminate their chances.
The goal is to learn a successful path to the flag for all states on the grid.
## Demo
[Q-learning Demo](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).
[Q(λ) Learning demo and presesntation](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).
近期下载者:
相关文件:
收藏者: