Multi-agent-Q-lambda-learning

所属分类:GPU/显卡
开发工具:CMake
文件大小:7171KB
下载次数:0
上传日期:2023-02-28 05:15:03
上 传 者sh-1993
说明:  CUDA中GPU编程的异步并行多代理Q-lambda学习
(Asynchronous parallel Multi-agent Q-lambda learning with GPU programming in CUDA)

文件列表:
.DS_Store (6148, 2023-02-28)
CMakeLists.txt (1791, 2023-02-28)
ECE 277 Final Project Slides.pdf (1155686, 2023-02-28)
Extern (0, 2023-02-28)
Extern\glad (0, 2023-02-28)
Extern\glad\include (0, 2023-02-28)
Extern\glad\include\KHR (0, 2023-02-28)
Extern\glad\include\KHR\khrplatform.h (10022, 2023-02-28)
Extern\glad\include\glad (0, 2023-02-28)
Extern\glad\include\glad\glad.h (164734, 2023-02-28)
Extern\glad\src (0, 2023-02-28)
Extern\glad\src\glad.c (82394, 2023-02-28)
Extern\glm (0, 2023-02-28)
Extern\glm\include (0, 2023-02-28)
Extern\glm\include\glm (0, 2023-02-28)
Extern\glm\include\glm\core (0, 2023-02-28)
Extern\glm\include\glm\core\_detail.hpp (9966, 2023-02-28)
Extern\glm\include\glm\core\_fixes.hpp (1859, 2023-02-28)
Extern\glm\include\glm\core\_swizzle.hpp (49712, 2023-02-28)
Extern\glm\include\glm\core\_swizzle_func.hpp (62860, 2023-02-28)
Extern\glm\include\glm\core\_vectorize.hpp (4202, 2023-02-28)
Extern\glm\include\glm\core\dummy.cpp (1701, 2023-02-28)
Extern\glm\include\glm\core\func_common.hpp (19641, 2023-02-28)
Extern\glm\include\glm\core\func_common.inl (26493, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.hpp (6226, 2023-02-28)
Extern\glm\include\glm\core\func_exponential.inl (4189, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.hpp (5896, 2023-02-28)
Extern\glm\include\glm\core\func_geometric.inl (8302, 2023-02-28)
Extern\glm\include\glm\core\func_integer.hpp (9533, 2023-02-28)
Extern\glm\include\glm\core\func_integer.inl (16040, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.hpp (6760, 2023-02-28)
Extern\glm\include\glm\core\func_matrix.inl (16602, 2023-02-28)
Extern\glm\include\glm\core\func_noise.hpp (3984, 2023-02-28)
Extern\glm\include\glm\core\func_noise.inl (13275, 2023-02-28)
Extern\glm\include\glm\core\func_packing.hpp (12383, 2023-02-28)
Extern\glm\include\glm\core\func_packing.inl (6073, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.hpp (9760, 2023-02-28)
Extern\glm\include\glm\core\func_trigonometric.inl (5852, 2023-02-28)
... ...

# Asynchronous Multi-agent Q(λ) Learning This repo contains a C++/CUDA implementation of parallel Q(λ) Learning, which is an extension of Q-learning (referred to as Q(0) learning), and is an implementation of Watson's Q(λ) Learning found in Sutton and Barto's [reinforcement learning book](http://www.incompleteideas.net/book/the-book-2nd.html). The implementation however is different due to taking advantage of the GPUs to achieve parallelism for asynchronous learning of the algorithm.

## Environment The environment consists of a 32x32 grid and 128 agents that try to reach one flag. There are also 96 mines that if stepped on by agents will terminate their chances.

The goal is to learn a successful path to the flag for all states on the grid. ## Demo [Q-learning Demo](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).
[Q(λ) Learning demo and presesntation](https://www.youtube.com/watch?v=XxnJLYPveT4&ab_channel=JawnHo).

OpenAI logo
OpenAI logo


近期下载者

相关文件


收藏者