reinforcejs
所属分类:人工智能/神经网络/深度学习
开发工具:HTML
文件大小:604KB
下载次数:0
上传日期:2019-02-18 20:11:56
上 传 者:
sh-1993
说明: reinforcejs,Javascript中的强化学习代理(动态编程、时间差分、深度Q学习、随机设计...
(Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients))
文件列表:
agentzoo (0, 2015-12-07)
agentzoo\puckagent.json (36359, 2015-12-07)
agentzoo\wateragent.json (446607, 2015-12-07)
external (0, 2015-12-07)
external\.DS_Store (6148, 2015-12-07)
external\d3.min.js (150762, 2015-12-07)
external\highlight.pack.js (10183, 2015-12-07)
external\highlight_default.css (2642, 2015-12-07)
external\images (0, 2015-12-07)
external\images\ui-bg_diagonals-thick_18_b81900_40x40.png (418, 2015-12-07)
external\images\ui-bg_diagonals-thick_20_666666_40x40.png (312, 2015-12-07)
external\images\ui-bg_flat_10_000000_40x100.png (205, 2015-12-07)
external\images\ui-bg_glass_100_f6f6f6_1x400.png (262, 2015-12-07)
external\images\ui-bg_glass_100_fdf5ce_1x400.png (348, 2015-12-07)
external\images\ui-bg_glass_65_ffffff_1x400.png (207, 2015-12-07)
external\images\ui-bg_gloss-wave_35_f6a828_500x100.png (5815, 2015-12-07)
external\images\ui-bg_highlight-soft_100_eeeeee_1x100.png (278, 2015-12-07)
external\images\ui-bg_highlight-soft_75_ffe45c_1x100.png (328, 2015-12-07)
external\images\ui-icons_222222_256x240.png (6922, 2015-12-07)
external\images\ui-icons_228ef1_256x240.png (4549, 2015-12-07)
external\images\ui-icons_ef8c08_256x240.png (4549, 2015-12-07)
external\images\ui-icons_ffd27a_256x240.png (4549, 2015-12-07)
external\images\ui-icons_ffffff_256x240.png (6299, 2015-12-07)
external\jquery-1.11.2.min.js (95931, 2015-12-07)
external\jquery-2.1.3.min.js (84319, 2015-12-07)
external\jquery-ui.min.css (18195, 2015-12-07)
external\jquery-ui.min.js (100680, 2015-12-07)
external\jquery.flot.min.js (52966, 2015-12-07)
external\marked.js (28191, 2015-12-07)
external\mathjax.js (60573, 2015-12-07)
external\underscore-min.js (16523, 2015-12-07)
gridworld_dp.html (26862, 2015-12-07)
gridworld_td.html (34259, 2015-12-07)
img (0, 2015-12-07)
img\dpsolved.jpeg (49769, 2015-12-07)
img\lambda.png (11661, 2015-12-07)
img\policyiter.png (8654, 2015-12-07)
img\qsa.jpeg (44553, 2015-12-07)
... ...
# REINFORCEjs
**REINFORCEjs** is a Reinforcement Learning library that implements several common RL algorithms, all with web demos. In particular, the library currently includes:
- **Dynamic Programming** methods
- (Tabular) **Temporal Difference Learning** (SARSA/Q-Learning)
- **Deep Q-Learning** for Q-Learning with function approximation with Neural Networks
- **Stochastic/Deterministic Policy Gradients** and Actor Critic architectures for dealing with continuous action spaces. (*very alpha, likely buggy or at the very least finicky and inconsistent*)
See the [main webpage](http://cs.stanford.edu/people/karpathy/reinforcejs) for many more details, documentation and demos.
# Code Sketch
The library exports two global variables: `R`, and `RL`. The former contains various kinds of utilities for building expression graphs (e.g. LSTMs) and performing automatic backpropagation, and is a fork of my other project [recurrentjs](https://github.com/karpathy/recurrentjs). The `RL` object contains the current implementations:
- `RL.DPAgent` for finite state/action spaces with environment dynamics
- `RL.TDAgent` for finite state/action spaces
- `RL.DQNAgent` for continuous state features but discrete actions
A typical usage might look something like:
```javascript
// create an environment object
var env = {};
env.getNumStates = function() { return 8; }
env.getMaxNumActions = function() { return 4; }
// create the DQN agent
var spec = { alpha: 0.01 } // see full options on DQN page
agent = new RL.DQNAgent(env, spec);
setInterval(function(){ // start the learning loop
var action = agent.act(s); // s is an array of length 8
//... execute action in environment and get the reward
agent.learn(reward); // the agent improves its Q,policy,model, etc. reward is a float
}, 0);
```
The full documentation and demos are on the [main webpage](http://cs.stanford.edu/people/karpathy/reinforcejs).
# License
MIT.
近期下载者:
相关文件:
收藏者: