Reinforcement-learning-with-tensorflow-master 联合开发网

Pudn.com > 下载中心 > 其他 > Reinforcement-learning-with-tensorflow-master

Reinforcement-learning-with-tensorflow-master

所属分类：其他
开发工具：Python
文件大小：396KB
下载次数：0
上传日期：2021-03-08 17:52:56
上传者：周冬雨的小号

说明：包含强化学习的各类算法，以及对应的例子，是非常不错的入门算法。
(It includes all kinds of algorithms of reinforcement learning and corresponding examples, which is a very good entry algorithm.)

文件列表:

Reinforcement-learning-with-tensorflow (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\LICENCE (1055, 2020-08-05)
Reinforcement-learning-with-tensorflow\RL_cover.jpg (69700, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\10_A3C (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\10_A3C\A3C_RNN.py (9429, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\10_A3C\A3C_continuous_action.py (8158, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\10_A3C\A3C_discrete_action.py (7894, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\10_A3C\A3C_distributed_tf.py (9199, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\11_Dyna_Q (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\11_Dyna_Q\RL_brain.py (2915, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\11_Dyna_Q\maze_env.py (3898, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\11_Dyna_Q\run_this.py (1500, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\12_Proximal_Policy_Optimization (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\12_Proximal_Policy_Optimization\DPPO.py (8270, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\12_Proximal_Policy_Optimization\discrete_DPPO.py (8817, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\12_Proximal_Policy_Optimization\simply_PPO.py (6467, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\1_command_line_reinforcement_learning (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\1_command_line_reinforcement_learning\treasure_on_right.py (3444, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\2_Q_Learning_maze (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\2_Q_Learning_maze\RL_brain.py (1851, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\2_Q_Learning_maze\maze_env.py (4307, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\2_Q_Learning_maze\run_this.py (1388, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\3_Sarsa_maze (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\3_Sarsa_maze\RL_brain.py (2711, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\3_Sarsa_maze\maze_env.py (4012, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\3_Sarsa_maze\run_this.py (1510, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\4_Sarsa_lambda_maze (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\4_Sarsa_lambda_maze\RL_brain.py (3177, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\4_Sarsa_lambda_maze\maze_env.py (4013, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\4_Sarsa_lambda_maze\run_this.py (1602, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.1_Double_DQN (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.1_Double_DQN\RL_brain.py (6678, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.1_Double_DQN\run_Pendulum.py (2133, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.2_Prioritized_Replay_DQN (0, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.2_Prioritized_Replay_DQN\RL_brain.py (11823, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.2_Prioritized_Replay_DQN\run_MountainCar.py (2113, 2020-08-05)
Reinforcement-learning-with-tensorflow\contents\5.3_Dueling_DQN (0, 2020-08-05)
... ...

# Reinforcement Learning Methods and Tutorials In these tutorials for reinforcement learning, it covers from the basic RL algorithms to advanced algorithms developed recent years. **If you speak Chinese, visit [莫烦 Python](https://mofanpy.com) or my [Youtube channel](https://www.youtube.com/channel/UCdyjiB5H8Pu7aDTNVXTTpcg) for more.** **As many requests about making these tutorials available in English, please find them in this playlist:** ([https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba](https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba)) # Table of Contents * Tutorials * [Simple entry example](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning) * [Q-learning](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze) * [Sarsa](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze) * [Sarsa(lambda)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze) * [Deep Q Network (DQN)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network) * [Using OpenAI Gym](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym) * [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN) * [DQN with Prioitized Experience Replay](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN) * [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN) * [Policy Gradients](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax) * [Actor-Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage) * [Deep Deterministic Policy Gradient (DDPG)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG) * [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C) * [Dyna-Q](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q) * [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization) * [Curiosity Model](/contents/Curiosity_Model), [Random Network Distillation (RND)](/contents/Curiosity_Model/Random_Network_Distillation.py) * [Some of my experiments](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments) * [2D Car](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/2D_car) * [Robot arm](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Robot_arm) * [BipedalWalker](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_BipedalWalker) * [LunarLander](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_LunarLander) # Some RL Networks ### [Deep Q Network](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network)

### [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN)

### [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN)

### [Actor Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage)

### [Deep Deterministic Policy Gradient](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG)

### [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C)

### [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization)

### [Curiosity Model](/contents/Curiosity_Model)

# Donation *If this does help you, please consider donating to support me for better tutorials. Any contribution is greatly appreciated!*

近期下载者：

相关文件：

评论：[我要评论] [举报此文件]

收藏者：