Rock-Paper-Scissors-master

Category: Pattern recognition (vision/speech, etc.)
Development tool: WINDOWS
File size: 135KB
Downloads: 0
Upload date: 2021-03-22 10:20:07
Uploader: toulouse

Description: Rock-Paper-Scissors algorithm

File list:

a.png (55566, 2019-06-19)
b.png (55775, 2019-06-19)
regret_matching.ipynb (166935, 2019-06-19)

## Regret Matching for Rock-Paper-Scissors

Regret Matching algorithm for computing the Nash equilibrium of Rock-Paper-Scissors.

### Algorithm

The **Regret Matching algorithm** $\hat{H}$ maintains a **vector of weights assigned to experts**, $p$. After the loss vector is revealed, we can compute the cumulative regret with respect to expert $i$ at time $t$ (it expresses how much we regret not having listened to expert $i$):

$R_{i, t}=L^{t}_{\hat{H}}-L^{t}_{i}$
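As a rough illustration of the regret formula above, the sketch below accumulates both losses over a short history. The function name `cumulative_regrets`, the toy loss values, and the choice to charge the algorithm its expected loss under its own distribution are assumptions made for this example; they are not taken from `regret_matching.ipynb`.

```python
def cumulative_regrets(loss_vectors, distributions):
    """Return the cumulative regret R_{i,t} for every expert i.

    loss_vectors[s][i]  : loss of expert i in round s
    distributions[s][i] : probability the algorithm put on expert i in round s
    """
    n = len(loss_vectors[0])
    algorithm_loss = 0.0     # cumulative (expected) loss of the algorithm
    expert_loss = [0.0] * n  # cumulative loss of each expert
    for losses, p in zip(loss_vectors, distributions):
        # Assumption: the algorithm's loss in a round is its expected loss under p.
        algorithm_loss += sum(p_i * l_i for p_i, l_i in zip(p, losses))
        for i, l_i in enumerate(losses):
            expert_loss[i] += l_i
    return [algorithm_loss - expert_loss[i] for i in range(n)]


# Two rounds with the same (made-up) loss vector and the same distribution.
losses = [[1.0, 0.0, 0.5], [1.0, 0.0, 0.5]]
p = [0.5, 0.25, 0.25]
regrets = cumulative_regrets(losses, [p, p])
print(regrets)  # [-0.75, 1.25, 0.25]
```

A positive entry means the corresponding expert would have done better than the algorithm so far; a negative entry means it would have done worse.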

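Here $L^{t}_{\hat{H}}$ is the cumulative loss of the algorithm up to time $t$ and $L^{t}_{i}$ is the cumulative loss of expert $i$. Given the regrets, the expert weights are updated with the formula: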

$w_{i,t} = (R_{i,t})_+ = \max(0, R_{i,t})$

Finally, the components of the vector $p^t$ (a probability distribution over the $N$ experts) are given by:

$p_i^t = \begin{cases} w_{i,t}/\sum_{j \in N}{w_{j,t}} & {\rm if} \sum_{j \in N}{w_{j,t}} > 0 \\ 1/N & {\rm otherwise} \end{cases}$
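The weight update and the normalisation above fit in a few lines. This is a minimal sketch in plain Python; the function name `regret_matching_distribution` is hypothetical and the notebook may organise the computation differently.

```python
def regret_matching_distribution(regrets):
    """Map cumulative regrets R_{i,t} to the distribution p^t."""
    # w_{i,t} = max(0, R_{i,t}): only positive regret contributes.
    weights = [max(0.0, r) for r in regrets]
    total = sum(weights)
    n = len(regrets)
    if total > 0:
        return [w / total for w in weights]
    # No expert has positive regret: fall back to the uniform distribution.
    return [1.0 / n] * n


print(regret_matching_distribution([-1.0, 3.0, 1.0]))  # [0.0, 0.75, 0.25]
print(regret_matching_distribution([-1.0, 0.0, -2.0]))  # uniform fallback
```

Experts with non-positive regret receive probability zero, so the algorithm shifts its mass toward the experts it regrets not having followed.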

You can find a more comprehensive explanation of this algorithm [here](https://int8.io/counterfactual-regret-minimization-for-poker-ai/).

### Usage

Check `regret_matching.ipynb` for more information.

### Result

Both players find the Nash equilibrium of the game and play rock, paper, and scissors with roughly the same probability (about 1/3 each).

[Figures: strategy of player a; strategy of player b]
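The reported result can be approximated with a small self-play experiment. The sketch below is not the notebook's code. It assumes that the three actions play the role of experts, that each player samples one action per round from its current distribution, that a loss is -1 for a win, 0 for a tie, and +1 for a defeat, and that the reported strategy is the average of the per-round distributions. All names (`loss`, `distribution`, `self_play`) and the seed are hypothetical.

```python
import random

ACTIONS = ("rock", "paper", "scissors")
N = len(ACTIONS)


def loss(mine, theirs):
    """Loss of playing `mine` against `theirs`: -1 win, 0 tie, +1 defeat."""
    if mine == theirs:
        return 0.0
    # Action k beats action (k - 1) mod N: paper > rock, scissors > paper, rock > scissors.
    return -1.0 if (mine - theirs) % N == 1 else 1.0


def distribution(regrets):
    """Regret matching: normalise positive regrets, else play uniformly."""
    weights = [max(0.0, r) for r in regrets]
    total = sum(weights)
    if total > 0:
        return [w / total for w in weights]
    return [1.0 / N] * N


def self_play(rounds, seed=0):
    """Let two regret-matching players face each other; return their average strategies."""
    rng = random.Random(seed)
    regrets = [[0.0] * N, [0.0] * N]
    strategy_sums = [[0.0] * N, [0.0] * N]
    for _ in range(rounds):
        strategies = [distribution(r) for r in regrets]
        moves = [rng.choices(range(N), weights=s)[0] for s in strategies]
        for player in (0, 1):
            mine, theirs = moves[player], moves[1 - player]
            incurred = loss(mine, theirs)
            for i in range(N):
                # Regret for action i: loss actually incurred minus loss i would have had.
                regrets[player][i] += incurred - loss(i, theirs)
                strategy_sums[player][i] += strategies[player][i]
    return [[s / rounds for s in sums] for sums in strategy_sums]


average_a, average_b = self_play(rounds=20000)
for name, average in (("a", average_a), ("b", average_b)):
    print(name, {action: round(p, 3) for action, p in zip(ACTIONS, average)})
```

The per-round distributions can swing widely; it is the averaged strategy that is expected to settle near the uniform equilibrium, and the exact numbers depend on the seed and the number of rounds.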
