In the Input File Format Used by the Tool, the pairs `r_{1,t,p} r_{2,t,p}` are listed with t as the outer index and p as the inner index, that is, in the (t, p) order (1, 1), ..., (1, M-1), (2, 1), ..., (2, M-1), and so on.
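As a rough illustration of that ordering, here is a minimal Python sketch. The function names `pair_order` and `pair_index` and the parameter `T` (the number of values that t takes) are assumptions made for this example; they do not come from the distributed tool, and the sketch says nothing about how the values themselves are formatted in the file.

```python
def pair_order(T, M):
    """Return the (t, p) pairs in listed order: t is the outer index, p the inner.

    T (number of t values) is a hypothetical parameter for this sketch;
    p runs over 1..M-1 as stated in the answer above.
    """
    return [(t, p) for t in range(1, T + 1) for p in range(1, M)]


def pair_index(t, p, M):
    """Return the 0-based position of the (t, p) pair within the listed order."""
    if t < 1 or not (1 <= p <= M - 1):
        raise ValueError("expected t >= 1 and 1 <= p <= M-1")
    # Each value of t contributes M-1 pairs, one per p.
    return (t - 1) * (M - 1) + (p - 1)
```

For example, with M = 4 the pair (2, 1) is the fourth pair listed, so `pair_index(2, 1, 4)` gives 3.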
Q. The tool description states that “$r_{1,t,p}$ is used to determine whether to take a random action or a greedy action, and $r_{2,t,p}$ is used to choose the move destination.” How exactly are the choice between a random or greedy action and the destination determined?
A. Please refer to the implementation of the distributed tool for the specific decision method.