Supplementary Note to the Gridworld Exmaple in Reinforcement Learning: An Introduction
The author of Reinforcement Learning: An Introduction gives an example of a state-value function using a grid (Example 3.5). Readers like me may be confused about how the values of each cell are calculated.
The author provided codes to compute the state value. The codes are written in LISP.