Note here that the sequence
of actions precomputed by the agent. |
|
Subsequent actions are
returned off of the queue, without recomputation. |
|
Only when the queue of
actions is exhausted does the S-P-S-A compute new moves. |
|
To think about: what does
this mean in terms of unexpected results of actions and noisy environments? |