 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
l |
Fully
observable (vs. partially
observable): An agent's
|
|
|
sensors give it
access to the complete state of the
|
|
|
environment at
each point in time.
|
|
|
l |
Deterministic (vs. stochastic): The next state of the
|
|
|
environment is
completely determined by the current
|
|
|
state and the
action executed by the agent. (If the
|
|
|
environment is
deterministic except for the actions of
|
|
|
other agents,
then the environment is strategic)
|
|
|
l |
Episodic
(vs. sequential): The agent's
experience is
|
|
|
divided into
atomic "episodes" (each episode consists of
|
|
the agent
perceiving and then performing a single
|
|
|
action), and the
choice of action in each episode
|
|
|
depends only on
the episode itself.
|
|