Planning under uncertainty¶

Introduction¶

MDP = Markov Decision Processes POMDP = Partially Observable Markov Decision Processes

Quiz: Branching Factor Question

For this problem (and only this problem) assume actions are stochastic in a way that is different than described in 4. MDP Gridworld.

Instead of an action north possibly going east or west, an action north will possibly go northeast or northwest (i.e. to the diagonal squares).

Likewise for the other directions e.g. an action west will possibly go west, northwest or southwest (i.e. to the diagonals).

Stochastic actions are as in 4. MDP Grid World.

An action North moves North with 80% chance otherwise East with 10% chance or West with 10% chance. Likewise for the other directions.

Stochastic actions are as in 4. MDP Grid World.

An action North moves North with 80% chance otherwise East with 10% chance or West with 10% chance. Likewise for the other directions.

Stochastic actions are as in 4. MDP Grid World.

An action North moves North with 80% chance otherwise East with 10% chance or West with 10% chance. Likewise for the other directions.

>>> 77 * 0.8 + (0.1 * -100) - 3
48.6

Further Study

Charles Isbell and Michael Littmann’s ML course

Peter Norvig and Sebastian Thrun’s AI course: