Skip to content

Commit 7c34de1

Browse files
committed
Add pomp-value-iteration markdown.
1 parent d338515 commit 7c34de1

File tree

2 files changed

+21
-0
lines changed

2 files changed

+21
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -70,6 +70,7 @@ The file [aima3e-algorithms.pdf](https://github.com/aimacode/pseudocode/blob/mas
7070
|<hr/>|<hr/>|<hr/>|
7171
| &bull; | | [VALUE-ITERATION](md/Value-Iteration.md) |
7272
| &bull; | | |
73+
| &bull; | | [POMDP-VALUE-ITERATION](md/POMDP-Value-Iteration.md) |
7374
|<hr/>|<hr/>|<hr/>|
7475
| &bull; | | |
7576
|<hr/>|<hr/>|<hr/>|

md/POMDP-Value-Iteration.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
# POMDP-VALUE-ITERATION
2+
3+
## AIMA3e
4+
__function__ POMDP-VALUE-ITERATION(_pomdp_, _&epsi;_) __returns__ a utility function
5+
&emsp;__inputs__: _pomdp_, a POMDP with states _S_, actions _A_(_s_), transition model _P_(_s&prime;_ &vert; _s_, _a_),
6+
&emsp;&emsp;&emsp;&emsp;&emsp;&emsp;sensor model _P_(_e_ &vert; _s_), rewards _R_(_s_), discount _&gamma;_
7+
&emsp;&emsp;&emsp;&emsp;&emsp;_&epsi;_, the maximum error allowed in the utility of any state
8+
&emsp;__local variables__: _U_, _U&prime;_, sets of plans _p_ with associated utility vectors _&alpha;<sub>p</sub>_
9+
10+
&emsp;_U&prime;_ &larr; a set containing just the empty plan \[\], with _&alpha;<sub>\[\]</sub>_(_s_) = _R_(_s_)
11+
&emsp;__repeat__
12+
&emsp;&emsp;&emsp;_U_ &larr; _U&prime;_
13+
&emsp;&emsp;&emsp;_U&prime;_ &larr; the set of all plans consisting of an action and, for each possible next percept,
14+
&emsp;&emsp;&emsp;&emsp;&emsp;a plan in _U_ with utility vectors computed according to Equation(__??__)
15+
&emsp;&emsp;&emsp;_U&prime;_ &larr; REMOVE\-DOMINATED\-PLANS(_U&prime;_)
16+
&emsp;__until__ MAX\-DIFFERENCE(_U_, _U&prime;_) &lt; _&epsi;_(1 &minus; _&gamma;_) &frasl; _&gamma;_
17+
&emsp;__return__ _U_
18+
19+
---
20+
__Figure ??__ A high\-level sketch of the value iteration algorithm for POMDPs. The REMOVE\-DOMINATED\-PLANS step and MAX\-DIFFERENCE test are typically implemented as linear programs.

0 commit comments

Comments
 (0)