irlc.ex02.dp.DP_stochastic
- irlc.ex02.dp.DP_stochastic(model)
Implement the stochastic DP algorithm. The implementation follows (Her21, Algorithm 1). If you run into problems, I recommend following the hints in (Her21, Subsection 6.2.1) and focusing first on the case without a noise term; once that works, you can add the w-terms. When you do not loop over noise terms, pass them as w = None to env.f and env.g.
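As a rough illustration of the backward recursion, here is a minimal sketch. It is not the irlc implementation. ToyModel and dp_stochastic_sketch are hypothetical names, and the model interface used here (N, S(k), A(x, k), Pw(x, u, k) returning a dictionary {w: probability}, f, g and gN) is an assumption that this page does not confirm. The returned pair (J, pi) is likewise an assumed convention.

```python
class ToyModel:
    """Hypothetical two-stage toy problem; not part of irlc."""
    N = 2  # planning horizon (assumed attribute name)

    def S(self, k):            # states available at stage k
        return [0, 1, 2]

    def A(self, x, k):         # actions available in state x at stage k
        return [0, 1]

    def Pw(self, x, u, k):     # noise distribution as {w: probability}
        return {0: 0.5, 1: 0.5}

    def f(self, x, u, w, k):   # next state, clipped to the state space
        return max(0, min(2, x + u - w))

    def g(self, x, u, w, k):   # stage cost
        return u + (x + u - w) ** 2

    def gN(self, x):           # terminal cost
        return 0


def dp_stochastic_sketch(model):
    """Backward DP: J[k][x] is the expected cost-to-go, pi[k][x] the action."""
    N = model.N
    J = [dict() for _ in range(N + 1)]
    pi = [dict() for _ in range(N)]

    # Initialize with the terminal cost.
    for x in model.S(N):
        J[N][x] = model.gN(x)

    # Sweep backwards over the stages k = N-1, ..., 0.
    for k in range(N - 1, -1, -1):
        for x in model.S(k):
            # Expected cost of each action, averaged over the noise w.
            Q = {
                u: sum(
                    p * (model.g(x, u, w, k) + J[k + 1][model.f(x, u, w, k)])
                    for w, p in model.Pw(x, u, k).items()
                )
                for u in model.A(x, k)
            }
            u_best = min(Q, key=Q.get)  # action with the lowest expected cost
            pi[k][x] = u_best
            J[k][x] = Q[u_best]
    return J, pi


J, pi = dp_stochastic_sketch(ToyModel())
```

For the noise-free case described above, the sum over w would reduce to a single term evaluated with w = None, assuming the model's f and g accept that value.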