irlc.ex02.dp_model.DPModel
- class irlc.ex02.dp_model.DPModel(N)
- Code for the base DP model class. The lecture notes contain further discussion of the distinction between models, environments, and agents, which I recommend reading first. Briefly, the role of the DP model is to define (in detail) the basic N-step decision problem corresponding to the equations:
  - x_{k+1} = f_k(x_k, u_k, w_k)
  - cost = g_k(x_k, u_k, w_k)
  - terminal cost = g_N(x_N)
- The above corresponds to the methods f, g, and gN below. The remaining methods specify the available states and actions (see the definition of the basic problem in the lecture notes).
- Methods
  - A(x, k)
  - Pw(x, u, k)
    - At step k, given x_k, u_k, compute the set of random noise disturbances w and their probabilities as a dict {..., w_i: pw_i, ...} such that
  - S(k)
  - __init__(N)
  - f(x, u, w, k)
  - g(x, u, w, k)
  - gN(x)
  - w_rnd(x, u, k)
    - Generate random disturbances w ~ P_k(x, u) (useful for simulation).
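To illustrate how these methods fit together, here is a minimal sketch of a subclass and a simulated rollout. It rests on several assumptions: the `DPModel` base class below is a stand-in written for this example so that it runs on its own (it mirrors the documented method names but is not the irlc implementation), and `ToyInventoryModel`, `rollout`, the capacity of 2, and the disturbance probabilities are hypothetical choices made purely for illustration.

```python
import random


class DPModel:
    """Stand-in for the documented interface (assumption, not the irlc source)."""

    def __init__(self, N):
        self.N = N  # planning horizon: number of decision steps

    def f(self, x, u, w, k):   # dynamics: x_{k+1} = f_k(x_k, u_k, w_k)
        raise NotImplementedError

    def g(self, x, u, w, k):   # stage cost g_k(x_k, u_k, w_k)
        raise NotImplementedError

    def gN(self, x):           # terminal cost g_N(x_N)
        raise NotImplementedError

    def S(self, k):            # states available at step k
        raise NotImplementedError

    def A(self, x, k):         # actions available in state x at step k
        raise NotImplementedError

    def Pw(self, x, u, k):     # dict {w_i: pw_i} of disturbances and probabilities
        raise NotImplementedError

    def w_rnd(self, x, u, k):
        """Sample one disturbance w ~ P_k(x, u) using the dict returned by Pw."""
        pw = self.Pw(x, u, k)
        ws, ps = zip(*pw.items())
        return random.choices(ws, weights=ps, k=1)[0]


class ToyInventoryModel(DPModel):
    """Hypothetical toy problem: stock x in {0, 1, 2}, order u, demand w."""

    def f(self, x, u, w, k):
        # Stock after ordering u and serving demand w, kept within [0, 2].
        return max(0, min(2, x + u - w))

    def g(self, x, u, w, k):
        # Ordering cost plus a quadratic penalty on surplus or shortage.
        return u + (x + u - w) ** 2

    def gN(self, x):
        return 0  # no terminal cost in this toy example

    def S(self, k):
        return [0, 1, 2]

    def A(self, x, k):
        # Orders must not push the stock above the capacity of 2.
        return list(range(0, 3 - x))

    def Pw(self, x, u, k):
        return {0: 0.1, 1: 0.7, 2: 0.2}


def rollout(model, x0, policy, seed=0):
    """Simulate N steps under `policy` and return the accumulated cost."""
    random.seed(seed)
    x, total = x0, 0.0
    for k in range(model.N):
        u = policy(x, k)
        w = model.w_rnd(x, u, k)
        total += model.g(x, u, w, k)
        x = model.f(x, u, w, k)
    return total + model.gN(x)


model = ToyInventoryModel(N=3)
# Example policy: always order up to capacity (largest available action).
total = rollout(model, x0=0, policy=lambda x, k: model.A(x, k)[-1])
```

The split mirrors the description above: `f`, `g`, and `gN` encode the three equations, `S` and `A` enumerate states and actions, and `Pw` together with `w_rnd` supplies the noise model used when simulating.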