Markov decision process (MDP) – Reinforcement Learning decision model
2019-07-06
* Is a discrete time stochastic control process for decision making in situations where outcomes are partly random and partly under the control of a decision maker. * At each discrete time step, the process is in some state s, and the decision maker may choose any action a thatContinue Reading