Q-learning – Model-free reinforcement learning algorithm
2019-07-06
Learns a policy which tells an agent what action to take under what circumstances. Q-learning learns a policy that is optimal in the sense that it maximizes the expected value of the total reward over any and all successive steps, starting from the current state.Continue Reading