TensorFlow Quantization
During inference, full 32-bit floating-point precision is usually not needed: values can be represented with 8 bits instead. This bins continuous values into discrete ranges, a process known as quantization. It lets roughly four times as many values move across the same memory bandwidth and shrinks the model's memory footprint.
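To make the binning concrete, here is a minimal NumPy sketch of affine (asymmetric) quantization, the common scale-and-zero-point scheme. The function names and details are illustrative, not TensorFlow's actual API:

```python
import numpy as np

def quantize(x, num_bits=8):
    """Map the continuous range [min(x), max(x)] onto the 2**num_bits
    discrete bins available at the lower precision."""
    qmin, qmax = 0, 2 ** num_bits - 1
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / (qmax - qmin) or 1.0  # guard constant inputs
    zero_point = int(round(qmin - lo / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate floats from the 8-bit codes."""
    return scale * (q.astype(np.float32) - zero_point)

weights = np.array([-1.0, -0.5, 0.0, 0.5, 1.0], dtype=np.float32)
q, scale, zp = quantize(weights)
recovered = dequantize(q, scale, zp)
```

Each stored value drops from 4 bytes to 1, and the reconstruction error is bounded by the bin width (`scale`), which is why inference accuracy typically suffers little.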