Artificial Intelligence: Reinforcement Learning - A3C (Part 38)
The three A's in A3C
Actor Critic
Now we can shrink the last layer (shown in red) to make room for another output.
The network now has two outputs in its last layer.
The top part holds the Q values and is called the Actor, as it has the leading role,
and the second one has 1 value an...
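
To make the two-output idea concrete, here is a minimal sketch in plain Python of a network whose shared hidden layer feeds two heads: one with a value per action and one with a single value. Everything in it is an assumption for illustration, not the article's implementation: the class name `TwoHeadedNetwork`, the layer sizes, the ReLU activation, the random initialisation, and the name `critic` for the single-value head (borrowed from the "Actor Critic" heading above).

```python
import random


def linear(inputs, weights, biases):
    """Fully connected layer: one output per row of `weights`."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (hypothetical initialisation)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class TwoHeadedNetwork:
    """Shared hidden layer feeding two output heads (illustrative sizes)."""

    def __init__(self, n_inputs, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)
        self.hidden = init_layer(n_inputs, n_hidden, rng)
        # Top output: one value per action (the "Actor" part in the text).
        self.actor = init_layer(n_hidden, n_actions, rng)
        # Second output: a single value (named `critic` here by assumption).
        self.critic = init_layer(n_hidden, 1, rng)

    def forward(self, state):
        # Both heads read the same hidden representation (ReLU assumed).
        h = [max(0.0, z) for z in linear(state, *self.hidden)]
        action_values = linear(h, *self.actor)
        state_value = linear(h, *self.critic)[0]
        return action_values, state_value


net = TwoHeadedNetwork(n_inputs=4, n_hidden=8, n_actions=3)
action_values, state_value = net.forward([0.1, -0.2, 0.3, 0.0])
print(len(action_values), type(state_value).__name__)
```

The point of the sketch is only the shape of the outputs: a single forward pass returns a list with one entry per action from the top head and one scalar from the second head. A real implementation would normally use a deep learning library and a convolutional front end rather than hand-written layers.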