Taming the Deep: Exploding Gradients, He Initialization, and Inverted Dropout
In our previous post, we unlocked the black box. We successfully built a 2-layer Neural Network entirely from scratch using raw NumPy matrix calculus.
However, if you take that exact same code, expand
rishiii2.hashnode.dev7 min read