activation functions 2 Why is initialization essential to deep networks? Aug 7, 2024 Activation functions and the vanishing gradient problem Aug 1, 2024