1- Is it possible to avoid local minima by combining a crude form of simulated annealing with backprop?<BR><BR>specifically, make the activation or weights stochastic, and gradually reduce the ...
Many new data scientists have voiced what they feel is the lack of a satisfying way to learn the concepts of back propagation/gradient computation in neural networks when taking undergrad level ML ...