Working Notes: a commonplace notebook for recording & exploring ideas.
Home. Site Map. Subscribe. More at expLog.
Backprop
- Chain rule: g(h(s))' = g'(h(s)) * h'(s)
- Consider the behavior caused by a perturbation
- While backprop
- use computed some of gradients using the weights backwards
— Kunal