Revisiting Natural Gradient for Deep Networks | Synapse