Nesterov Momentum: A variant of momentum that computes the update from the gradient evaluated at the look-ahead position (the current parameters shifted by the momentum term) rather than at the current position.
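As a minimal sketch of this definition, the update can be written in plain Python for a one-dimensional problem. The objective f(x) = 0.5 * x**2, the function names (`grad`, `nesterov_step`, `minimize`), and the hyperparameter values `lr=0.1` and `mu=0.9` are illustrative assumptions, not part of the definition; this is one common formulation of the update, and libraries may use an algebraically rearranged form.

```python
def grad(x):
    # Gradient of the assumed toy objective f(x) = 0.5 * x**2.
    return x


def nesterov_step(x, v, lr=0.1, mu=0.9):
    # Look ahead to where the momentum term alone would carry the parameters.
    lookahead = x + mu * v
    # Evaluate the gradient at the look-ahead point, not at x.
    g = grad(lookahead)
    # Update the velocity, then move the parameters by the new velocity.
    v_new = mu * v - lr * g
    x_new = x + v_new
    return x_new, v_new


def minimize(x0, steps=200, lr=0.1, mu=0.9):
    # Run repeated Nesterov steps from x0 with zero initial velocity.
    x, v = x0, 0.0
    for _ in range(steps):
        x, v = nesterov_step(x, v, lr, mu)
    return x
```

Classical momentum would differ only in the line that computes `g`: it would call `grad(x)` instead of `grad(lookahead)`.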