Back to archive

Thread

2 tweets

1
Spent an embarrasingly long time this afternoon putting together an ipython notebook for sgd. Reminds me to take the "simple" stuff serious.
2
Also, bare stochastic gradient descent is not trivial to get to perform well. There's a reason why we have so many variants of this.