No More Adam: Learning Rate Scaling at Initialization Is All You NeedComments on Hacker News | Source