2023-7-16 arXiv roundup: Weird step sizes help gradient descent, Better CPU matmuls
dblalock.substack.com
This newsletter made possible by MosaicML. Also a reminder that I’m now retweeting high-quality threads about papers to try to improve the signal-to-noise ratio on ML twitter. If you write a thread explaining a paper, just tag me in it and I’ll share it with ~11k followers.
2023-7-16 arXiv roundup: Weird step sizes help gradient descent, Better CPU matmuls
2023-7-16 arXiv roundup: Weird step sizes…
2023-7-16 arXiv roundup: Weird step sizes help gradient descent, Better CPU matmuls
This newsletter made possible by MosaicML. Also a reminder that I’m now retweeting high-quality threads about papers to try to improve the signal-to-noise ratio on ML twitter. If you write a thread explaining a paper, just tag me in it and I’ll share it with ~11k followers.