Davis Summarizes Papers

Davis Summarizes Papers

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2021-12-4: Sparsity is Enough in Scaling Transformers, Sparse ImageNet transfer

2021-12-4: Sparsity is Enough in Scaling…

Davis Blalock
May 3, 2022

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2021-12-4: Sparsity is Enough in Scaling Transformers, Sparse ImageNet transfer

Adaptive Optimization with Examplewise Gradients

Read →
Comments
User's avatar
© 2025 Davis Blalock
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share