Davis Summarizes Papers

Davis Summarizes Papers

Home
AI Analysis
Archive
About
2024-8-4 arXiv roundup: LLama 3.1, training a 100T biological neural net
In case you’re wondering what I’ve been up to instead of posting for the past couple months, I was kicking off a training run for a 100T parameter…
Aug 5, 2024 • 
Davis Blalock
36

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2024-8-4 arXiv roundup: LLama 3.1, training a 100T biological neural net
2
2024-8-25: Scaling curves for All of the Things
Good news: we got a bunch of important findings this week.
Aug 26, 2024 • 
Davis Blalock
31

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2024-8-25: Scaling curves for All of the Things
8
2024-4-7 arXiv roundup: DBRX, Backlog highlights part 1
It’s good to be back.
Apr 8, 2024 • 
Davis Blalock
20

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2024-4-7 arXiv roundup: DBRX, Backlog highlights part 1
7
2023-11-19 arXiv roundup: Inverse-free inverse Hessians, Faster LLMs, Closed-form diffusion
A fundamental result in queueing theory is that, if items enter the queue faster than they’re processed, the length of the queue tends to infinity.
Nov 19, 2023 • 
Davis Blalock
25

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-11-19 arXiv roundup: Inverse-free inverse Hessians, Faster LLMs, Closed-form diffusion
2023-9 arXiv roundup: A bunch of good ML systems and Empirical science papers
Got behind the curve again and ended up taking me more than a week to catch up.
Oct 6, 2023 • 
Davis Blalock
27

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-9 arXiv roundup: A bunch of good ML systems and Empirical science papers
2024-4-28 arXiv roundup: data and scaling, backlog highlights part 3
Besides getting to cover unusually interesting work, the upside of having a big backlog is that you can group your coverage thematically.
Apr 29, 2024 • 
Davis Blalock
24

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2024-4-28 arXiv roundup: data and scaling, backlog highlights part 3
1
2024-4-14 arXiv roundup: backlog highlights part 2
Bunch of interesting stuff this week. Before we jump in, one quick clarification from last week: I mentioned how it was an interesting marketing lesson…
Apr 15, 2024 • 
Davis Blalock
16

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2024-4-14 arXiv roundup: backlog highlights part 2
1
2023-11-26 arXiv roundup: Big potential wins, 1 bit per parameter, Simplifying transformers
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication
Dec 1, 2023 • 
Davis Blalock
26

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-11-26 arXiv roundup: Big potential wins, 1 bit per parameter, Simplifying transformers
2023-7-23 arXiv roundup: OpenAI breaking changes, Much better attention and image captions
This newsletter made possible by MosaicML.
Jul 25, 2023 • 
Davis Blalock
18

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-7-23 arXiv roundup: OpenAI breaking changes, Much better attention and image captions
7
2023-10-16 arXiv roundup: Cornucopia of easy (claimed) wins for LLMs
Also, I was on the AI Stories podcast!
Oct 17, 2023 • 
Davis Blalock
20

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-10-16 arXiv roundup: Cornucopia of easy (claimed) wins for LLMs
4
2023-8 arXiv roundup: Look I gave a talk, SILO-ing language models, lots of MoE + tool use papers
(I’m still planning on doing weekly installments in general—I just got behind and it took a while to catch up).
Sep 1, 2023 • 
Davis Blalock
20

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-8 arXiv roundup: Look I gave a talk, SILO-ing language models, lots of MoE + tool use papers
2
2023-7-9 arXiv roundup: LLMs ignore the middle of their context, MoE + instruction tuning rocks
This newsletter made possible by MosaicML.
Jul 11, 2023 • 
Davis Blalock
20

Share this post

Davis Summarizes Papers
Davis Summarizes Papers
2023-7-9 arXiv roundup: LLMs ignore the middle of their context, MoE + instruction tuning rocks
© 2025 Davis Blalock
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share