Davis Summarizes Papers
Subscribe
Sign in
Home
AI Analysis
Archive
About
Latest
Top
Discussions
2024-8-25: Scaling curves for All of the Things
Good news: we got a bunch of important findings this week.
Aug 26
•
Davis Blalock
24
Share this post
2024-8-25: Scaling curves for All of the Things
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
8
2024-8-4 arXiv roundup: LLama 3.1, training a 100T biological neural net
In case you’re wondering what I’ve been up to instead of posting for the past couple months, I was kicking off a training run for a 100T parameter…
Aug 5
•
Davis Blalock
33
Share this post
2024-8-4 arXiv roundup: LLama 3.1, training a 100T biological neural net
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
2
April 2024
2024-4-28 arXiv roundup: data and scaling, backlog highlights part 3
Besides getting to cover unusually interesting work, the upside of having a big backlog is that you can group your coverage thematically.
Apr 29
•
Davis Blalock
22
Share this post
2024-4-28 arXiv roundup: data and scaling, backlog highlights part 3
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
1
Research Archetypes: Scientists, Mathematicians, Inventors
What "research" entails day-to-day varies by field, subfield, problem, and individual researcher.
Apr 21
•
Davis Blalock
5
Share this post
Research Archetypes: Scientists, Mathematicians, Inventors
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
2024-4-14 arXiv roundup: backlog highlights part 2
Bunch of interesting stuff this week. Before we jump in, one quick clarification from last week: I mentioned how it was an interesting marketing lesson…
Apr 15
•
Davis Blalock
15
Share this post
2024-4-14 arXiv roundup: backlog highlights part 2
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
1
2024-4-7 arXiv roundup: DBRX, Backlog highlights part 1
It’s good to be back.
Apr 8
•
Davis Blalock
19
Share this post
2024-4-7 arXiv roundup: DBRX, Backlog highlights part 1
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
7
December 2023
2023-11-26 arXiv roundup: Big potential wins, 1 bit per parameter, Simplifying transformers
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication
Dec 1, 2023
•
Davis Blalock
26
Share this post
2023-11-26 arXiv roundup: Big potential wins, 1 bit per parameter, Simplifying transformers
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
November 2023
2023-11-19 arXiv roundup: Inverse-free inverse Hessians, Faster LLMs, Closed-form diffusion
A fundamental result in queueing theory is that, if items enter the queue faster than they’re processed, the length of the queue tends to infinity.
Nov 19, 2023
•
Davis Blalock
24
Share this post
2023-11-19 arXiv roundup: Inverse-free inverse Hessians, Faster LLMs, Closed-form diffusion
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
October 2023
2023-10-16 arXiv roundup: Cornucopia of easy (claimed) wins for LLMs
Also, I was on the AI Stories podcast!
Oct 17, 2023
•
Davis Blalock
20
Share this post
2023-10-16 arXiv roundup: Cornucopia of easy (claimed) wins for LLMs
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
4
2023-9 arXiv roundup: A bunch of good ML systems and Empirical science papers
Got behind the curve again and ended up taking me more than a week to catch up.
Oct 6, 2023
•
Davis Blalock
27
Share this post
2023-9 arXiv roundup: A bunch of good ML systems and Empirical science papers
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
September 2023
2023-8 arXiv roundup: Look I gave a talk, SILO-ing language models, lots of MoE + tool use papers
(I’m still planning on doing weekly installments in general—I just got behind and it took a while to catch up).
Sep 1, 2023
•
Davis Blalock
20
Share this post
2023-8 arXiv roundup: Look I gave a talk, SILO-ing language models, lots of MoE + tool use papers
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
2
August 2023
2023-7-30 arXiv roundup: Better image captions, Scaling EMA, Chain of thought empiricism
Heads up that we’ve hit the late summer slump in arXiv submissions, so there’s less content than usual this week.
Aug 1, 2023
•
Davis Blalock
16
Share this post
2023-7-30 arXiv roundup: Better image captions, Scaling EMA, Chain of thought empiricism
dblalock.substack.com
Copy link
Facebook
Email
Note
Other
1
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts