Davis Summarizes Papers
May 2023
2023-5-28 arXiv roundup: 994 papers, beating RLHF with 1000 good examples, LoRA with half the RAM, Multi-epoch scaling
This newsletter made possible by MosaicML. We got a bunch of interesting stuff this week, thanks in part to getting the most submissions of all time…
May 30 • Davis Blalock
2023-5-21 arXiv roundup: Parallel transformers, Optimized data mixtures, Don't trust LLM chains-of-thought
This newsletter made possible by MosaicML. Accelerating Transformer Inference for Translation via Parallel Decoding They speed up inference in…
May 25 • Davis Blalock
2023-5-14 arXiv roundup: FrugalGPT, Inverse CLIP scaling, Embedding all the modalities
This newsletter made possible by MosaicML. An Inverse Scaling Law for CLIP Training They find that larger CLIP models let you get away with training on…
May 16 • Davis Blalock
2023-5-7 arXiv roundup: Easy loss spike fix, LLongboi, H100s, stable diffusion for $50k
This newsletter made possible by MosaicML. Thanks to Haoli Yin and Trevor Gale for the Twitter shoutouts this week! Introducing MPT-7B: A New Standard…
May 10 • Davis Blalock
April 2023
2023-4-23 arXiv roundup: Adam instability, better hypernetworks, More Branch-Train-Merge
This newsletter made possible by MosaicML. Also: is anyone looking for a podcast guest? I may or may not have a podcast in the works and want to get…
Apr 27 • Davis Blalock
2023-4-16 arXiv roundup: Segment Anything, Everything, and Everything everywhere all at once
Yes, in a two-week period we actually got papers named: Segment Anything SegGPT: Segmenting Everything In Context Segment Everything Everywhere All at…
Apr 20 • Davis Blalock
2023-4-2 arXiv roundup: LLMs improving LLM output, BloombergGPT, LLM opinions differ from humans
This newsletter made possible by MosaicML. Thanks to Charlie Blake for the Twitter shoutout this week! Training Language Models with Language Feedback…
Apr 6 • Davis Blalock
March 2023
2023-3-26 arXiv roundup: Unit scaling, Origins of power laws, Removing text watermarks
This newsletter made possible by MosaicML. Unit Scaling: Out-of-the-Box Low-Precision Training They make all the weights, activations, and gradients…
Mar 29 • Davis Blalock
2023-3-19 arXiv roundup: GPT-4, Data deduplication, MoE optimizations
This newsletter made possible by MosaicML. GPT-4 Technical Report This is a 98-page document, so we’re just gonna go through some highlights. First…
Mar 22 • Davis Blalock
2023-3-12 arXiv roundup: Pretraining BERT for $20, GigaGAN, Multimodal LLMs
This newsletter made possible by MosaicML. Pretraining BERT from Scratch for $20 We trained an optimized BERT model to match the results from the…
Mar 15 • Davis Blalock
2023-3-5 arXiv roundup: 5-bit training, What pretraining data to use, Expanding training sets via ML
This newsletter made possible by MosaicML (look at our shiny new website). Full Stack Optimization of Transformer Inference: a Survey Besides having a…
Mar 9 • Davis Blalock
2023-2-26 arXiv roundup: RLHF for diffusion, Multimodal chain of thought, Practical data poisoning
This newsletter made possible by MosaicML. Poisoning Web-Scale Training Datasets is Practical You can poison public datasets by buying domains …
Mar 1 • Davis Blalock