2023-1-15 arXiv roundup: Way faster BERT…

Davis Blalock

Jan 16, 2023

This newsletter made possible by MosaicML.

Read →

2 Comments

Jan Brauner

Jan 16, 2023

Hi David,

If you liked the parametrisation in the Tracr, you'll LOVE Anthropic's "A Mathematical Framework for Transformer Circuits". This is where I've first seen this parametrisation, and it also contains some other nuggets for describing transformers in more intuitive and useful terms.

https://transformer-circuits.pub/2021/framework/index.html

Best,

Jan

Expand full comment

Chintan Shah

Jan 16, 2023

Hi David,

Thanks for your newsletter. I had one piece of feedback, is there a way to add a table of contents at the top in substack? Line or breaks betweenpapers are very hard to see in substacks at the moment.

Thanks

Expand full comment

Davis Summarizes Papers

2023-1-15 arXiv roundup: Way faster BERT…