This newsletter made possible by MosaicML. Thanks to Charlie Blake for the Twitter shoutout this week!

Training Language Models with Language Feedback at Scale

When talking to other humans, we tend to give feedback via language, not binary labels or numeric ratings. Can we get large text models to learn from this sort of free-form language feedback?
2023-4-2 arXiv roundup: LLMs improving LLM output, BloombergGPT, LLM opinions differ from humans
Hi, I am new to your newsletter, and I am very excited for the future.
I really think you should do a compilation post on good papers that lay the foundations of ML/AI. These papers are a good place to start:
1. A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT https://arxiv.org/abs/2302.09419
2. Talking About Large Language Models https://arxiv.org/abs/2212.03551
3. Attention Is All You Need https://arxiv.org/abs/1706.03762
4. A Survey of Large Language Models https://arxiv.org/abs/2303.18223