2023-7-9 arXiv roundup: LLMs ignore the middle of their context, MoE + instruction tuning rocks
dblalock.substack.com
This newsletter made possible by MosaicML. A mini-announcement Because it’s gotten increasingly difficult to find technical ML content in all the AI noise, I hereby announce a new (experimental) policy: If you write a Twitter thread explaining a paper,
2023-7-9 arXiv roundup: LLMs ignore the middle of their context, MoE + instruction tuning rocks
2023-7-9 arXiv roundup: LLMs ignore the…
2023-7-9 arXiv roundup: LLMs ignore the middle of their context, MoE + instruction tuning rocks
This newsletter made possible by MosaicML. A mini-announcement Because it’s gotten increasingly difficult to find technical ML content in all the AI noise, I hereby announce a new (experimental) policy: If you write a Twitter thread explaining a paper,