This newsletter made possible by MosaicML. An Impartial Take to the CNN vs Transformer Robustness Contest A robustness throwdown featuring {ViT, Swin} vs {BiT, ConvNeXt} evaluated on various tasks. First, they measure the learning of spurious features using datasets designed to assess simplicity bias, background bias, and texture bias. The transformers and the CNNs behave similarly.
2022-7-31 arXiv roundup: Transformer vs CNN showdown, 1000x smaller DLRM, ELECTRA improvements
2022-7-31 arXiv roundup: Transformer vs CNN…
2022-7-31 arXiv roundup: Transformer vs CNN showdown, 1000x smaller DLRM, ELECTRA improvements
This newsletter made possible by MosaicML. An Impartial Take to the CNN vs Transformer Robustness Contest A robustness throwdown featuring {ViT, Swin} vs {BiT, ConvNeXt} evaluated on various tasks. First, they measure the learning of spurious features using datasets designed to assess simplicity bias, background bias, and texture bias. The transformers and the CNNs behave similarly.