2021-8-29 arXiv roundup
"Understanding and Co-designing the Data Ingestion Pipeline for Industry-Scale RecSys Training" (https://arxiv.org/abs/2108.09373) Facebook talks about their system for data preprocessing.
"Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study" (https://arxiv.org/abs/2108.10521) "...it is difficult to disentangle the advantages brought by a deep GNN architecture from those "tricks" necessary to train such an architecture. Moreover...[there's a] lack of a standardized benchmark with fair and consistent experimental settings...we discover the combo of superior training tricks, that lead us to attain the new state-of-the-art results for deep GCNs"
"edge-SR: Super-Resolution For The Masses" (https://arxiv.org/abs/2108.10335) People do super-resolution either with simple upsampling algorithms or with big neural nets. Why not something in between that's still practical for wimpy devices?
Learning image codecs designed for machine consumption (https://arxiv.org/abs/2108.09992, https://arxiv.org/abs/2108.09993). JPEG, etc., assume that images are for humans to look at, not for neural nets to train on. You can optimize for the latter to get a slightly better rate-distortion tradeoff.