2021-8-29 arXiv roundup

May 03, 2022

Understanding and Co-designing the Data Ingestion Pipeline for Industry-Scale RecSys Training

Facebook describes its data ingestion and preprocessing pipeline for training recommender systems at industry scale.

Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study

"...it is difficult to disentangle the advantages brought by a deep GNN architecture from those "tricks" necessary to train such an architecture. Moreover...[there's a] lack of a standardized benchmark with fair and consistent experimental settings...we discover the combo of superior training tricks, that lead us to attain the new state-of-the-art results for deep GCNs"

edge-SR: Super-Resolution For The Masses

People do super-resolution either with simple upsampling algorithms or with big neural nets. Why not something in between that's still practical for wimpy devices?
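To make the two ends of that spectrum concrete, here is a minimal sketch, not the paper's actual architecture. It assumes one plausible "in between" design: a tiny learned layer predicts `scale * scale` sub-pixels per input pixel, and a depth-to-space rearrangement assembles them into the larger image. The function names are hypothetical, and the learned layer itself is left out.

```python
import numpy as np


def nearest_neighbor_upsample(img: np.ndarray, scale: int) -> np.ndarray:
    """Simple upsampling: repeat each pixel of an (H, W) image scale x scale times."""
    return np.repeat(np.repeat(img, scale, axis=0), scale, axis=1)


def depth_to_space(features: np.ndarray, scale: int) -> np.ndarray:
    """Rearrange (H, W, scale*scale) features into an (H*scale, W*scale) image.

    Channel k = a*scale + b of input pixel (i, j) becomes
    output pixel (i*scale + a, j*scale + b).
    """
    h, w, c = features.shape
    if c != scale * scale:
        raise ValueError("expected scale*scale channels")
    blocks = features.reshape(h, w, scale, scale)
    # Reorder axes to (h, a, w, b) so each block sits next to its spatial axis.
    return blocks.transpose(0, 2, 1, 3).reshape(h * scale, w * scale)


# If every channel just copies its input pixel, depth-to-space reduces to
# nearest-neighbor upsampling; a learned layer would predict better sub-pixels.
img = np.arange(6, dtype=np.float32).reshape(2, 3)
copied = np.repeat(img[:, :, None], 4, axis=2)
upscaled = depth_to_space(copied, scale=2)
```

The point of the sketch is that the rearrangement step is nearly free, so all the cost sits in whatever small layer produces the `scale * scale` channels.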

Learning image codecs designed for machine consumption

Split across https://arxiv.org/abs/2108.09992 and https://arxiv.org/abs/2108.09993. JPEG and similar codecs assume that data is for humans to look at, not for neural nets to train on. You can instead optimize for the latter to get a slightly better rate-distortion tradeoff.
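As a rough sketch of what optimizing for machines could mean, assuming the usual rate-distortion objective for learned codecs (the symbols below are generic and not taken from either paper): a codec for humans minimizes bitrate plus a pixel-level distortion, while a codec for machines swaps the distortion term for the loss of the downstream network.

```latex
\mathcal{L}_{\text{human}} = R(\hat{y}) + \lambda \, d(x, \hat{x}),
\qquad
\mathcal{L}_{\text{machine}} = R(\hat{y}) + \lambda \, \ell_{\text{task}}\big(f(\hat{x}),\, t\big)
```

Here $R(\hat{y})$ is the bitrate of the compressed representation, $\hat{x}$ is the decoded image, $d$ is a pixel distortion such as mean squared error, $f$ is the downstream network with target $t$, and $\lambda$ sets the tradeoff between rate and distortion.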

Davis Summarizes Papers
