* We use empirical tools of mode connectivity and SVCCA to investigate
neural network training heuristics of learning rate restarts, warmup and
knowledge distillation. arXiv admin note: text overlap with arXiv:1806.06977 Access Paper or Ask Questions