Alert button
Picture for Tian Ding

Tian Ding

Alert button

Why Transformers Need Adam: A Hessian Perspective

Add code
Bookmark button
Alert button
Feb 26, 2024
Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo

Viaarxiv icon

Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization

Add code
Bookmark button
Alert button
Apr 23, 2022
Huiyuan Yang, Tian Ding, Xiaojun Yuan

Figure 1 for Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization
Figure 2 for Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization
Figure 3 for Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization
Figure 4 for Federated Learning with Lossy Distributed Source Coding: Analysis and Optimization
Viaarxiv icon

The Global Landscape of Neural Networks: An Overview

Add code
Bookmark button
Alert button
Jul 02, 2020
Ruoyu Sun, Dawei Li, Shiyu Liang, Tian Ding, R Srikant

Figure 1 for The Global Landscape of Neural Networks: An Overview
Figure 2 for The Global Landscape of Neural Networks: An Overview
Figure 3 for The Global Landscape of Neural Networks: An Overview
Figure 4 for The Global Landscape of Neural Networks: An Overview
Viaarxiv icon

Sub-Optimal Local Minima Exist for Almost All Over-parameterized Neural Networks

Add code
Bookmark button
Alert button
Nov 04, 2019
Tian Ding, Dawei Li, Ruoyu Sun

Figure 1 for Sub-Optimal Local Minima Exist for Almost All Over-parameterized Neural Networks
Viaarxiv icon

Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations

Add code
Bookmark button
Alert button
Dec 28, 2018
Dawei Li, Tian Ding, Ruoyu Sun

Figure 1 for Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations
Viaarxiv icon