Picture for Bao Wang

Bao Wang

Adaptive and Implicit Regularization for Matrix Completion

Add code
Aug 11, 2022
Figure 1 for Adaptive and Implicit Regularization for Matrix Completion
Figure 2 for Adaptive and Implicit Regularization for Matrix Completion
Figure 3 for Adaptive and Implicit Regularization for Matrix Completion
Figure 4 for Adaptive and Implicit Regularization for Matrix Completion
Viaarxiv icon

Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization

Add code
Aug 01, 2022
Figure 1 for Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
Figure 2 for Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
Figure 3 for Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
Figure 4 for Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
Viaarxiv icon

Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

Add code
Apr 19, 2022
Figure 1 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 2 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 3 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Figure 4 for Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Viaarxiv icon

Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs

Add code
Feb 24, 2022
Figure 1 for Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs
Figure 2 for Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs
Figure 3 for Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs
Figure 4 for Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs
Viaarxiv icon

glassoformer: a query-sparse transformer for post-fault power grid voltage prediction

Add code
Jan 22, 2022
Figure 1 for glassoformer: a query-sparse transformer for post-fault power grid voltage prediction
Figure 2 for glassoformer: a query-sparse transformer for post-fault power grid voltage prediction
Figure 3 for glassoformer: a query-sparse transformer for post-fault power grid voltage prediction
Figure 4 for glassoformer: a query-sparse transformer for post-fault power grid voltage prediction
Viaarxiv icon

Efficient and Reliable Overlay Networks for Decentralized Federated Learning

Add code
Dec 12, 2021
Figure 1 for Efficient and Reliable Overlay Networks for Decentralized Federated Learning
Figure 2 for Efficient and Reliable Overlay Networks for Decentralized Federated Learning
Figure 3 for Efficient and Reliable Overlay Networks for Decentralized Federated Learning
Figure 4 for Efficient and Reliable Overlay Networks for Decentralized Federated Learning
Viaarxiv icon

How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies

Add code
Oct 19, 2021
Figure 1 for How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Figure 2 for How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Figure 3 for How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Figure 4 for How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Viaarxiv icon

Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization

Add code
Oct 18, 2021
Figure 1 for Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization
Figure 2 for Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization
Figure 3 for Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization
Figure 4 for Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization
Viaarxiv icon

Heavy Ball Neural Ordinary Differential Equations

Add code
Oct 10, 2021
Figure 1 for Heavy Ball Neural Ordinary Differential Equations
Figure 2 for Heavy Ball Neural Ordinary Differential Equations
Figure 3 for Heavy Ball Neural Ordinary Differential Equations
Figure 4 for Heavy Ball Neural Ordinary Differential Equations
Viaarxiv icon

FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention

Add code
Aug 05, 2021
Figure 1 for FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
Figure 2 for FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
Figure 3 for FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
Figure 4 for FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
Viaarxiv icon