Alert button
Picture for Chen Zhu

Chen Zhu

Alert button

Long-Short Transformer: Efficient Transformers for Language and Vision

Add code
Bookmark button
Alert button
Jul 27, 2021
Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro

Figure 1 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 2 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 3 for Long-Short Transformer: Efficient Transformers for Language and Vision
Figure 4 for Long-Short Transformer: Efficient Transformers for Language and Vision
Viaarxiv icon

A Field Guide to Federated Optimization

Add code
Bookmark button
Alert button
Jul 14, 2021
Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, Satyen Kale, Sai Praneeth Karimireddy, Jakub Konecny, Sanmi Koyejo, Tian Li, Luyang Liu, Mehryar Mohri, Hang Qi, Sashank J. Reddi, Peter Richtarik, Karan Singhal, Virginia Smith, Mahdi Soltanolkotabi, Weikang Song, Ananda Theertha Suresh, Sebastian U. Stich, Ameet Talwalkar, Hongyi Wang, Blake Woodworth, Shanshan Wu, Felix X. Yu, Honglin Yuan, Manzil Zaheer, Mi Zhang, Tong Zhang, Chunxiang Zheng, Chen Zhu, Wennan Zhu

Figure 1 for A Field Guide to Federated Optimization
Figure 2 for A Field Guide to Federated Optimization
Figure 3 for A Field Guide to Federated Optimization
Figure 4 for A Field Guide to Federated Optimization
Viaarxiv icon

Inductive Predictions of Extreme Hydrologic Events in The Wabash River Watershed

Add code
Bookmark button
Alert button
Apr 25, 2021
Nicholas Majeske, Bidisha Abesh, Chen Zhu, Ariful Azad

Figure 1 for Inductive Predictions of Extreme Hydrologic Events in The Wabash River Watershed
Figure 2 for Inductive Predictions of Extreme Hydrologic Events in The Wabash River Watershed
Figure 3 for Inductive Predictions of Extreme Hydrologic Events in The Wabash River Watershed
Figure 4 for Inductive Predictions of Extreme Hydrologic Events in The Wabash River Watershed
Viaarxiv icon

The Intrinsic Dimension of Images and Its Impact on Learning

Add code
Bookmark button
Alert button
Apr 18, 2021
Phillip Pope, Chen Zhu, Ahmed Abdelkader, Micah Goldblum, Tom Goldstein

Figure 1 for The Intrinsic Dimension of Images and Its Impact on Learning
Figure 2 for The Intrinsic Dimension of Images and Its Impact on Learning
Figure 3 for The Intrinsic Dimension of Images and Its Impact on Learning
Figure 4 for The Intrinsic Dimension of Images and Its Impact on Learning
Viaarxiv icon

GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training

Add code
Bookmark button
Alert button
Feb 16, 2021
Chen Zhu, Renkun Ni, Zheng Xu, Kezhi Kong, W. Ronny Huang, Tom Goldstein

Figure 1 for GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Figure 2 for GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Figure 3 for GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Figure 4 for GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Viaarxiv icon

Modifying Memories in Transformer Models

Add code
Bookmark button
Alert button
Dec 01, 2020
Chen Zhu, Ankit Singh Rawat, Manzil Zaheer, Srinadh Bhojanapalli, Daliang Li, Felix Yu, Sanjiv Kumar

Figure 1 for Modifying Memories in Transformer Models
Figure 2 for Modifying Memories in Transformer Models
Figure 3 for Modifying Memories in Transformer Models
Viaarxiv icon

Spherical Knowledge Distillation

Add code
Bookmark button
Alert button
Oct 29, 2020
Jia Guo, Minghao Chen, Yao Hu, Chen Zhu, Xiaofei He, Deng Cai

Figure 1 for Spherical Knowledge Distillation
Figure 2 for Spherical Knowledge Distillation
Figure 3 for Spherical Knowledge Distillation
Figure 4 for Spherical Knowledge Distillation
Viaarxiv icon

Are Adversarial Examples Created Equal? A Learnable Weighted Minimax Risk for Robustness under Non-uniform Attacks

Add code
Bookmark button
Alert button
Oct 24, 2020
Huimin Zeng, Chen Zhu, Tom Goldstein, Furong Huang

Figure 1 for Are Adversarial Examples Created Equal? A Learnable Weighted Minimax Risk for Robustness under Non-uniform Attacks
Figure 2 for Are Adversarial Examples Created Equal? A Learnable Weighted Minimax Risk for Robustness under Non-uniform Attacks
Figure 3 for Are Adversarial Examples Created Equal? A Learnable Weighted Minimax Risk for Robustness under Non-uniform Attacks
Figure 4 for Are Adversarial Examples Created Equal? A Learnable Weighted Minimax Risk for Robustness under Non-uniform Attacks
Viaarxiv icon

FLAG: Adversarial Data Augmentation for Graph Neural Networks

Add code
Bookmark button
Alert button
Oct 19, 2020
Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein

Figure 1 for FLAG: Adversarial Data Augmentation for Graph Neural Networks
Figure 2 for FLAG: Adversarial Data Augmentation for Graph Neural Networks
Figure 3 for FLAG: Adversarial Data Augmentation for Graph Neural Networks
Figure 4 for FLAG: Adversarial Data Augmentation for Graph Neural Networks
Viaarxiv icon