Alert button
Picture for Yu Cheng

Yu Cheng

Alert button

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Add code
Bookmark button
Alert button
Oct 05, 2020
Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Figure 1 for InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Figure 2 for InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Figure 3 for InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Figure 4 for InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Viaarxiv icon

Efficient Robust Training via Backward Smoothing

Add code
Bookmark button
Alert button
Oct 03, 2020
Jinghui Chen, Yu Cheng, Zhe Gan, Quanquan Gu, Jingjing Liu

Figure 1 for Efficient Robust Training via Backward Smoothing
Figure 2 for Efficient Robust Training via Backward Smoothing
Figure 3 for Efficient Robust Training via Backward Smoothing
Figure 4 for Efficient Robust Training via Backward Smoothing
Viaarxiv icon

Contrastive Distillation on Intermediate Representations for Language Model Compression

Add code
Bookmark button
Alert button
Sep 29, 2020
Siqi Sun, Zhe Gan, Yu Cheng, Yuwei Fang, Shuohang Wang, Jingjing Liu

Figure 1 for Contrastive Distillation on Intermediate Representations for Language Model Compression
Figure 2 for Contrastive Distillation on Intermediate Representations for Language Model Compression
Figure 3 for Contrastive Distillation on Intermediate Representations for Language Model Compression
Figure 4 for Contrastive Distillation on Intermediate Representations for Language Model Compression
Viaarxiv icon

Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding

Add code
Bookmark button
Alert button
Sep 13, 2020
Shuohang Wang, Luowei Zhou, Zhe Gan, Yen-Chun Chen, Yuwei Fang, Siqi Sun, Yu Cheng, Jingjing Liu

Figure 1 for Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Figure 2 for Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Figure 3 for Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Figure 4 for Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Viaarxiv icon

Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos

Add code
Bookmark button
Alert button
Aug 06, 2020
Xiaoye Qu, Pengwei Tang, Zhikang Zhou, Yu Cheng, Jianfeng Dong, Pan Zhou

Figure 1 for Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Figure 2 for Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Figure 3 for Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Figure 4 for Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Viaarxiv icon

Graph Optimal Transport for Cross-Domain Alignment

Add code
Bookmark button
Alert button
Jun 29, 2020
Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu

Figure 1 for Graph Optimal Transport for Cross-Domain Alignment
Figure 2 for Graph Optimal Transport for Cross-Domain Alignment
Figure 3 for Graph Optimal Transport for Cross-Domain Alignment
Figure 4 for Graph Optimal Transport for Cross-Domain Alignment
Viaarxiv icon

Adaptive Learning Rates with Maximum Variation Averaging

Add code
Bookmark button
Alert button
Jun 21, 2020
Chen Zhu, Yu Cheng, Zhe Gan, Furong Huang, Jingjing Liu, Tom Goldstein

Figure 1 for Adaptive Learning Rates with Maximum Variation Averaging
Figure 2 for Adaptive Learning Rates with Maximum Variation Averaging
Figure 3 for Adaptive Learning Rates with Maximum Variation Averaging
Figure 4 for Adaptive Learning Rates with Maximum Variation Averaging
Viaarxiv icon

Large-Scale Adversarial Training for Vision-and-Language Representation Learning

Add code
Bookmark button
Alert button
Jun 11, 2020
Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu

Figure 1 for Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Figure 2 for Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Figure 3 for Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Figure 4 for Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Viaarxiv icon

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

Add code
Bookmark button
Alert button
May 15, 2020
Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu

Figure 1 for Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
Figure 2 for Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
Figure 3 for Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
Figure 4 for Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
Viaarxiv icon

Low-shot Object Detection via Classification Refinement

Add code
Bookmark button
Alert button
May 06, 2020
Yiting Li, Yu Cheng, Lu Liu, Sichao Tian, Haiyue Zhu, Cheng Xiang, Prahlad Vadakkepat, Cheksing Teo, Tongheng Lee

Figure 1 for Low-shot Object Detection via Classification Refinement
Figure 2 for Low-shot Object Detection via Classification Refinement
Figure 3 for Low-shot Object Detection via Classification Refinement
Figure 4 for Low-shot Object Detection via Classification Refinement
Viaarxiv icon