Picture for Pengcheng He

Pengcheng He

Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs

Add code
Feb 14, 2022
Viaarxiv icon

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention

Add code
Dec 14, 2021
Figure 1 for Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Figure 2 for Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Figure 3 for Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Figure 4 for Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Viaarxiv icon

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

Add code
Nov 18, 2021
Figure 1 for DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Figure 2 for DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Figure 3 for DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Figure 4 for DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Viaarxiv icon

ARCH: Efficient Adversarial Regularized Training with Caching

Add code
Sep 15, 2021
Figure 1 for ARCH: Efficient Adversarial Regularized Training with Caching
Figure 2 for ARCH: Efficient Adversarial Regularized Training with Caching
Figure 3 for ARCH: Efficient Adversarial Regularized Training with Caching
Figure 4 for ARCH: Efficient Adversarial Regularized Training with Caching
Viaarxiv icon

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization

Add code
Jun 08, 2021
Figure 1 for Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Figure 2 for Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Figure 3 for Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Figure 4 for Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Viaarxiv icon

Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach

Add code
Apr 11, 2021
Figure 1 for Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach
Figure 2 for Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach
Figure 3 for Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach
Figure 4 for Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach
Viaarxiv icon

Token-wise Curriculum Learning for Neural Machine Translation

Add code
Mar 20, 2021
Figure 1 for Token-wise Curriculum Learning for Neural Machine Translation
Figure 2 for Token-wise Curriculum Learning for Neural Machine Translation
Figure 3 for Token-wise Curriculum Learning for Neural Machine Translation
Figure 4 for Token-wise Curriculum Learning for Neural Machine Translation
Viaarxiv icon

Greedy Multi-step Off-Policy Reinforcement Learning

Add code
Mar 07, 2021
Figure 1 for Greedy Multi-step Off-Policy Reinforcement Learning
Figure 2 for Greedy Multi-step Off-Policy Reinforcement Learning
Figure 3 for Greedy Multi-step Off-Policy Reinforcement Learning
Figure 4 for Greedy Multi-step Off-Policy Reinforcement Learning
Viaarxiv icon

Reader-Guided Passage Reranking for Open-Domain Question Answering

Add code
Jan 01, 2021
Figure 1 for Reader-Guided Passage Reranking for Open-Domain Question Answering
Figure 2 for Reader-Guided Passage Reranking for Open-Domain Question Answering
Figure 3 for Reader-Guided Passage Reranking for Open-Domain Question Answering
Figure 4 for Reader-Guided Passage Reranking for Open-Domain Question Answering
Viaarxiv icon

UnitedQA: A Hybrid Approach for Open Domain Question Answering

Add code
Jan 01, 2021
Figure 1 for UnitedQA: A Hybrid Approach for Open Domain Question Answering
Figure 2 for UnitedQA: A Hybrid Approach for Open Domain Question Answering
Figure 3 for UnitedQA: A Hybrid Approach for Open Domain Question Answering
Figure 4 for UnitedQA: A Hybrid Approach for Open Domain Question Answering
Viaarxiv icon