Kunbo Ding

A Simple and Effective Method to Improve Zero-Shot Cross-Lingual Transfer Learning

Oct 18, 2022

Kunbo Ding, Weijie Liu, Yuejian Fang, Weiquan Mao, Zhe Zhao, Tao Zhu, Haoyan Liu, Rong Tian, Yiren Chen

Abstract: Existing zero-shot cross-lingual transfer methods rely on parallel corpora or bilingual dictionaries, which are expensive and impractical for low-resource languages. To remove these dependencies, researchers have explored training multilingual models on English-only resources and transferring them to low-resource languages. However, the effect of this approach is limited by the gap between the embedding clusters of different languages. To address this issue, we propose Embedding-Push, Attention-Pull, and Robust targets to transfer English embeddings to virtual multilingual embeddings without semantic loss, thereby improving cross-lingual transferability. Experimental results on mBERT and XLM-R demonstrate that our method significantly outperforms previous works on the zero-shot cross-lingual text classification task and obtains better multilingual alignment.

* Published at COLING 2022

Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching

Sep 13, 2022

Kunbo Ding, Weijie Liu, Yuejian Fang, Zhe Zhao, Qi Ju, Xuefeng Yang

Abstract: Previous studies have shown that cross-lingual knowledge distillation can significantly improve the performance of pre-trained models on cross-lingual similarity matching tasks. However, the student model needs to be large for this to work; otherwise, its performance drops sharply, which makes it impractical to deploy on memory-limited devices. To address this issue, we delve into cross-lingual knowledge distillation and propose a multi-stage distillation framework for constructing a small but high-performance cross-lingual model. In our framework, contrastive learning, bottleneck, and parameter recurrent strategies are combined to prevent performance from being compromised during compression. The experimental results demonstrate that our method can compress the size of XLM-R and MiniLM by more than 50%, while performance is reduced by only about 1%.

* Published at Findings of NAACL, 2022
