Weikang Wang

Multilingual Sentence Transformer as A Multilingual Word Aligner

Jan 28, 2023
Weikang Wang, Guanhua Chen, Hanqing Wang, Yue Han, Yun Chen

Multilingual pretrained language models (mPLMs) have shown their effectiveness in multilingual word alignment induction. However, these methods usually start from mBERT or XLM-R. In this paper, we investigate whether the multilingual sentence Transformer LaBSE is a strong multilingual word aligner. This idea is non-trivial, as LaBSE is trained to learn language-agnostic sentence-level embeddings, while the alignment extraction task requires the more fine-grained word-level embeddings to be language-agnostic. We demonstrate that vanilla LaBSE outperforms the other mPLMs currently used for the alignment task, and then propose to finetune LaBSE on a parallel corpus for further improvement. Experimental results on seven language pairs show that our best aligner outperforms previous state-of-the-art models of all varieties. In addition, our aligner supports different language pairs in a single model, and even achieves a new state of the art on zero-shot language pairs that do not appear in the finetuning process.

* Published at Findings of EMNLP 2022 
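
As a rough illustration of how word alignments can be read off word-level embeddings, the sketch below keeps a source-target pair only when each word is the other's nearest neighbour under cosine similarity. The abstract does not say which extraction rule the authors use, so this bidirectional-argmax rule, the function names, and the toy vectors are all assumptions; in practice the vectors would come from a model such as LaBSE.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def align(src_vecs, tgt_vecs):
    """Return sorted (src_index, tgt_index) pairs that are mutual nearest neighbours."""
    sim = [[cosine(s, t) for t in tgt_vecs] for s in src_vecs]
    n_src, n_tgt = len(src_vecs), len(tgt_vecs)
    # Best target word for every source word.
    forward = {(i, max(range(n_tgt), key=lambda j: sim[i][j])) for i in range(n_src)}
    # Best source word for every target word.
    backward = {(max(range(n_src), key=lambda i: sim[i][j]), j) for j in range(n_tgt)}
    # Keep only the pairs both directions agree on.
    return sorted(forward & backward)

# Toy word embeddings (hypothetical values, not model output).
src = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
tgt = [[0.0, 0.9, 0.1], [0.9, 0.1, 0.0], [0.1, 0.0, 0.9]]
alignment = align(src, tgt)
print(alignment)
```

Taking the intersection of the two directions favours precision over recall: a word whose best match does not point back to it is left unaligned.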

DS-Net: Dynamic Spatiotemporal Network for Video Salient Object Detection

Dec 09, 2020
Yuting Su, Weikang Wang, Jing Liu, Peiguang Jing, Xiaokang Yang

Because moving objects attract more human visual attention, temporal motion information is commonly exploited alongside spatial information to detect salient objects in videos. Although efficient tools such as optical flow have been proposed to extract temporal motion information, they often encounter difficulties when used for saliency detection because of camera movement or the partial movement of salient objects. In this paper, we investigate the complementary roles of spatial and temporal information and propose a novel dynamic spatiotemporal network (DS-Net) for more effective fusion of spatiotemporal information. We construct a symmetric two-bypass network to explicitly extract spatial and temporal features. A dynamic weight generator (DWG) is designed to automatically learn the reliability of the corresponding saliency branch, and a top-down cross attentive aggregation (CAA) procedure is designed to facilitate dynamic complementary aggregation of spatiotemporal features. Finally, the features are modified by spatial attention under the guidance of a coarse saliency map and then pass through the decoder to produce the final saliency map. Experimental results on five benchmarks (VOS, DAVIS, FBMS, SegTrack-v2, and ViSal) demonstrate that the proposed method outperforms state-of-the-art algorithms. The source code is available at https://github.com/TJUMMG/DS-Net.
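
The idea of weighting each branch by a reliability score can be illustrated with a small sketch. This is not the DS-Net implementation (see the repository above for that): the softmax normalization, the function names, and the flat feature lists are assumptions made for illustration, and in the paper the scores are learned by the DWG rather than passed in by hand.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fuse(spatial, temporal, spatial_score, temporal_score):
    """Blend two equal-length feature vectors using reliability scores.

    A higher score gives the corresponding branch more influence on the
    fused features; equal scores reduce to a plain average.
    """
    w_spatial, w_temporal = softmax([spatial_score, temporal_score])
    return [w_spatial * s + w_temporal * t for s, t in zip(spatial, temporal)]

# Equal reliability: the fused features are the element-wise mean.
balanced = fuse([1.0, 0.0], [0.0, 1.0], 0.0, 0.0)

# Unreliable motion cue (e.g. strong camera movement): spatial features dominate.
spatial_heavy = fuse([1.0, 0.0], [0.0, 1.0], 2.0, -2.0)
print(balanced, spatial_heavy)
```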

Are You for Real? Detecting Identity Fraud via Dialogue Interactions

Aug 19, 2019
Weikang Wang, Jiajun Zhang, Qian Li, Chengqing Zong, Zhifei Li

Identity fraud detection is of great importance in many real-world scenarios such as the financial industry. However, few studies have addressed this problem. In this paper, we focus on identity fraud detection in loan applications and propose to solve this problem with a novel interactive dialogue system that consists of two modules. One is the knowledge graph (KG) constructor, which organizes the personal information of each loan applicant. The other is structured dialogue management, which dynamically generates a series of questions based on the personal KG to ask the applicants and determines their identity states. We also present a heuristic user simulator based on problem analysis to evaluate our method. Experiments show that the trainable dialogue system can effectively detect fraudsters and achieves higher recognition accuracy than rule-based systems. Furthermore, our learned dialogue strategies are interpretable and flexible, which can help promote real-world applications.

* EMNLP-IJCNLP 2019 

Incremental Learning from Scratch for Task-Oriented Dialogue Systems

Jun 12, 2019
Weikang Wang, Jiajun Zhang, Qian Li, Mei-Yuh Hwang, Chengqing Zong, Zhifei Li

Clarifying user needs is essential for existing task-oriented dialogue systems. However, in real-world applications, developers can never guarantee that all possible user demands are taken into account in the design phase. Consequently, existing systems break down when they encounter unconsidered user needs. To address this problem, we propose a novel incremental learning framework for designing task-oriented dialogue systems, the Incremental Dialogue System (IDS), which does not require pre-defining an exhaustive list of user needs. Specifically, we introduce an uncertainty estimation module to evaluate the confidence of giving correct responses. If the confidence is high, IDS responds to the user. Otherwise, humans are brought into the dialogue process, and IDS learns from the human intervention through an online learning module. To evaluate our method, we propose a new dataset that simulates unanticipated user needs in the deployment stage. Experiments show that IDS is robust to unconsidered user actions and can update itself online by selecting only the most effective training data, thereby attaining better performance at a lower annotation cost.

* ACL 2019
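
The respond-or-defer loop described in the abstract can be sketched as follows. Using the highest response probability as the confidence score, the 0.7 threshold, and every name in the code are assumptions for illustration; the abstract does not describe how the uncertainty estimation module actually computes confidence.

```python
from typing import Callable, Dict, List, Tuple

def respond(
    context: str,
    response_probs: Dict[str, float],
    ask_human: Callable[[str], str],
    buffer: List[Tuple[str, str]],
    threshold: float = 0.7,
) -> str:
    """Answer automatically when confident, otherwise defer to a human.

    `response_probs` maps candidate responses to model probabilities.
    Human answers are stored in `buffer` so that an online learner
    (not shown here) can later train on them.
    """
    best, confidence = max(response_probs.items(), key=lambda kv: kv[1])
    if confidence >= threshold:
        return best
    # Low confidence: hand the turn to a human and keep the example.
    answer = ask_human(context)
    buffer.append((context, answer))
    return answer

# Hypothetical usage with a stand-in for the human annotator.
examples: List[Tuple[str, str]] = []
human = lambda ctx: "Let me check that for you."
confident = respond("Where is my order?", {"It ships today.": 0.9, "Sorry?": 0.1}, human, examples)
deferred = respond("Can I pay in crypto?", {"It ships today.": 0.5, "Sorry?": 0.5}, human, examples)
print(confident, deferred, examples)
```

Only the deferred turns reach the buffer, which mirrors the abstract's point that annotation effort is spent on the cases the system cannot yet handle.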