Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chufan Wu

Mitigating Knowledge Conflicts in Language Model-Driven Question Answering

Nov 18, 2024

Han Cao, Zhaoyang Zhang, Xiangtian Li, Chufan Wu, Hansong Zhang, Wenqing Zhang

Abstract:Knowledge-aware sequence to sequence generation tasks such as document question answering and abstract summarization typically requires two types of knowledge: encoded parametric knowledge and retrieved contextual information. Previous work show improper correlation between parametric knowledge and answers in the training set could cause the model ignore input information at test time, resulting in un-desirable model behaviour such as over-stability and hallucination. In this work, we argue that hallucination could be mitigated via explicit correlation between input source and generated content. We focus on a typical example of hallucination, entity-based knowledge conflicts in question answering, where correlation of entities and their description at training time hinders model behaviour during inference.

Via

Access Paper or Ask Questions

Reinforced Axial Refinement Network for Monocular 3D Object Detection

Aug 31, 2020

Lijie Liu, Chufan Wu, Jiwen Lu, Lingxi Xie, Jie Zhou, Qi Tian

Figure 1 for Reinforced Axial Refinement Network for Monocular 3D Object Detection

Figure 2 for Reinforced Axial Refinement Network for Monocular 3D Object Detection

Figure 3 for Reinforced Axial Refinement Network for Monocular 3D Object Detection

Figure 4 for Reinforced Axial Refinement Network for Monocular 3D Object Detection

Abstract:Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image. This is an ill-posed problem with a major difficulty lying in the information loss by depth-agnostic cameras. Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space. To improve the efficiency of sampling, we propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step. This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it. The proposed framework, Reinforced Axial Refinement Network (RAR-Net), serves as a post-processing stage which can be freely integrated into existing monocular 3D detection methods, and improve the performance on the KITTI dataset with small extra computational costs.

* Accepted by ECCV 2020

Via

Access Paper or Ask Questions

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Apr 08, 2020

Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Yaming Yang, Quanlu Zhang, Yunhai Tong, Jing Bai

Figure 1 for LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Figure 2 for LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Figure 3 for LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Figure 4 for LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Abstract:BERT is a cutting-edge language representation model pre-trained by a large corpus, which achieves superior performances on various natural language understanding tasks. However, a major blocking issue of applying BERT to online services is that it is memory-intensive and leads to unsatisfactory latency of user requests, raising the necessity of model compression. Existing solutions leverage the knowledge distillation framework to learn a smaller model that imitates the behaviors of BERT. However, the training procedure of knowledge distillation is expensive itself as it requires sufficient training data to imitate the teacher model. In this paper, we address this issue by proposing a hybrid solution named LadaBERT (Lightweight adaptation of BERT through hybrid model compression), which combines the advantages of different model compression methods, including weight pruning, matrix factorization and knowledge distillation. LadaBERT achieves state-of-the-art accuracy on various public datasets while the training overheads can be reduced by an order of magnitude.

Via

Access Paper or Ask Questions