Akari Asai

One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval

Jul 26, 2021

Akari Asai, Xinyan Yu, Jungo Kasai, Hannaneh Hajishirzi

Abstract: We present CORA, a Cross-lingual Open-Retrieval Answer Generation model that can answer questions across many languages even when language-specific annotated data or knowledge sources are unavailable. We introduce a new dense passage retrieval algorithm that is trained to retrieve documents across languages for a question. Combined with a multilingual autoregressive generation model, CORA answers directly in the target language without any translation or in-language retrieval modules as used in prior work. We propose an iterative training method that automatically extends annotated data available only in high-resource languages to low-resource ones. Our results show that CORA substantially outperforms the previous state of the art on multilingual open question answering benchmarks across 26 languages, 9 of which are unseen during training. Our analyses show the significance of cross-lingual retrieval and generation in many languages, particularly under low-resource settings.

* Our code and trained model are publicly available at https://github.com/AkariAsai/CORA
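
A minimal sketch of the retrieval idea described in the abstract above, assuming questions and passages from all languages are embedded in one shared vector space and scored by inner product. The hand-written toy vectors stand in for the output of a multilingual encoder; the `retrieve` function, the passage records, and the vectors are hypothetical and are not taken from the CORA code base.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def retrieve(question_vec, passages, top_k):
    """Rank passages from every language by inner product with the question."""
    ranked = sorted(passages,
                    key=lambda p: dot(question_vec, p["vec"]),
                    reverse=True)
    return ranked[:top_k]


# Toy passage pool mixing languages (vectors are made up for illustration).
passages = [
    {"lang": "en", "vec": [0.8, 0.2, 0.1]},
    {"lang": "ja", "vec": [0.1, 0.9, 0.0]},
    {"lang": "es", "vec": [0.5, 0.5, 0.0]},
]

# Pretend this is the embedding of a question asked in Japanese.
question_vec = [0.9, 0.1, 0.0]

top = retrieve(question_vec, passages, top_k=2)
top_langs = [p["lang"] for p in top]
```

There is deliberately no language filter in `retrieve`: a question in one language may be matched with passages written in another, and a separate generator would then produce the answer in the question's language.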

Efficient Passage Retrieval with Hashing for Open-domain Question Answering

Jun 02, 2021

Ikuya Yamada, Akari Asai, Hannaneh Hajishirzi

Abstract: Most state-of-the-art open-domain question answering systems use a neural retrieval model to encode passages into continuous vectors and extract them from a knowledge source. However, such retrieval models often require large memory to run because of the massive size of their passage index. In this paper, we introduce Binary Passage Retriever (BPR), a memory-efficient neural retrieval model that integrates a learning-to-hash technique into the state-of-the-art Dense Passage Retriever (DPR) to represent the passage index using compact binary codes rather than continuous vectors. BPR is trained with a multi-task objective over two tasks: efficient candidate generation based on binary codes and accurate reranking based on continuous vectors. Compared with DPR, BPR substantially reduces the memory cost from 65GB to 2GB without a loss of accuracy on two standard open-domain question answering benchmarks: Natural Questions and TriviaQA. Our code and trained models are available at https://github.com/studio-ousia/bpr.

* ACL 2021
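
The two-stage scheme in the abstract above (candidate generation on binary codes, then reranking with continuous vectors) can be sketched roughly as follows. This is a toy illustration, not the BPR implementation: sign-based binarization, scoring the continuous query against binary passage codes during reranking, and all function names are assumptions. The size check at the end also assumes an index of 21,015,324 passages with 768-dimensional embeddings, figures that do not appear in the abstract.

```python
def to_binary_code(vec):
    """Binarize a vector by sign: +1 where the value is >= 0, else -1."""
    return [1 if v >= 0 else -1 for v in vec]


def hamming(a, b):
    """Number of positions where two +/-1 codes differ."""
    return sum(x != y for x, y in zip(a, b))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def retrieve(query_vec, passage_codes, n_candidates, top_k):
    # Stage 1: cheap candidate generation by Hamming distance on binary codes.
    q_code = to_binary_code(query_vec)
    by_hamming = sorted(range(len(passage_codes)),
                        key=lambda i: hamming(q_code, passage_codes[i]))
    candidates = by_hamming[:n_candidates]
    # Stage 2: rerank only the candidates using the continuous query vector.
    reranked = sorted(candidates,
                      key=lambda i: dot(query_vec, passage_codes[i]),
                      reverse=True)
    return reranked[:top_k]


def index_size_gb(num_passages, dim, bits_per_dim):
    """Index size in decimal gigabytes."""
    return num_passages * dim * bits_per_dim / 8 / 1e9


passage_vecs = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.6, 0.1, 0.2, -0.9],
    [-0.3, -0.6, 0.7, 0.2],
]
passage_codes = [to_binary_code(v) for v in passage_vecs]
query = [0.8, 0.05, 0.3, -0.6]
result = retrieve(query, passage_codes, n_candidates=2, top_k=1)

# Assumed index shape: 21,015,324 passages x 768 dimensions.
float_index_gb = index_size_gb(21_015_324, 768, 32)   # 32-bit floats
binary_index_gb = index_size_gb(21_015_324, 768, 1)   # 1 bit per dimension
```

Under these assumptions, storing one bit per dimension instead of a 32-bit float shrinks the index by a factor of 32, which is in line with the 65GB to 2GB reduction reported in the abstract.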

MultiModalQA: Complex Question Answering over Text, Tables and Images

Apr 13, 2021

Alon Talmor, Ori Yoran, Amnon Catav, Dan Lahav, Yizhong Wang, Akari Asai, Gabriel Ilharco, Hannaneh Hajishirzi, Jonathan Berant

Abstract: When answering complex questions, people can seamlessly combine information from visual, textual and tabular sources. While interest in models that reason over multiple pieces of evidence has surged in recent years, there has been relatively little work on question answering models that reason across multiple modalities. In this paper, we present MultiModalQA (MMQA): a challenging question answering dataset that requires joint reasoning over text, tables and images. We create MMQA using a new framework for generating complex multi-modal questions at scale, harvesting tables from Wikipedia, and attaching images and text paragraphs using entities that appear in each table. We then define a formal language that allows us to take questions that can be answered from a single modality, and combine them to generate cross-modal questions. Last, crowdsourcing workers take these automatically generated questions and rephrase them into more fluent language. We create 29,918 questions through this procedure, and empirically demonstrate the necessity of a multi-modal multi-hop approach to solve our task: our multi-hop model, ImplicitDecomp, achieves an average F1 of 51.7 over cross-modal questions, substantially outperforming a strong baseline that achieves 38.2 F1, but still lags significantly behind human performance, which is at 90.1 F1.

* ICLR 2021

The Aleatoric Uncertainty Estimation Using a Separate Formulation with Virtual Residuals

Nov 03, 2020

Takumi Kawashima, Qing Yu, Akari Asai, Daiki Ikami, Kiyoharu Aizawa

Abstract: We propose a new optimization framework for aleatoric uncertainty estimation in regression problems. Existing methods can quantify the error in the target estimation, but they tend to underestimate it. To obtain the predictive uncertainty inherent in an observation, we propose a new separable formulation for the estimation of a signal and of its uncertainty, avoiding the effect of overfitting. By decoupling target estimation and uncertainty estimation, we also control the balance between signal estimation and uncertainty estimation. We conduct three types of experiments: regression with simulation data, age estimation, and depth estimation. We demonstrate that the proposed method outperforms a state-of-the-art technique for signal and uncertainty estimation.
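
For context, a common joint formulation for aleatoric uncertainty in regression trains a network to output both a mean and a variance by minimizing the Gaussian negative log-likelihood. The block below writes out that standard objective; treating it as the kind of existing method the abstract refers to is an assumption, and it is not the separable formulation the paper proposes. The symbols $\mu_\theta$ and $\sigma_\theta^2$ are notation introduced here.

```latex
\mathcal{L}(\theta)
  = \frac{1}{N} \sum_{i=1}^{N}
    \left[
      \frac{\bigl(y_i - \mu_\theta(x_i)\bigr)^2}{2\,\sigma_\theta^2(x_i)}
      + \frac{1}{2} \log \sigma_\theta^2(x_i)
    \right]
```

Because the mean and the variance are fit jointly on the same training data, a mean that overfits leaves small training residuals, which in turn pushes the predicted variance down. This is one plausible reading of why such estimates come out too low and why the abstract emphasizes decoupling the two estimates.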

XOR QA: Cross-lingual Open-Retrieval Question Answering

Oct 24, 2020

Akari Asai, Jungo Kasai, Jonathan H. Clark, Kenton Lee, Eunsol Choi, Hannaneh Hajishirzi

Abstract: Multilingual question answering tasks typically assume answers exist in the same language as the question. Yet in practice, many languages face both information scarcity (where languages have few reference articles) and information asymmetry (where questions reference concepts from other cultures). This work extends open-retrieval question answering to a cross-lingual setting, enabling questions from one language to be answered via answer content from another language. We construct a large-scale dataset built on questions from TyDi QA lacking same-language answers. Our task formulation, called Cross-lingual Open Retrieval Question Answering (XOR QA), includes 40k information-seeking questions from across 7 diverse non-English languages. Based on this dataset, we introduce three new tasks that involve cross-lingual document retrieval using multilingual and English resources. We establish baselines with state-of-the-art machine translation systems and cross-lingual pretrained models. Experimental results suggest that XOR QA is a challenging task that will facilitate the development of novel techniques for multilingual question answering. Our data and code are available at https://nlp.cs.washington.edu/xorqa.

Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval

Oct 22, 2020

Akari Asai, Eunsol Choi

Abstract: Recent progress in pretrained language models has "solved" many reading comprehension benchmark datasets. Yet information-seeking Question Answering (QA) datasets, where questions are written without the evidence document, remain unsolved. We analyze two such datasets (Natural Questions and TyDi QA) to identify the remaining headroom: paragraph selection and answerability classification, i.e., determining whether the paired evidence document contains the answer to the query. In other words, given a gold paragraph and knowledge of whether it contains an answer, models easily outperform a single annotator on both datasets. After identifying unanswerability as a bottleneck, we further inspect what makes questions unanswerable. Our study points to avenues for future research, both for dataset creation and model development.

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Oct 02, 2020

Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto

Abstract: Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer. The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task based on the masked language model of BERT. The task involves predicting randomly masked words and entities in a large entity-annotated corpus retrieved from Wikipedia. We also propose an entity-aware self-attention mechanism that is an extension of the self-attention mechanism of the transformer, and considers the types of tokens (words or entities) when computing attention scores. The proposed model achieves impressive empirical performance on a wide range of entity-related tasks. In particular, it obtains state-of-the-art results on five well-known datasets: Open Entity (entity typing), TACRED (relation classification), CoNLL-2003 (named entity recognition), ReCoRD (cloze-style question answering), and SQuAD 1.1 (extractive question answering). Our source code and pretrained representations are available at https://github.com/studio-ousia/luke.

* EMNLP 2020
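
The entity-aware self-attention described in the abstract above can be sketched as ordinary scaled dot-product attention in which the query projection depends on the types of the two tokens involved. The version below assumes one query matrix per (attending type, attended type) pair and shared key and value matrices; this choice, the single-head setup, and every name in the code are illustrative assumptions rather than the LUKE implementation.

```python
import math


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    peak = max(xs)
    exps = [math.exp(x - peak) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def entity_aware_attention(xs, types, queries, key, value):
    """Single-head attention whose query matrix depends on token types.

    xs:      list of token vectors
    types:   "w" (word) or "e" (entity) for each token
    queries: dict mapping (type_i, type_j) to a query matrix
    """
    dim = len(xs[0])
    keys = [matvec(key, x) for x in xs]
    values = [matvec(value, x) for x in xs]
    outputs = []
    for x_i, t_i in zip(xs, types):
        scores = []
        for k_j, t_j in zip(keys, types):
            # Pick the query projection from the pair of token types.
            q = matvec(queries[(t_i, t_j)], x_i)
            scores.append(dot(q, k_j) / math.sqrt(dim))
        weights = softmax(scores)
        outputs.append([sum(w * v[d] for w, v in zip(weights, values))
                        for d in range(dim)])
    return outputs


identity = [[1.0, 0.0], [0.0, 1.0]]
doubled = [[2.0, 0.0], [0.0, 2.0]]
queries = {("w", "w"): identity, ("w", "e"): doubled,
           ("e", "w"): doubled, ("e", "e"): identity}

xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # two words, one entity
types = ["w", "w", "e"]
out = entity_aware_attention(xs, types, queries, identity, identity)
```

If all four query matrices are set to the same matrix, the function reduces to standard single-head self-attention.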

Logic-Guided Data Augmentation and Regularization for Consistent Question Answering

May 25, 2020

Akari Asai, Hannaneh Hajishirzi

Abstract: Many natural language questions require qualitative, quantitative or logical comparisons between two entities or events. This paper addresses the problem of improving the accuracy and consistency of responses to comparison questions by integrating logic rules and neural models. Our method leverages logical and linguistic knowledge to augment labeled training data and then uses a consistency-based regularizer to train the model. Improving the global consistency of predictions, our approach achieves large improvements over previous methods in a variety of question answering (QA) tasks including multiple-choice qualitative reasoning, cause-effect reasoning, and extractive machine reading comprehension. In particular, our method significantly improves the performance of RoBERTa-based models by 1-5% across datasets. We advance the state of the art by around 5-8% on WIQA and QuaRel and reduce consistency violations by 58% on HotpotQA. We further demonstrate that our approach can learn effectively from limited data.

* Published as a conference paper at ACL 2020
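
As a rough illustration of the two ingredients named in the abstract above, the sketch below augments a yes/no comparison question with its logically implied counterpart and measures how often a model's predictions on such pairs contradict each other. The antonym table, the whitespace tokenization, the form of the penalty, and all function names are simplifying assumptions and are not taken from the paper's code.

```python
import math

# Tiny hypothetical antonym table for comparison words.
ANTONYMS = {"taller": "shorter", "shorter": "taller",
            "more": "less", "less": "more"}
FLIP = {"yes": "no", "no": "yes"}


def augment(question, answer):
    """Swap the comparison word for its antonym and flip the yes/no answer."""
    tokens = question.split()
    swapped = [ANTONYMS.get(t, t) for t in tokens]
    if swapped == tokens:
        return None  # no comparison word found, nothing to augment
    return " ".join(swapped), FLIP[answer]


def violation_rate(predict, examples):
    """Fraction of (original, augmented) pairs with contradictory predictions."""
    pairs = 0
    violations = 0
    for question, answer in examples:
        augmented = augment(question, answer)
        if augmented is None:
            continue
        pairs += 1
        if predict(augmented[0]) != FLIP[predict(question)]:
            violations += 1
    return violations / pairs if pairs else 0.0


def consistency_penalty(p_original, p_augmented):
    """One plausible regularizer: gap between the two log-probabilities."""
    return abs(math.log(p_original) - math.log(p_augmented))


examples = [("Is Alice taller than Bob?", "yes")]
augmented_example = augment(*examples[0])
always_yes_rate = violation_rate(lambda q: "yes", examples)
```

A model that answers "yes" to both a question and its antonym version is counted as inconsistent, and the penalty is zero only when the model is equally confident in the original answer and in the flipped answer to the augmented question.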

Inferential Text Generation with Multiple Knowledge Sources and Meta-Learning

Apr 15, 2020

Daya Guo, Akari Asai, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Jian Yin, Ming Zhou

Abstract: We study the problem of generating inferential texts of events for a variety of commonsense relations such as if-else. Existing approaches typically use limited evidence from training examples and learn each relation individually. In this work, we use multiple knowledge sources as fuel for the model. Existing commonsense knowledge bases like ConceptNet are dominated by taxonomic knowledge (e.g., isA and relatedTo relations) and contain only a limited amount of inferential knowledge. We use not only structured commonsense knowledge bases, but also natural language snippets from search-engine results. These sources are incorporated into a generative base model via a key-value memory network. In addition, we introduce a meta-learning based multi-task learning algorithm. For each targeted commonsense relation, we regard the learning of examples from other relations as the meta-training process, and the evaluation on examples from the targeted relation as the meta-test process. We conduct experiments on the Event2Mind and ATOMIC datasets. Results show that both the integration of multiple knowledge sources and the use of the meta-learning algorithm improve performance.

Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT

Feb 27, 2020

Lichao Sun, Kazuma Hashimoto, Wenpeng Yin, Akari Asai, Jia Li, Philip Yu, Caiming Xiong

Abstract: A growing body of literature reports that deep neural networks are brittle when dealing with adversarial examples that are created maliciously. It is unclear, however, how the models will perform in realistic scenarios where natural rather than malicious adversarial instances often exist. This work systematically explores the robustness of BERT, the state-of-the-art Transformer-style model in NLP, in dealing with noisy data, particularly inadvertent keyboard typing mistakes. Intensive experiments on sentiment analysis and question answering benchmarks indicate that: (i) typos in different words of a sentence do not have equal influence, and typos in informative words cause more severe damage; (ii) mistyping is the most damaging factor, compared with inserting, deleting, etc.; (iii) humans and machines focus on different things when recognizing adversarial attacks.
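
The typo operations named in the abstract above (mistyping, inserting, deleting) can be sketched as simple string perturbations. This is a minimal illustration under assumptions: the partial QWERTY neighbor table, the function names, and the random choice of character position are hypothetical and do not reproduce the paper's procedure for selecting which words to perturb.

```python
import random

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

# Small, partial QWERTY neighbor map (hypothetical; extend as needed).
NEIGHBORS = {"a": "qwsz", "e": "wrsd", "o": "ipkl",
             "t": "ryfg", "n": "bhjm", "s": "awedxz"}


def mistype(word, i, rng):
    """Replace the character at position i with an adjacent key, if known."""
    choices = NEIGHBORS.get(word[i])
    if not choices:
        return word
    return word[:i] + rng.choice(choices) + word[i + 1:]


def insert(word, i, rng):
    """Insert a random lowercase letter before position i."""
    return word[:i] + rng.choice(ALPHABET) + word[i:]


def delete(word, i, rng):
    """Delete the character at position i."""
    return word[:i] + word[i + 1:]


def add_typo(sentence, word_index, op, seed=0):
    """Apply one typo operation to a single word of the sentence."""
    rng = random.Random(seed)
    words = sentence.split()
    word = words[word_index]
    position = rng.randrange(len(word))
    words[word_index] = op(word, position, rng)
    return " ".join(words)


noisy = add_typo("the movie was great", 3, delete)
```

Perturbing an informative word such as "great" in a sentiment example, rather than a function word such as "the", is the kind of contrast the abstract's first finding refers to.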
