W. Bruce Croft

AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization

Apr 13, 2020

Keping Bi, Rahul Jha, W. Bruce Croft, Asli Celikyilmaz

Abstract: Redundancy-aware extractive summarization systems score the redundancy of the sentences to be included in a summary either jointly with their salience information or separately as an additional sentence scoring step. Previous work shows the efficacy of jointly scoring and selecting sentences with neural sequence generation models. It is, however, not well understood whether the gain is due to better encoding techniques or better redundancy reduction approaches. Similarly, the contribution of the salience versus diversity components to the created summary has not been well studied. Building on the state-of-the-art encoding methods for summarization, we present two adaptive learning models: AREDSUM-SEQ, which jointly considers salience and novelty during sentence selection; and a two-step AREDSUM-CTX, which scores salience first, then learns to balance salience and redundancy, enabling the measurement of the impact of each aspect. Empirical results on the CNN/DailyMail and NYT50 datasets show that by modeling diversity explicitly in a separate step, AREDSUM-CTX achieves significantly better performance than AREDSUM-SEQ as well as state-of-the-art extractive summarization baselines.
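
The idea of balancing salience against redundancy during sentence selection can be illustrated with a small greedy sketch. This is not the AREDSUM-SEQ or AREDSUM-CTX model: it is a classical MMR-style heuristic, and the function names, the Jaccard overlap measure, the `trade_off` parameter, and the salience scores are all assumptions made for illustration.

```python
from typing import List, Set


def tokenize(sentence: str) -> Set[str]:
    """Lowercase whitespace tokenization (a deliberately crude assumption)."""
    return set(sentence.lower().split())


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Word-overlap similarity between two token sets."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def select_sentences(sentences: List[str], salience: List[float],
                     k: int, trade_off: float = 0.7) -> List[int]:
    """Greedily pick k sentence indices, trading salience against redundancy.

    Each candidate is scored as
        trade_off * salience - (1 - trade_off) * max overlap with selected.
    """
    tokens = [tokenize(s) for s in sentences]
    selected: List[int] = []
    candidates = list(range(len(sentences)))
    while candidates and len(selected) < k:
        def score(i: int) -> float:
            redundancy = max(
                (jaccard(tokens[i], tokens[j]) for j in selected),
                default=0.0,
            )
            return trade_off * salience[i] - (1 - trade_off) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


sentences = [
    "the cat sat on the mat",
    "the cat sat on the mat today",
    "stocks fell sharply on monday",
]
salience = [0.9, 0.85, 0.6]  # hypothetical salience scores
print(select_sentences(sentences, salience, k=2))
```

With `trade_off=1.0` the redundancy term vanishes and the two near-duplicate sentences are chosen. Lowering it lets the less salient but novel third sentence replace the duplicate.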

IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems

Feb 03, 2020

Liu Yang, Minghui Qiu, Chen Qu, Cen Chen, Jiafeng Guo, Yongfeng Zhang, W. Bruce Croft, Haiqing Chen

Abstract: Personal assistant systems, such as Apple Siri, Google Assistant, Amazon Alexa, and Microsoft Cortana, are becoming ever more widely used. Understanding user intent such as clarification questions, potential answers and user feedback in information-seeking conversations is critical for retrieving good responses. In this paper, we analyze user intent patterns in information-seeking conversations and propose an intent-aware neural response ranking model "IART", which refers to "Intent-Aware Ranking with Transformers". IART is built on top of the integration of user intent modeling and language representation learning with the Transformer architecture, which relies entirely on a self-attention mechanism instead of recurrent nets. It incorporates intent-aware utterance attention to derive an importance weighting scheme of utterances in conversation context with the aim of better conversation history understanding. We conduct extensive experiments with three information-seeking conversation data sets including both standard benchmarks and commercial data. Our proposed model outperforms all baseline methods with respect to a variety of metrics. We also perform case studies and analysis of learned user intent and its impact on response ranking in information-seeking conversations to provide interpretation of results.

* Accepted by WWW2020
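
The notion of an importance weighting over context utterances can be sketched with a plain softmax. This is only a toy illustration, not IART's attention layer: the per-utterance scores, the vectors, and the function names are hypothetical stand-ins for whatever the model actually learns.

```python
import math
from typing import List


def softmax(scores: List[float]) -> List[float]:
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_context(utterance_vectors: List[List[float]],
                     scores: List[float]) -> List[float]:
    """Combine utterance vectors into one context vector using softmax weights."""
    weights = softmax(scores)
    dim = len(utterance_vectors[0])
    return [
        sum(w * vec[d] for w, vec in zip(weights, utterance_vectors))
        for d in range(dim)
    ]


# Hypothetical scores: the second utterance is judged most important.
weights = softmax([0.1, 2.0, 0.3])
print([round(w, 3) for w in weights])
```

Utterances with higher scores contribute more to the combined context vector, which is the general effect an attention-based weighting scheme is meant to have.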

Asking Clarifying Questions in Open-Domain Information-Seeking Conversations

Jul 15, 2019

Mohammad Aliannejadi, Hamed Zamani, Fabio Crestani, W. Bruce Croft

Abstract: Users often fail to formulate their complex information needs in a single query. As a consequence, they may need to scan multiple result pages or reformulate their queries, which may be a frustrating experience. Alternatively, systems can improve user satisfaction by proactively asking questions of the users to clarify their information needs. Asking clarifying questions is especially important in conversational systems since they can only return a limited number of (often only one) result(s). In this paper, we formulate the task of asking clarifying questions in open-domain information-seeking conversational systems. To this end, we propose an offline evaluation methodology for the task and collect a dataset, called Qulac, through crowdsourcing. Our dataset is built on top of the TREC Web Track 2009-2012 data and consists of over 10K question-answer pairs for 198 TREC topics with 762 facets. Our experiments on an oracle model demonstrate that asking only one good question leads to over 170% retrieval performance improvement in terms of P@1, which clearly demonstrates the potential impact of the task. We further propose a retrieval framework consisting of three components: question retrieval, question selection, and document retrieval. In particular, our question selection model takes into account the original query and previous question-answer interactions while selecting the next question. Our model significantly outperforms competitive baselines. To foster research in this area, we have made Qulac publicly available.

* To appear in SIGIR 2019
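
For readers unfamiliar with the metric, the sketch below shows how P@1 and a relative improvement percentage are typically computed. The relevance lists are made-up toy data, not Qulac results, and the function names are assumptions for illustration.

```python
from typing import List, Sequence


def precision_at_k(ranked_relevance: Sequence[bool], k: int = 1) -> float:
    """Fraction of the top-k ranked results that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for rel in ranked_relevance[:k] if rel) / k


def mean_precision_at_k(runs: List[Sequence[bool]], k: int = 1) -> float:
    """Average P@k over a set of queries."""
    return sum(precision_at_k(run, k) for run in runs) / len(runs)


def relative_improvement(baseline: float, improved: float) -> float:
    """Percentage gain of `improved` over `baseline`."""
    return (improved - baseline) / baseline * 100.0


# Toy data: one relevance list per query, ordered by rank.
before = [[False, True], [True, False], [False, False], [False, True]]
after = [[True, False], [True, False], [True, True], [False, True]]

p1_before = mean_precision_at_k(before, k=1)
p1_after = mean_precision_at_k(after, k=1)
print(p1_before, p1_after, relative_improvement(p1_before, p1_after))
```

In this toy case P@1 rises from 0.25 to 0.75, a 200% relative improvement. An improvement of over 170% therefore means the new P@1 is more than 2.7 times the baseline value.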

ANTIQUE: A Non-Factoid Question Answering Benchmark

May 22, 2019

Helia Hashemi, Mohammad Aliannejadi, Hamed Zamani, W. Bruce Croft

Abstract: Considering the widespread use of mobile and voice search, answer passage retrieval for non-factoid questions plays a critical role in modern information retrieval systems. Despite the importance of the task, the community still lacks large-scale non-factoid question answering collections with real questions and comprehensive relevance judgments. In this paper, we develop and release a collection of 2,626 open-domain non-factoid questions from a diverse set of categories. The dataset, called ANTIQUE, contains 34,011 manual relevance annotations. The questions were asked by real users in a community question answering service, i.e., Yahoo! Answers. Relevance judgments for all the answers to each question were collected through crowdsourcing. To facilitate further research, we also include a brief analysis of the data as well as baseline results on both classical and recently developed neural IR models.

Investigating the Successes and Failures of BERT for Passage Re-Ranking

May 05, 2019

Harshith Padigela, Hamed Zamani, W. Bruce Croft

Abstract: The bidirectional encoder representations from transformers (BERT) model has recently advanced the state of the art in passage re-ranking. In this paper, we analyze the results produced by a fine-tuned BERT model to better understand the reasons behind such substantial improvements. To this end, we focus on the MS MARCO passage re-ranking dataset and provide potential reasons for the successes and failures of BERT for retrieval. In more detail, we empirically study a set of hypotheses and provide additional analysis to explain the successful performance of BERT.

A Hybrid Retrieval-Generation Neural Conversation Model

Apr 19, 2019

Liu Yang, Junjie Hu, Minghui Qiu, Chen Qu, Jianfeng Gao, W. Bruce Croft, Xiaodong Liu, Yelong Shen, Jingjing Liu

Abstract: Intelligent personal assistant systems, with either text-based or voice-based conversational interfaces, are becoming increasingly popular. Most previous research has used either retrieval-based or generation-based methods. Retrieval-based methods have the advantage of returning fluent and informative responses with great diversity. The retrieved responses are easier to control and explain. However, the response retrieval performance is limited by the size of the response repository. On the other hand, although generation-based methods can return highly coherent responses given conversation context, they are likely to return universal or general responses with insufficient ground knowledge information. In this paper, we build a hybrid neural conversation model with the capability of both response retrieval and generation, in order to combine the merits of these two types of methods. Experimental results on Twitter and Foursquare data show that the proposed model can outperform both retrieval-based methods and generation-based methods (including a recently proposed knowledge-grounded neural conversation model) under both automatic evaluation metrics and human evaluation. Our models and research findings provide new insights on how to integrate text retrieval and text generation models for building conversation systems.

* 11 pages

Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching

Dec 30, 2018

Chen Qu, Feng Ji, Minghui Qiu, Liu Yang, Zhiyu Min, Haiqing Chen, Jun Huang, W. Bruce Croft

Abstract: Deep text matching approaches have been widely studied for many applications including question answering and information retrieval systems. To deal with a domain that has insufficient labeled data, these approaches can be used in a Transfer Learning (TL) setting to leverage labeled data from a resource-rich source domain. To achieve better performance, source domain data selection is essential in this process to prevent the "negative transfer" problem. However, the emerging deep transfer models do not fit well with most existing data selection methods, because the data selection policy and the transfer learning model are not jointly trained, leading to sub-optimal training efficiency. In this paper, we propose a novel reinforced data selector to select high-quality source domain data to help the TL model. Specifically, the data selector "acts" on the source domain data to find a subset for optimization of the TL model, and the performance of the TL model can provide "rewards" in turn to update the selector. We build the reinforced data selector based on the actor-critic framework and integrate it into a DNN-based transfer learning model, resulting in a Reinforced Transfer Learning (RTL) method. We perform a thorough experimental evaluation on two major tasks for text matching, namely, paraphrase identification and natural language inference. Experimental results show that the proposed RTL can significantly improve the performance of the TL model. We further investigate different settings of states, rewards, and policy optimization methods to examine the robustness of our method. Last, we conduct a case study on the selected data and find that our method is able to select source domain data whose Wasserstein distance is close to the target domain data. This is reasonable and intuitive as such source domain data can provide more transferability power to the model.

* Accepted to WSDM 2019
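
The Wasserstein distance mentioned in the case study can be made concrete with a one-dimensional sketch. The listing does not say how the authors computed it, so the following is only an illustration under simplifying assumptions: scalar features, equally sized samples, and made-up values. In that special case the 1-Wasserstein distance between two empirical distributions is the mean absolute difference of the sorted samples.

```python
from typing import List, Sequence


def wasserstein_1d(xs: Sequence[float], ys: Sequence[float]) -> float:
    """1-Wasserstein distance between two equally sized 1-D samples.

    For equal sample sizes, the optimal transport plan pairs the i-th
    smallest value of one sample with the i-th smallest of the other.
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("samples must be non-empty and of equal size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)


# Hypothetical scalar features for a target domain and two source subsets.
target: List[float] = [0.0, 1.0, 2.0]
near_source: List[float] = [0.5, 1.5, 2.5]
far_source: List[float] = [5.0, 6.0, 7.0]

print(wasserstein_1d(near_source, target))  # smaller: closer to the target
print(wasserstein_1d(far_source, target))   # larger: farther from the target
```

A source subset with a smaller distance to the target sample is, in this simplified sense, distributionally closer to the target domain.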

Transfer Learning for Context-Aware Question Matching in Information-seeking Conversations in E-commerce

Jun 14, 2018

Minghui Qiu, Liu Yang, Feng Ji, Weipeng Zhao, Wei Zhou, Jun Huang, Haiqing Chen, W. Bruce Croft, Wei Lin

Abstract: Building multi-turn information-seeking conversation systems is an important and challenging research topic. Although several advanced neural text matching models have been proposed for this task, they are generally not efficient for industrial applications. Furthermore, they rely on a large amount of labeled data, which may not be available in real-world applications. To alleviate these problems, we study transfer learning for multi-turn information-seeking conversations in this paper. We first propose an efficient and effective multi-turn conversation model based on convolutional neural networks. After that, we extend our model to adapt the knowledge learned from a resource-rich domain to enhance the performance. Finally, we deployed our model in an industrial chatbot called AliMe Assist (https://consumerservice.taobao.com/online-help) and observed a significant improvement over the existing online model.

* ACL 2018
* 6

Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation Systems

May 09, 2018

Liu Yang, Minghui Qiu, Chen Qu, Jiafeng Guo, Yongfeng Zhang, W. Bruce Croft, Jun Huang, Haiqing Chen

Abstract: Intelligent personal assistant systems with either text-based or voice-based conversational interfaces are becoming increasingly popular around the world. Retrieval-based conversation models have the advantage of returning fluent and informative responses. Most existing studies in this area are on open-domain "chit-chat" conversations or task/transaction-oriented conversations. More research is needed for information-seeking conversations. There is also a lack of modeling of external knowledge beyond the dialog utterances among current conversational models. In this paper, we propose a learning framework on top of deep neural matching networks that leverages external knowledge for response ranking in information-seeking conversation systems. We incorporate external knowledge into deep neural models with pseudo-relevance feedback and QA correspondence knowledge distillation. Extensive experiments with three information-seeking conversation data sets including both open benchmarks and commercial data show that our methods outperform various baseline methods including several deep text matching models and the state-of-the-art method on response selection in multi-turn conversations. We also perform analysis over different response types, model variations and ranking examples. Our models and research findings provide new insights on how to utilize external knowledge with deep neural models for response selection and have implications for the design of the next generation of information-seeking conversation systems.

* Accepted by the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018), Ann Arbor, Michigan, U.S.A., July 8-12, 2018 (Full Oral Paper)
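
Pseudo-relevance feedback, named in the abstract above, is classically done by treating the top-ranked results as relevant and borrowing terms from them. The sketch below shows that classical term-expansion form only. It does not reproduce how the paper feeds such knowledge into deep matching networks, and the function name, stopword list, and example texts are assumptions for illustration.

```python
from collections import Counter
from typing import FrozenSet, List

# A tiny, hypothetical stopword list for the example.
STOPWORDS: FrozenSet[str] = frozenset({"your", "in", "to", "the", "a"})


def expand_query(query: str, ranked_docs: List[str],
                 top_docs: int = 2, num_terms: int = 2) -> List[str]:
    """Append the most frequent new terms from the top-ranked documents.

    The top `top_docs` results are assumed relevant (the "pseudo" part),
    and their most common non-query, non-stopword terms expand the query.
    """
    query_terms = query.lower().split()
    counts: Counter = Counter()
    for doc in ranked_docs[:top_docs]:
        for term in doc.lower().split():
            if term not in query_terms and term not in STOPWORDS:
                counts[term] += 1
    # Sort by frequency, then alphabetically, so the output is deterministic.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return query_terms + [term for term, _ in ordered[:num_terms]]


docs = [
    "reset your account password in settings",
    "open account settings to reset password",
    "unrelated text about printers",
]
print(expand_query("reset password", docs))
```

Here the expanded query gains "account" and "settings", terms that the original query lacked but that recur in the top-ranked results.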

Analyzing and Characterizing User Intent in Information-seeking Conversations

Apr 23, 2018

Chen Qu, Liu Yang, W. Bruce Croft, Johanne R. Trippas, Yongfeng Zhang, Minghui Qiu

Abstract: Understanding and characterizing how people interact in information-seeking conversations is crucial in developing conversational search systems. In this paper, we introduce a new dataset designed for this purpose and use it to analyze information-seeking conversations by user intent distribution, co-occurrence, and flow patterns. The MSDialog dataset is a labeled dialog dataset of question answering (QA) interactions between information seekers and providers from an online forum on Microsoft products. The dataset contains more than 2,000 multi-turn QA dialogs with 10,000 utterances that are annotated with user intent on the utterance level. Annotations were done using crowdsourcing. With MSDialog, we find some highly recurring patterns in user intent during an information-seeking process. They could be useful for designing conversational search systems. We will make our dataset freely available to encourage exploration of information-seeking conversation models.

* Accepted by SIGIR 2018 as a short paper
