Jiahai Wang

Learning Generalizable Models for Vehicle Routing Problems via Knowledge Distillation

Oct 14, 2022

Jieyi Bi, Yining Ma, Jiahai Wang, Zhiguang Cao, Jinbiao Chen, Yuan Sun, Yeow Meng Chee

Abstract: Recent neural methods for vehicle routing problems always train and test the deep models on the same instance distribution (i.e., uniform). To tackle the consequent cross-distribution generalization concerns, we bring knowledge distillation to this field and propose an Adaptive Multi-Distribution Knowledge Distillation (AMDKD) scheme for learning more generalizable deep models. In particular, AMDKD leverages knowledge from multiple teachers trained on exemplar distributions to yield a lightweight yet generalist student model. Meanwhile, we equip AMDKD with an adaptive strategy that allows the student to concentrate on difficult distributions, so as to absorb hard-to-master knowledge more effectively. Extensive experimental results show that, compared with the baseline neural methods, AMDKD achieves competitive results on both unseen in-distribution and out-of-distribution instances, which are either randomly synthesized or adopted from benchmark datasets (i.e., TSPLIB and CVRPLIB). Notably, AMDKD is generic and consumes fewer computational resources for inference.

* Accepted at NeurIPS 2022
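
The adaptive strategy mentioned in the abstract above can be illustrated with a minimal sketch. It assumes, purely for illustration, that the student's validation gap on each exemplar distribution is known and that harder distributions are sampled more often through a softmax over those gaps. The function names, the distribution names other than "uniform", the gap values, and the softmax rule itself are hypothetical and are not taken from the paper.

```python
import math
import random


def selection_probabilities(gaps, temperature=1.0):
    """Softmax over per-distribution gaps: a larger gap gets a higher probability.

    `gaps` maps a distribution name to a (hypothetical) validation gap.
    """
    scaled = [g / temperature for g in gaps.values()]
    peak = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    return {name: w / total for name, w in zip(gaps, weights)}


def pick_distribution(gaps, rng, temperature=1.0):
    """Sample the distribution (and hence the teacher) for the next training step."""
    probs = selection_probabilities(gaps, temperature)
    names = list(probs)
    return rng.choices(names, weights=[probs[n] for n in names], k=1)[0]


# Hypothetical gaps of the student on three exemplar distributions.
gaps = {"uniform": 0.5, "cluster": 2.0, "mixed": 1.0}
probs = selection_probabilities(gaps)
chosen = pick_distribution(gaps, random.Random(0))
```

With these made-up gaps, the distribution on which the student performs worst receives the largest sampling probability, which is one simple way to make training concentrate on difficult distributions.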

Improving Task Generalization via Unified Schema Prompt

Aug 05, 2022

Wanjun Zhong, Yifan Gao, Ning Ding, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan

Abstract: Task generalization has been a long-standing challenge in Natural Language Processing (NLP). Recent research attempts to improve the task generalization ability of pre-trained language models by mapping NLP tasks into human-readable prompted forms. However, these approaches require laborious and inflexible manual collection of prompts, and different prompts on the same downstream task may yield unstable performance. We propose Unified Schema Prompt, a flexible and extensible prompting method, which automatically customizes the learnable prompts for each task according to the task input schema. It models the shared knowledge between tasks while keeping the characteristics of different task schemas, and thus enhances task generalization ability. The schema prompt takes the explicit data structure of each task to formulate prompts so that little human effort is involved. To test the task generalization ability of schema prompt at scale, we conduct schema prompt-based multitask pre-training on a wide variety of general NLP tasks. The framework achieves strong zero-shot and few-shot generalization performance on 16 unseen downstream tasks from 8 task types (e.g., QA and NLI). Furthermore, comprehensive analyses demonstrate the effectiveness of each component in the schema prompt, its flexibility in task compositionality, and its ability to improve performance under a full-data fine-tuning setting.

ProQA: Structural Prompt-based Pre-training for Unified Question Answering

May 09, 2022

Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan

Abstract: Question Answering (QA) is a longstanding challenge in natural language processing. Existing QA works mostly focus on specific question types, knowledge domains, or reasoning skills. This specialization in QA research hinders systems from modeling commonalities between tasks and from generalizing to wider applications. To address this issue, we present ProQA, a unified QA paradigm that solves various tasks through a single model. ProQA takes a unified structural prompt as the bridge and improves the QA-centric ability by structural prompt-based pre-training. Through a structurally designed prompt-based input schema, ProQA concurrently models the knowledge generalization for all QA tasks while keeping the knowledge customization for every specific QA task. Furthermore, ProQA is pre-trained with a structural prompt-formatted, large-scale synthesized corpus, which empowers the model with the commonly required QA ability. Experimental results on 11 QA benchmarks demonstrate that ProQA consistently boosts performance in full-data fine-tuning, few-shot learning, and zero-shot testing scenarios. Furthermore, ProQA exhibits strong ability in both continual learning and transfer learning by taking advantage of the structural prompt.

* NAACL 2022

Graph Neural Networks with Dynamic and Static Representations for Social Recommendation

Jan 31, 2022

Junfa Lin, Siyuan Chen, Jiahai Wang

Abstract: Recommender systems based on graph neural networks receive increasing research interest due to their excellent ability to learn a variety of side information, including social networks. However, previous works usually focus on modeling users and pay little attention to items. Moreover, possible changes in the attraction of items over time, which are analogous to the dynamic interests of users, are rarely considered, and neither are the correlations among items. To overcome these limitations, this paper proposes graph neural networks with dynamic and static representations for social recommendation (GNN-DSR), which considers both dynamic and static representations of users and items and incorporates their relational influence. GNN-DSR models the short-term dynamic and long-term static interactional representations of the user's interest and the item's attraction, respectively. Furthermore, the attention mechanism is used to aggregate the social influence of users on the target user and the correlative items' influence on a given item. The final latent factors of user and item are combined to make a prediction. Experiments on three real-world recommender system datasets validate the effectiveness of GNN-DSR.

* 17 pages, 4 figures. Extended version of paper accepted by DASFAA 2022

Reasoning over Hybrid Chain for Table-and-Text Open Domain QA

Jan 15, 2022

Wanjun Zhong, Junjie Huang, Qian Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan

Abstract: Tabular and textual question answering requires systems to perform reasoning over heterogeneous information, considering table structure and the connections between table and text. In this paper, we propose a ChAin-centric Reasoning and Pre-training framework (CARP). CARP utilizes a hybrid chain to model the explicit intermediate reasoning process across table and text for question answering. We also propose a novel chain-centric pre-training method to enhance the pre-trained model in identifying the cross-modality reasoning process and to alleviate the data sparsity problem. This method constructs a large-scale reasoning corpus by synthesizing pseudo heterogeneous reasoning paths from Wikipedia and generating corresponding questions. We evaluate our system on OTT-QA, a large-scale table-and-text open-domain question answering benchmark, and our system achieves state-of-the-art performance. Further analyses illustrate that the explicit hybrid chain offers substantial performance improvement and interpretability of the intermediate reasoning process, and that the chain-centric pre-training boosts performance on chain extraction.

MODRL/D-EL: Multiobjective Deep Reinforcement Learning with Evolutionary Learning for Multiobjective Optimization

Jul 16, 2021

Yongxin Zhang, Jiahai Wang, Zizhen Zhang, Yalan Zhou

Abstract: Learning-based heuristics for solving combinatorial optimization problems have recently attracted much academic attention. While most of the existing works only consider single-objective problems with simple constraints, many real-world problems are multiobjective and contain a rich set of constraints. This paper proposes a multiobjective deep reinforcement learning with evolutionary learning algorithm for a typical complex problem called the multiobjective vehicle routing problem with time windows (MO-VRPTW). In the proposed algorithm, the decomposition strategy is applied to generate subproblems for a set of attention models. Comprehensive context information is introduced to further enhance the attention models. Evolutionary learning is also employed to fine-tune the parameters of the models. The experimental results on MO-VRPTW instances demonstrate the superiority of the proposed algorithm over other learning-based and iterative-based approaches.

Topic-to-Essay Generation with Comprehensive Knowledge Enhancement

Jun 29, 2021

Zhiyue Liu, Jiahai Wang, Zhenghong Li

Abstract: Generating high-quality and diverse essays with a set of topics is a challenging task in natural language generation. Since several given topics only provide limited source information, utilizing various topic-related knowledge is essential for improving essay generation performance. However, previous works cannot sufficiently use that knowledge to facilitate the generation procedure. This paper aims to improve essay generation by extracting information from both internal and external knowledge. Thus, a topic-to-essay generation model with comprehensive knowledge enhancement, named TEGKE, is proposed. For internal knowledge enhancement, both topics and related essays are fed to a teacher network as source information. Informative features are then obtained from the teacher network and transferred to a student network, which only takes topics as input but provides information comparable to that of the teacher network. For external knowledge enhancement, a topic knowledge graph encoder is proposed. Unlike previous works that use only the nearest neighbors of topics in the commonsense base, our topic knowledge graph encoder can exploit more structural and semantic information of the commonsense knowledge graph to facilitate essay generation. Moreover, adversarial training based on the Wasserstein distance is proposed to improve generation quality. Experimental results demonstrate that TEGKE achieves state-of-the-art performance on both automatic and human evaluation.

* 20 pages

Meta-Learning-based Deep Reinforcement Learning for Multiobjective Optimization Problems

May 06, 2021

Zizhen Zhang, Zhiyuan Wu, Jiahai Wang

Abstract: Deep reinforcement learning (DRL) has recently shown its success in tackling complex combinatorial optimization problems. When these problems are extended to multiobjective ones, it becomes difficult for the existing DRL approaches to flexibly and efficiently deal with multiple subproblems determined by weight decomposition of objectives. This paper proposes a concise meta-learning-based DRL approach. It first trains a meta-model by meta-learning. The meta-model is fine-tuned with a few update steps to derive submodels for the corresponding subproblems. The Pareto front is built accordingly. The computational experiments on multiobjective traveling salesman problems demonstrate the superiority of our method over most learning-based and iteration-based approaches.
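
The weight decomposition and Pareto front mentioned in the abstract above can be illustrated with a short sketch. It assumes a two-objective minimization problem and a plain weighted-sum scalarization, which is one common decomposition choice and not necessarily the one used in the paper. All function names are hypothetical, and no learning is involved: the sketch only shows how weight vectors define subproblems and how non-dominated solutions are filtered.

```python
def weight_vectors(num_subproblems):
    """Evenly spaced weight vectors (w, 1 - w) for a two-objective problem."""
    if num_subproblems < 2:
        raise ValueError("need at least two subproblems")
    step = 1.0 / (num_subproblems - 1)
    return [(i * step, 1.0 - i * step) for i in range(num_subproblems)]


def scalarize(objectives, weights):
    """Weighted-sum scalarization: each weight vector defines one subproblem."""
    return sum(w * f for w, f in zip(weights, objectives))


def pareto_front(points):
    """Keep the points that no other point dominates (minimization)."""
    def dominates(a, b):
        no_worse = all(x <= y for x, y in zip(a, b))
        strictly_better = any(x < y for x, y in zip(a, b))
        return no_worse and strictly_better

    return [p for p in points if not any(dominates(q, p) for q in points)]


weights = weight_vectors(3)
# Hypothetical objective values of four candidate solutions.
candidates = [(1, 3), (2, 2), (3, 1), (3, 3)]
front = pareto_front(candidates)
```

In a decomposition-based approach, one submodel would be responsible for each entry of `weights`, and the solutions returned for all subproblems would be passed through a filter such as `pareto_front`.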

AR-LSAT: Investigating Analytical Reasoning of Text

Apr 15, 2021

Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan

Abstract: Analytical reasoning is an essential and challenging task that requires a system to analyze a scenario involving a set of particular circumstances and perform reasoning over it to draw conclusions. In this paper, we study the challenge of analytical reasoning of text and introduce a new dataset consisting of questions from the Law School Admission Test from 1991 to 2016. We analyze what knowledge understanding and reasoning abilities are required to do well on this task. Furthermore, to address this reasoning challenge, we design two different baselines: (1) a Transformer-based method which leverages the state-of-the-art pre-trained language models and (2) Analytical Reasoning Machine (ARM), a logical-level reasoning framework extracting symbolic knowledge (e.g., participants, facts, logical functions) to deduce legitimate solutions. In our experiments, we find that the Transformer-based models struggle to solve this task, as their performance is close to random guessing, while ARM achieves better performance by leveraging symbolic knowledge and interpretable reasoning steps. Results show that both methods still lag far behind human performance, which leaves room for future research.

* 13 pages, 5 figures

Scale-free Network-based Differential Evolution

Jan 27, 2021

Yang Yu, Shangce Gao, MengChu Zhou, Yirui Wang, Zhenyu Lei, Tengfei Zhang, Jiahai Wang

Abstract: Some recent research reveals that a topological structure in meta-heuristic algorithms can effectively enhance the interaction within the population, and thus improve their performance. Inspired by this, we investigate the effectiveness of using a scale-free network in differential evolution methods, and propose a scale-free network-based differential evolution method. The novelties of this paper include a scale-free network-based population structure and a new mutation operator designed to fully utilize the neighborhood information provided by a scale-free structure. The elite individuals and the population at the latest generation are both employed to guide a global optimization process. In this manner, the proposed algorithm has balanced exploration and exploitation capabilities that alleviate premature convergence. Experimental and statistical analyses are performed on the CEC'17 benchmark function suite and three real-world problems. Results demonstrate its superior effectiveness and efficiency in comparison with its competitive peers.
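
For readers unfamiliar with the baseline that the abstract above builds on, the following is a minimal sketch of classic differential evolution (the standard DE/rand/1/bin variant). It does not implement the scale-free population structure or the new mutation operator described in the abstract. The parameter values, the function names, and the sphere test function are illustrative assumptions.

```python
import random


def differential_evolution(objective, bounds, pop_size=20, f=0.5, cr=0.9,
                           generations=100, seed=0):
    """Classic DE/rand/1/bin minimizer (not the scale-free variant)."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    fitness = [objective(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct individuals, all different from the target i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            forced = rng.randrange(dim)  # at least one gene comes from the mutant
            trial = []
            for d in range(dim):
                if d == forced or rng.random() < cr:
                    lo, hi = bounds[d]
                    value = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    value = min(max(value, lo), hi)  # clip to the search bounds
                else:
                    value = pop[i][d]
                trial.append(value)
            trial_fitness = objective(trial)
            # Greedy selection: keep the trial only if it is not worse.
            if trial_fitness <= fitness[i]:
                pop[i], fitness[i] = trial, trial_fitness
    best = min(range(pop_size), key=fitness.__getitem__)
    return pop[best], fitness[best]


def sphere(x):
    """Simple convex test function with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best_x, best_val = differential_evolution(sphere, [(-5.0, 5.0)] * 3)
```

In this baseline the three donor individuals are drawn uniformly from the whole population. A network-based variant would instead restrict or bias that choice using neighborhood information, which is the part the abstract identifies as its contribution.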
