Xi Chen

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback

Sep 06, 2020
Zifeng Wang, Xi Chen, Rui Wen, Shao-Lun Huang, Ercan E. Kuruoglu, Yefeng Zheng

Counterfactual learning for dealing with missing-not-at-random (MNAR) data is an intriguing topic in the recommendation literature, since MNAR data are ubiquitous in modern recommender systems. Most previous counterfactual learning methods require missing-at-random (MAR) data, namely data collected through randomized controlled trials (RCTs). However, running RCTs is extraordinarily expensive in practice. To circumvent the use of RCTs, we build an information-theoretic counterfactual variational information bottleneck (CVIB) as an alternative for debiased learning without RCTs. By separating the task-aware mutual information term in the original information bottleneck Lagrangian into factual and counterfactual parts, we derive a contrastive information loss and an additional output confidence penalty, which together facilitate balanced learning between the factual and counterfactual domains. Empirical evaluation on real-world datasets shows that our CVIB significantly enhances both shallow and deep models, which sheds light on counterfactual learning in recommendation that goes beyond RCTs.

On the Sample Complexity of Reinforcement Learning with Policy Space Generalization

Aug 17, 2020
Wenlong Mou, Zheng Wen, Xi Chen

We study the optimal sample complexity in large-scale Reinforcement Learning (RL) problems with policy space generalization, i.e., the agent has prior knowledge that the optimal policy lies in a known policy space. Existing results show that without a generalization model, the sample complexity of an RL algorithm will inevitably depend on the cardinalities of the state space and action space, which are intractably large in many practical problems. To avoid such undesirable dependence on the state and action space sizes, this paper proposes a new notion of eluder dimension for the policy space, which characterizes the intrinsic complexity of policy learning in an arbitrary Markov Decision Process (MDP). Using a simulator oracle, we prove a near-optimal sample complexity upper bound that depends only linearly on the eluder dimension. We further prove a similar regret bound in deterministic systems without the simulator.

Transformer with Bidirectional Decoder for Speech Recognition

Aug 11, 2020
Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin

Attention-based models have recently made tremendous progress on end-to-end automatic speech recognition (ASR). However, conventional transformer-based approaches usually generate the output sequence token by token from left to right, leaving the right-to-left contexts unexploited. In this work, we introduce a bidirectional speech transformer to utilize the different directional contexts simultaneously. Specifically, the outputs of our proposed transformer include a left-to-right target and a right-to-left target. At the inference stage, we use the introduced bidirectional beam search method, which generates both left-to-right and right-to-left candidates and determines the best hypothesis by score. To demonstrate our proposed speech transformer with a bidirectional decoder (STBD), we conduct extensive experiments on the AISHELL-1 dataset. The experimental results show that STBD achieves a 3.6% relative CER reduction (CERR) over the unidirectional speech transformer baseline. In addition, the strongest model in this paper, called STBD-Big, achieves 6.64% CER on the test set without language model rescoring or any extra data augmentation strategies.
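For readers unfamiliar with the metrics quoted in this abstract, the following is a minimal sketch of how character error rate (CER) and a relative CER reduction are commonly computed. It is not taken from the paper; the function names and the sample strings are hypothetical, and the numbers are illustrative only.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance normalized by reference length."""
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(baseline: float, new: float) -> float:
    """Relative reduction of an error rate with respect to a baseline."""
    return (baseline - new) / baseline


# Illustrative values only (not results from the paper).
sample_cer = cer("abcd", "abxd")
sample_cerr = relative_reduction(10.0, 9.0)
```

A relative reduction is measured against the baseline's own error rate, so a 3.6% CERR is not a drop of 3.6 absolute percentage points.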

* Accepted by InterSpeech 2020

Variable Skipping for Autoregressive Range Density Estimation

Jul 10, 2020
Eric Liang, Zongheng Yang, Ion Stoica, Pieter Abbeel, Yan Duan, Xi Chen

Deep autoregressive models compute point likelihood estimates of individual data points. However, many applications (e.g., database cardinality estimation) require estimating range densities, a capability that is underexplored in the current neural density estimation literature. In these applications, fast and accurate range density estimates over high-dimensional data directly impact user-perceived performance. In this paper, we explore a technique, variable skipping, for accelerating range density estimation over deep autoregressive models. This technique exploits the sparse structure of range density queries to avoid sampling unnecessary variables during approximate inference. We show that variable skipping provides 10-100$\times$ efficiency improvements when targeting challenging high-quantile error metrics, enables complex applications such as text pattern matching, and can be realized via a simple data augmentation procedure without changing the usual maximum likelihood objective.

* ICML 2020. Code released at: https://var-skip.github.io/

Human-centered collaborative robots with deep reinforcement learning

Jul 02, 2020
Ali Ghadirzadeh, Xi Chen, Wenjie Yin, Zhengrong Yi, Mårten Björkman, Danica Kragic

We present a reinforcement-learning-based framework for human-centered collaborative systems. The framework is proactive and balances the benefits of timely actions with the risk of taking improper actions by minimizing the total time spent to complete the task. The framework is learned end-to-end in an unsupervised fashion, addressing perception uncertainties and decision making in an integrated manner. On an example packaging task, the framework is shown to provide more fluent coordination between human and robot partners than alternatives in which the perception and decision-making systems are learned independently using supervised learning. The foremost benefit of the proposed approach is that it allows for fast adaptation to new human partners and tasks, since tedious annotation of motion data is avoided and the learning is performed on-line.

NeuroCard: One Cardinality Estimator for All Tables

Jun 15, 2020
Zongheng Yang, Amog Kamsetty, Sifei Luan, Eric Liang, Yan Duan, Xi Chen, Ion Stoica

Query optimizers rely on accurate cardinality estimates to produce good execution plans. Despite decades of research, existing cardinality estimators are inaccurate for complex queries because they make lossy modeling assumptions and fail to capture inter-table correlations. In this work, we show that it is possible to learn the correlations across all tables in a database without any independence assumptions. We present NeuroCard, a join cardinality estimator that builds a single neural density estimator over an entire database. Leveraging join sampling and modern deep autoregressive models, NeuroCard makes no inter-table or inter-column independence assumptions in its probabilistic modeling. NeuroCard achieves orders of magnitude higher accuracy than the best prior methods (a new state-of-the-art result of 8.5$\times$ maximum error on JOB-light) and scales to dozens of tables, while being compact in space (several MBs) and efficient to construct or update (seconds to minutes).
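The abstract does not define its error metric. Assuming the "maximum error" refers to the multiplicative Q-error commonly used to evaluate cardinality estimators, a minimal sketch of that metric could look like the following. The function names, the clamping of values below 1, and the sample numbers are assumptions for illustration, not details from the paper.

```python
def q_error(estimate: float, truth: float) -> float:
    """Multiplicative error: how many times off the estimate is, in either direction.

    Both values are clamped to at least 1 (an assumed convention) so that
    empty results do not cause division by zero.
    """
    est = max(estimate, 1.0)
    tru = max(truth, 1.0)
    return max(est / tru, tru / est)


def max_q_error(pairs):
    """Worst-case Q-error over (estimate, truth) pairs for a query workload."""
    return max(q_error(est, tru) for est, tru in pairs)


# Hypothetical (estimate, truth) cardinalities for three queries.
workload = [(120.0, 100.0), (10.0, 85.0), (5000.0, 4000.0)]
worst = max_q_error(workload)  # dominated by the second query (85 / 10)
```

Under this reading, a maximum error of 8.5$\times$ would mean that no query in the benchmark was over- or underestimated by more than a factor of 8.5.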

COVID-19 Public Opinion and Emotion Monitoring System Based on Time Series Thermal New Word Mining

May 23, 2020
Yixian Zhang, Jieren Chen, Boyi Liu, Yifan Yang, Haocheng Li, Xinyi Zheng, Xi Chen, Tenglong Ren, Naixue Xiong

As new epidemics spread and develop, identifying how public emotions change with the epidemic has great reference value. We designed and implemented a COVID-19 public opinion monitoring system based on time series thermal new word mining. We propose a new word structure discovery scheme based on the timing explosion of network topics, together with a Chinese sentiment analysis method for the COVID-19 public opinion environment. We establish a "Scrapy-Redis-Bloomfilter" distributed crawler framework to collect data. The system can judge whether a reviewer's emotions are positive or negative based on the comments, and can also reflect the depth of seven emotions such as Hopeful, Happy, and Depressed. Finally, we improved the sentiment discriminant model of this system and compared its sentiment discriminant error on COVID-19 related comments with that of the Jiagu deep learning model. The results show that our model has better generalization ability and a smaller discriminant error. We also designed a large data visualization screen that clearly shows the trend of public emotions, the proportion of various emotion categories, keywords, hot topics, etc., and fully and intuitively reflects the development of public opinion.

Non-iterative Simultaneous Rigid Registration Method for Serial Sections of Biological Tissue

May 11, 2020
Chang Shu, Xi Chen, Qiwei Xie, Chi Xiao, Hua Han

In this paper, we propose a novel non-iterative algorithm to simultaneously estimate the optimal rigid transformations for serial section images, which is a key component in volume reconstruction of serial sections of biological tissue. To avoid the error accumulation and propagation caused by current algorithms, we add the extra condition that the positions of the first and the last section images should remain unchanged. This constrained simultaneous registration problem has not been solved before. Our algorithm is non-iterative and can simultaneously compute rigid transformations for a large number of serial section images in a short time. We prove that our algorithm attains the optimal solution under ideal conditions. We test our algorithm on synthetic and real data to verify its effectiveness.
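As background for the term "rigid transformation", the sketch below shows the standard closed-form least-squares fit of a 2D rotation and translation between two sets of matched points. This is only the textbook pairwise building block, not the constrained simultaneous method the paper proposes; the function name and the sample points are hypothetical.

```python
import math


def fit_rigid_2d(src, dst):
    """Least-squares rotation angle and translation mapping src points onto dst.

    src and dst are equal-length lists of matched (x, y) points.
    Returns (theta, (tx, ty)) such that dst is approximately R(theta) @ src + t.
    """
    n = len(src)
    sx = sum(p[0] for p in src) / n
    sy = sum(p[1] for p in src) / n
    dx = sum(q[0] for q in dst) / n
    dy = sum(q[1] for q in dst) / n

    dot = 0.0    # sum of dot products of centered point pairs
    cross = 0.0  # sum of 2D cross products of centered point pairs
    for (px, py), (qx, qy) in zip(src, dst):
        ax, ay = px - sx, py - sy
        bx, by = qx - dx, qy - dy
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx

    theta = math.atan2(cross, dot)
    c, s = math.cos(theta), math.sin(theta)
    # The translation aligns the rotated source centroid with the target centroid.
    tx = dx - (c * sx - s * sy)
    ty = dy - (s * sx + c * sy)
    return theta, (tx, ty)
```

Chaining such pairwise fits from one section to the next is the kind of procedure in which small errors can accumulate, which is the issue the abstract's end-point constraint is meant to address.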

* 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), Washington, DC, 2018, pp. 436-440
* appears in IEEE International Symposium on Biomedical Imaging 2018 (ISBI 2018)

Learning and Testing Junta Distributions with Subcube Conditioning

Apr 26, 2020
Xi Chen, Rajesh Jayaram, Amit Levi, Erik Waingarten

We study the problems of learning and testing junta distributions on $\{-1,1\}^n$ with respect to the uniform distribution, where a distribution $p$ is a $k$-junta if its probability mass function $p(x)$ depends on a subset of at most $k$ variables. The main contribution is an algorithm for finding relevant coordinates in a $k$-junta distribution with subcube conditioning [BC18, CCKLW20]. We give two applications: 1. An algorithm for learning $k$-junta distributions with $\tilde{O}(k/\epsilon^2) \log n + O(2^k/\epsilon^2)$ subcube conditioning queries, and 2. An algorithm for testing $k$-junta distributions with $\tilde{O}((k + \sqrt{n})/\epsilon^2)$ subcube conditioning queries. All our algorithms are optimal up to poly-logarithmic factors. Our results show that subcube conditioning, as a natural model for accessing high-dimensional distributions, enables significant savings in learning and testing junta distributions compared to the standard sampling model. This addresses an open question posed by Aliakbarpour, Blais, and Rubinfeld [ABR17].
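To make the definition of a $k$-junta distribution concrete, here is a small brute-force sketch that checks whether a probability mass function on $\{-1,1\}^n$ depends only on a given set of coordinates. It enumerates all $2^n$ points, so it is only an illustration for tiny $n$ and is unrelated to the query-efficient algorithms in the paper; the function names and the example distribution are hypothetical.

```python
from itertools import product


def depends_only_on(pmf, n, coords):
    """Return True if pmf(x) is determined by the coordinates listed in coords."""
    seen = {}
    for x in product((-1, 1), repeat=n):
        key = tuple(x[i] for i in coords)
        value = pmf(x)
        # Two points that agree on coords must receive the same probability.
        if key in seen and abs(seen[key] - value) > 1e-12:
            return False
        seen.setdefault(key, value)
    return True


def example_pmf(x):
    """Hypothetical 1-junta on {-1,1}^3: biased on coordinate 0, uniform elsewhere."""
    return (0.75 if x[0] == 1 else 0.25) / 4


is_junta_on_first = depends_only_on(example_pmf, 3, [0])
is_junta_on_second = depends_only_on(example_pmf, 3, [1])
```

Here the example distribution is a 1-junta because coordinate 0 alone determines the probability of every point, while coordinate 1 alone does not.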

DAN: A Deformation-Aware Network for Consecutive Biomedical Image Interpolation

Apr 23, 2020
Zejin Wang, Guoqing Li, Xi Chen, Hua Han

The continuity of biological tissue between consecutive biomedical images makes it possible for video interpolation algorithms to recover the large-area defects and tears that are common in biomedical images. However, noise and blur differences, large deformation, and drift between biomedical images make the task challenging. To address the problem, this paper introduces a deformation-aware network that synthesizes each pixel in accordance with the continuity of biological tissue. First, we develop a deformation-aware layer for consecutive biomedical image interpolation that implicitly adopts global perceptual deformation. Second, we present an adaptive style-balance loss that takes the style differences between consecutive biomedical images, such as blur and noise, into consideration. Guided by the deformation-aware module, we adaptively synthesize each pixel from a global domain, which further improves the performance of pixel synthesis. Quantitative and qualitative experiments on the benchmark dataset show that the proposed method is superior to state-of-the-art approaches.
