Hang Li

X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks

Nov 22, 2022
Yan Zeng, Xinsong Zhang, Hang Li, Jiawei Wang, Jipeng Zhang, Wangchunshu Zhou

Vision language pre-training aims to learn alignments between vision and language from a large amount of data. We previously proposed multi-grained vision language pre-training, a unified approach that can learn vision language alignments at multiple granularities. This paper advances the proposed method by unifying image and video encoding in one model and scaling up the model with large-scale data. We present X$^2$-VLM, a pre-trained VLM with a modular architecture for both image-text tasks and video-text tasks. Experimental results show that X$^2$-VLM performs the best at both base and large scales for both image-text and video-text tasks, making a good trade-off between performance and model scale. Moreover, we show that the modular design of X$^2$-VLM results in high transferability, allowing X$^2$-VLM to be utilized in any language or domain. For example, by simply replacing the text encoder with XLM-R, X$^2$-VLM outperforms state-of-the-art multilingual multi-modal pre-trained models without any multilingual pre-training. The code and pre-trained models will be available at github.com/zengyan-97/X2-VLM.

* 21 pages, 8 figures

Learning to Counterfactually Explain Recommendations

Nov 17, 2022
Yuanshun Yao, Chong Wang, Hang Li

Recommender system practitioners are facing increasing pressure to explain recommendations. We explore how to explain recommendations using counterfactual logic, i.e., "Had you not interacted with the following items before, it is likely we would not recommend this item." Compared to traditional explanation logic, counterfactual explanations are easier to understand and more technically verifiable. The major challenge of generating such explanations is the computational cost, because it requires repeatedly retraining the models to obtain the effect on a recommendation caused by removing user (interaction) history. We propose a learning-based framework to generate counterfactual explanations. The key idea is to train a surrogate model to learn the effect of removing a subset of user history on the recommendation. To this end, we first artificially simulate the counterfactual outcomes on the recommendation after deleting subsets of history. Then we train surrogate models to learn the mapping between a history deletion and the change in the recommendation caused by the deletion. Finally, to generate an explanation, we find the history subset that the surrogate model predicts is most likely to remove the recommendation. Through offline experiments and online user studies, we show that our method, compared to baselines, can generate explanations that are more counterfactually valid and that users consider more satisfactory.
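
The counterfactual logic described in this abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration: the toy similarity-sum recommender (`recommend`), the `sim` dictionary, and the exhaustive subset search are hypothetical stand-ins, not the authors' framework. In particular, the search re-scores the recommender for every candidate deletion, which is the expensive step the paper's learned surrogate model is meant to avoid.

```python
from itertools import combinations
from typing import Dict, List, Optional, Set, Tuple


def recommend(history: List[str], sim: Dict[Tuple[str, str], float],
              candidates: List[str], k: int = 1) -> List[str]:
    """Toy recommender (hypothetical): score each candidate by its summed
    similarity to the items in the user's history, return the top-k."""
    scores = {c: sum(sim.get((h, c), 0.0) for h in history) for c in candidates}
    # Sort by descending score; break ties by item name for determinism.
    ranked = sorted(candidates, key=lambda c: (-scores[c], c))
    return ranked[:k]


def counterfactual_explanation(history: List[str],
                               sim: Dict[Tuple[str, str], float],
                               candidates: List[str], target: str,
                               k: int = 1) -> Optional[Set[str]]:
    """Return a smallest subset of history whose removal drops `target`
    out of the top-k, or None if no such subset exists.

    Exhaustive search: exponential in len(history), shown only to make
    the counterfactual definition concrete.
    """
    for size in range(1, len(history) + 1):
        for subset in combinations(history, size):
            remaining = [h for h in history if h not in subset]
            if target not in recommend(remaining, sim, candidates, k):
                return set(subset)
    return None


# Hypothetical data: item "a" is the main reason "x" is recommended.
history = ["a", "b", "c"]
candidates = ["x", "y"]
sim = {("a", "x"): 0.9, ("b", "x"): 0.2, ("b", "y"): 0.1, ("c", "y"): 0.8}
explanation = counterfactual_explanation(history, sim, candidates, target="x")
```

With this toy data the explanation reads as "had you not interacted with item a, we would not have recommended x". A surrogate-based approach would replace the inner call to `recommend` with a learned prediction of the deletion's effect.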

Evaluating Fairness Without Sensitive Attributes: A Framework Using Only Auxiliary Models

Oct 06, 2022
Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Yang Liu, Hang Li

Although the volume of literature and public attention on machine learning fairness has been growing significantly, in practice some tasks as basic as measuring fairness, which is the first step in studying and promoting fairness, can be challenging. This is because sensitive attributes are often unavailable due to privacy regulations. The straightforward solution is to use auxiliary models to predict the missing sensitive attributes. However, our theoretical analyses show that the estimation error of the directly measured fairness metrics is proportional to the error rates of auxiliary models' predictions. Existing works that attempt to reduce the estimation error often require strong assumptions, e.g., access to the ground-truth sensitive attributes or some form of conditional independence. In this paper, we drop those assumptions and propose a framework that uses only off-the-shelf auxiliary models. The main challenge is how to reduce the negative impact of imperfectly predicted sensitive attributes on the fairness metrics without knowing the ground-truth sensitive attributes. Inspired by the noisy label learning literature, we first derive a closed-form relationship between the directly measured fairness metrics and their corresponding ground-truth metrics. We then estimate some key statistics (most importantly the transition matrix from the noisy label literature), which we use, together with the derived relationship, to calibrate the fairness metrics. In addition, we theoretically prove the upper bound of the estimation error in our calibrated metrics and show that our method can substantially decrease the estimation error, especially when auxiliary models are inaccurate or the target model is highly biased. Experiments on COMPAS and CelebA validate our theoretical analyses and show that our method can measure fairness significantly more accurately than baselines under favorable circumstances.

Optimal Power Allocation for Integrated Visible Light Positioning and Communication System with a Single LED-Lamp

Aug 30, 2022
Shuai Ma, Ruixin Yang, Bing Li, Yongyan Chen, Hang Li, Youlong Wu, Majid Safari, Shiyin Li, Naofal Al-Dhahir

In this paper, we investigate an integrated visible light positioning and communication (VLPC) system with a single LED-lamp. First, by leveraging the fact that the VLC channel model is a function of the receiver's location, we propose a system model that estimates the channel state information (CSI) based on the positioning information without transmitting pilot sequences. Second, we derive the Cramér-Rao lower bound (CRLB) on the positioning error variance and a lower bound on the achievable rate with on-off keying modulation. Third, based on the derived performance metrics, we optimize the power allocation to minimize the CRLB while satisfying the rate outage probability constraint. To tackle this non-convex optimization problem, we apply the worst-case distribution of the Conditional Value-at-Risk (CVaR) and the block coordinate descent (BCD) methods to obtain feasible solutions. Finally, the effects of critical system parameters, such as the outage probability, rate threshold, and total power threshold, are revealed by numerical results.

* 13 pages, 14 figures, accepted by IEEE Transactions on Communications

Forgetting Fast in Recommender Systems

Aug 14, 2022
Wenyan Liu, Juncheng Wan, Xiaoling Wang, Weinan Zhang, Dell Zhang, Hang Li

Users of a recommender system may want part of their data to be deleted, not only from the data repository but also from the underlying machine learning model, for privacy or utility reasons. Such right-to-be-forgotten requests could be fulfilled by simply retraining the recommendation model from scratch, but that would be too slow and too expensive in practice. In this paper, we investigate fast machine unlearning techniques for recommender systems that can remove the effect of a small amount of training data from the recommendation model without incurring the full cost of retraining. A natural idea to speed this process up is to fine-tune the current recommendation model on the remaining training data instead of starting from a random initialization. This warm-start strategy indeed works for neural recommendation models using standard 1st-order neural network optimizers (like AdamW). However, we have found that even greater acceleration could be achieved by employing 2nd-order (Newton or quasi-Newton) optimization methods instead. To overcome the prohibitively high computational cost of 2nd-order optimizers, we propose a new recommendation unlearning approach, AltEraser, which divides the optimization problem of unlearning into many small tractable sub-problems. Extensive experiments on three real-world recommendation datasets show promising results of AltEraser in terms of consistency (forgetting thoroughness), accuracy (recommendation effectiveness), and efficiency (unlearning speed). To our knowledge, this work represents the first attempt at fast approximate machine unlearning for state-of-the-art neural recommendation models.

Covert Beamforming Design for Integrated Radar Sensing and Communication Systems

Aug 11, 2022
Shuai Ma, Haihong Sheng, Ruixin Yang, Hang Li, Youlong Wu, Chao Shen, Naofal Al-Dhahir, Shiyin Li

We propose covert beamforming design frameworks for integrated radar sensing and communication (IRSC) systems, where the radar can covertly communicate with legitimate users under the cover of the probing waveforms without being detected by the eavesdropper. Specifically, by jointly designing the target detection beamformer and communication beamformer, we aim to maximize the radar detection mutual information (MI) (or the communication rate) subject to the covert constraint, the communication rate constraint (or the radar detection MI constraint), and the total power constraint. For the perfect eavesdropper's channel state information (CSI) scenario, we transform the covert beamforming design problems into a series of convex subproblems, by exploiting semidefinite relaxation, which can be solved via the bisection search method. Considering the high complexity of iterative optimization, we further propose a single-iterative covert beamformer design scheme based on the zero-forcing criterion. For the imperfect eavesdropper's CSI scenario, we develop a relaxation and restriction method to tackle the robust covert beamforming design problems. Simulation results demonstrate the effectiveness of the proposed covert beamforming schemes for perfect and imperfect CSI scenarios.

Biologically Inspired Neural Path Finding

Jun 13, 2022
Hang Li, Qadeer Khan, Volker Tresp, Daniel Cremers

The human brain can be considered to be a graphical structure comprising tens of billions of biological neurons connected by synapses. It has the remarkable ability to automatically re-route information flow through alternate paths in case some neurons are damaged. Moreover, the brain is capable of retaining information and applying it to similar but completely unseen scenarios. In this paper, we take inspiration from these attributes of the brain to develop a computational framework that finds the optimal low-cost path between a source node and a destination node in a generalized graph. We show that our framework is capable of handling unseen graphs at test time. Moreover, it can find alternate optimal paths when nodes are arbitrarily added or removed during inference, while maintaining a fixed prediction time. Code is available here: https://github.com/hangligit/pathfinding

On Calibration of Graph Neural Networks for Node Classification

Jun 03, 2022
Tong Liu, Yushan Liu, Marcel Hildebrandt, Mitchell Joblin, Hang Li, Volker Tresp

Graphs can model real-world, complex systems by representing entities and their interactions in terms of nodes and edges. To better exploit the graph structure, graph neural networks have been developed, which learn entity and edge embeddings for tasks such as node classification and link prediction. These models achieve good performance with respect to accuracy, but the confidence scores associated with the predictions might not be calibrated. That means that the scores might not reflect the ground-truth probabilities of the predicted events, which would be especially important for safety-critical applications. Even though graph neural networks are used for a wide range of tasks, the calibration thereof has not been sufficiently explored yet. We investigate the calibration of graph neural networks for node classification, study the effect of existing post-processing calibration methods, and analyze the influence of model capacity, graph density, and a new loss function on calibration. Further, we propose a topology-aware calibration method that takes the neighboring nodes into account and yields improved calibration compared to baseline methods.

* Accepted by IJCNN 2022 (IEEE WCCI 2022)
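
To make the notion of calibration in this abstract concrete, the sketch below computes the expected calibration error (ECE) with equal-width confidence bins and applies temperature scaling to logits. Both are common choices in the calibration literature, but they are assumptions here: the abstract does not say which metric or which post-processing methods the authors evaluate, and this is not their topology-aware method.

```python
import math
from typing import List, Sequence


def expected_calibration_error(confidences: Sequence[float],
                               correct: Sequence[bool],
                               n_bins: int = 10) -> float:
    """ECE: weighted average over bins of |mean confidence - accuracy|."""
    n = len(confidences)
    if n == 0:
        return 0.0
    bins: List[List[tuple]] = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Equal-width bins over [0, 1]; a confidence of 1.0 goes in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / n) * abs(avg_conf - accuracy)
    return ece


def softmax_with_temperature(logits: Sequence[float],
                             temperature: float = 1.0) -> List[float]:
    """Temperature scaling: divide logits by T before the softmax.
    T > 1 softens the distribution, T < 1 sharpens it."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    shift = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - shift) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]
```

A model that reports 90% confidence on four predictions but gets only three right is overconfident by about 0.15 in that bin, and the ECE reflects this gap. In practice the temperature would be fitted on held-out validation nodes, which this sketch leaves out.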

Directed Acyclic Transformer for Non-Autoregressive Machine Translation

May 16, 2022
Fei Huang, Hao Zhou, Yang Liu, Hang Li, Minlie Huang

Non-autoregressive Transformers (NATs) significantly reduce the decoding latency by generating all tokens in parallel. However, such independent predictions prevent NATs from capturing the dependencies between the tokens for generating multiple possible translations. In this paper, we propose Directed Acyclic Transformer (DA-Transformer), which represents the hidden states in a Directed Acyclic Graph (DAG), where each path of the DAG corresponds to a specific translation. The whole DAG simultaneously captures multiple translations and facilitates fast predictions in a non-autoregressive fashion. Experiments on the raw training data of the WMT benchmark show that DA-Transformer substantially outperforms previous NATs by about 3 BLEU on average and is the first NAT model to achieve competitive results with autoregressive Transformers without relying on knowledge distillation.

* accepted at ICML2022
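
The idea that each path through a DAG corresponds to one translation can be sketched as a small dynamic program. This is an illustration under stated assumptions, not the authors' decoder: the vertex tokens, the log-probabilities, and the function `best_path` are hypothetical, and the sketch simply picks the single highest-scoring path from the first vertex to the last.

```python
import math
from typing import Dict, List, Tuple


def best_path(tokens: List[str], token_logp: List[float],
              trans_logp: List[Dict[int, float]]) -> Tuple[List[str], float]:
    """Find the most probable path from vertex 0 to the last vertex.

    tokens[i]      token emitted at vertex i
    token_logp[i]  log-probability of that token
    trans_logp[i]  maps successor j (j > i only, so the graph is acyclic)
                   to the log-probability of the transition i -> j
    """
    n = len(tokens)
    score = [-math.inf] * n
    back = [-1] * n
    score[0] = token_logp[0]
    # Vertices are already in topological order because edges only go forward.
    for j in range(1, n):
        for i in range(j):
            t = trans_logp[i].get(j)
            if t is None:
                continue
            cand = score[i] + t + token_logp[j]
            if cand > score[j]:
                score[j] = cand
                back[j] = i
    if score[n - 1] == -math.inf:
        raise ValueError("last vertex is not reachable from the first")
    # Backtrack from the final vertex to recover the chosen path.
    path = []
    v = n - 1
    while v != -1:
        path.append(v)
        v = back[v]
    path.reverse()
    return [tokens[v] for v in path], score[n - 1]


# Hypothetical five-vertex graph holding two alternative translations.
tokens = ["<s>", "hello", "hi", "world", "</s>"]
token_logp = [0.0, -0.1, -0.5, -0.2, 0.0]
trans_logp = [
    {1: math.log(0.6), 2: math.log(0.4)},
    {3: 0.0},
    {3: math.log(0.5), 4: math.log(0.5)},
    {4: 0.0},
    {},
]
translation, logp = best_path(tokens, token_logp, trans_logp)
```

All vertex predictions are available at once, so only this lightweight path search is sequential. The toy graph stores both "hello world" and "hi" as alternative outputs, and the search selects the more probable one.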

How does Feedback Signal Quality Impact Effectiveness of Pseudo Relevance Feedback for Passage Retrieval?

May 12, 2022
Hang Li, Ahmed Mourad, Bevan Koopman, Guido Zuccon

Pseudo-Relevance Feedback (PRF) assumes that the top results retrieved by a first-stage ranker are relevant to the original query and uses them to improve the query representation for a second round of retrieval. This assumption, however, is often not correct: some or even all of the feedback documents may be irrelevant. Indeed, the effectiveness of PRF methods may well depend on the quality of the feedback signal and thus on the effectiveness of the first-stage ranker. This aspect, however, has received little attention before. In this paper we control the quality of the feedback signal and measure its impact on a range of PRF methods, including traditional bag-of-words methods (Rocchio) and dense vector-based methods (learnt and not learnt). Our results show the important role the quality of the feedback signal plays in the effectiveness of PRF methods. Importantly, and surprisingly, our analysis reveals that not all PRF methods are the same when dealing with feedback signals of varying quality. These findings are critical to gaining a better understanding of PRF methods, and of which methods should be used and when, depending on the feedback signal quality, and they set the basis for future research in this area.

* Accepted at SIGIR 2022
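
A minimal sketch of the PRF loop described in this abstract follows, using a Rocchio-style update on plain vectors. The weights `alpha` and `beta`, the dot-product ranker, and the function names are assumptions chosen for illustration and are not taken from the paper. The feedback documents are passed in explicitly so that their quality can be controlled, for example by substituting known-relevant or known-irrelevant documents for the top-k results.

```python
from typing import List, Sequence

Vector = List[float]


def rocchio_update(query: Sequence[float],
                   feedback: Sequence[Sequence[float]],
                   alpha: float = 1.0, beta: float = 0.5) -> Vector:
    """Rocchio-style update: alpha * query + beta * centroid(feedback)."""
    if not feedback:
        return [alpha * q for q in query]
    dim = len(query)
    centroid = [sum(doc[i] for doc in feedback) / len(feedback)
                for i in range(dim)]
    return [alpha * query[i] + beta * centroid[i] for i in range(dim)]


def rank(query: Sequence[float], docs: Sequence[Sequence[float]]) -> List[int]:
    """Return document indices sorted by descending dot-product score."""
    def score(i: int) -> float:
        return sum(q * d for q, d in zip(query, docs[i]))
    return sorted(range(len(docs)), key=score, reverse=True)


def prf_search(query: Sequence[float], docs: Sequence[Sequence[float]],
               k: int = 2, alpha: float = 1.0, beta: float = 0.5) -> List[int]:
    """Two-round retrieval: rank, take the top-k as pseudo-relevant
    feedback, update the query, then rank again."""
    first_round = rank(query, docs)
    feedback = [docs[i] for i in first_round[:k]]
    new_query = rocchio_update(query, feedback, alpha, beta)
    return rank(new_query, docs)
```

If the top-k documents from the first round are irrelevant, the centroid pulls the query away from the user's intent, which is the failure mode the abstract points to.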
