Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Information": models, code, and papers

Leveraging Privacy Profiles to Empower Users in the Digital Society

Apr 01, 2022
Davide Di Ruscio, Paola Inverardi, Patrizio Migliarini, Phuong T. Nguyen

Figure 1 for Leveraging Privacy Profiles to Empower Users in the Digital Society

Figure 2 for Leveraging Privacy Profiles to Empower Users in the Digital Society

Figure 3 for Leveraging Privacy Profiles to Empower Users in the Digital Society

Figure 4 for Leveraging Privacy Profiles to Empower Users in the Digital Society

Privacy and ethics of citizens are at the core of the concerns raised by our increasingly digital society. Profiling users is standard practice for software applications triggering the need for users, also enforced by laws, to properly manage privacy settings. Users need to manage software privacy settings properly to protect personally identifiable information and express personal ethical preferences. AI technologies that empower users to interact with the digital world by reflecting their personal ethical preferences can be key enablers of a trustworthy digital society. We focus on the privacy dimension and contribute a step in the above direction through an empirical study on an existing dataset collected from the fitness domain. We find out which set of questions is appropriate to differentiate users according to their preferences. The results reveal that a compact set of semantic-driven questions (about domain-independent privacy preferences) helps distinguish users better than a complex domain-dependent one. This confirms the study's hypothesis that moral attitudes are the relevant piece of information to collect. Based on the outcome, we implement a recommender system to provide users with suitable recommendations related to privacy choices. We then show that the proposed recommender system provides relevant settings to users, obtaining high accuracy.

* The paper consists of 37 pages, 11 figures

Via

Access Paper or Ask Questions

ConvMAE: Masked Convolution Meets Masked Autoencoders

May 08, 2022
Peng Gao, Teli Ma, Hongsheng Li, Jifeng Dai, Yu Qiao

Figure 1 for ConvMAE: Masked Convolution Meets Masked Autoencoders

Figure 2 for ConvMAE: Masked Convolution Meets Masked Autoencoders

Figure 3 for ConvMAE: Masked Convolution Meets Masked Autoencoders

Figure 4 for ConvMAE: Masked Convolution Meets Masked Autoencoders

Vision Transformers (ViT) become widely-adopted architectures for various vision tasks. Masked auto-encoding for feature pretraining and multi-scale hybrid convolution-transformer architectures can further unleash the potentials of ViT, leading to state-of-the-art performances on image classification, detection and semantic segmentation. In this paper, our ConvMAE framework demonstrates that multi-scale hybrid convolution-transformer can learn more discriminative representations via the mask auto-encoding scheme. However, directly using the original masking strategy leads to the heavy computational cost and pretraining-finetuning discrepancy. To tackle the issue, we adopt the masked convolution to prevent information leakage in the convolution blocks. A simple block-wise masking strategy is proposed to ensure computational efficiency. We also propose to more directly supervise the multi-scale features of the encoder to boost multi-scale features. Based on our pretrained ConvMAE models, ConvMAE-Base improves ImageNet-1K finetuning accuracy by 1.4% compared with MAE-Base. On object detection, ConvMAE-Base finetuned for only 25 epochs surpasses MAE-Base fined-tuned for 100 epochs by 2.9% box AP and 2.2% mask AP respectively. Code and pretrained models are available at https://github.com/Alpha-VL/ConvMAE.

* 10 pages

Via

Access Paper or Ask Questions

COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

May 04, 2022
Jiaxun Cui, Hang Qiu, Dian Chen, Peter Stone, Yuke Zhu

Figure 1 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

Figure 2 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

Figure 3 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

Figure 4 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

Optical sensors and learning algorithms for autonomous vehicles have dramatically advanced in the past few years. Nonetheless, the reliability of today's autonomous vehicles is hindered by the limited line-of-sight sensing capability and the brittleness of data-driven methods in handling extreme situations. With recent developments of telecommunication technologies, cooperative perception with vehicle-to-vehicle communications has become a promising paradigm to enhance autonomous driving in dangerous or emergency situations. We introduce COOPERNAUT, an end-to-end learning model that uses cross-vehicle perception for vision-based cooperative driving. Our model encodes LiDAR information into compact point-based representations that can be transmitted as messages between vehicles via realistic wireless channels. To evaluate our model, we develop AutoCastSim, a network-augmented driving simulation framework with example accident-prone scenarios. Our experiments on AutoCastSim suggest that our cooperative perception driving models lead to a 40% improvement in average success rate over egocentric driving models in these challenging driving situations and a 5 times smaller bandwidth requirement than prior work V2VNet. COOPERNAUT and AUTOCASTSIM are available at https://ut-austin-rpl.github.io/Coopernaut/.

Via

Access Paper or Ask Questions

Self-Attention for Incomplete Utterance Rewriting

Feb 26, 2022
Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao

Figure 1 for Self-Attention for Incomplete Utterance Rewriting

Figure 2 for Self-Attention for Incomplete Utterance Rewriting

Figure 3 for Self-Attention for Incomplete Utterance Rewriting

Figure 4 for Self-Attention for Incomplete Utterance Rewriting

Incomplete utterance rewriting (IUR) has recently become an essential task in NLP, aiming to complement the incomplete utterance with sufficient context information for comprehension. In this paper, we propose a novel method by directly extracting the coreference and omission relationship from the self-attention weight matrix of the transformer instead of word embeddings and edit the original text accordingly to generate the complete utterance. Benefiting from the rich information in the self-attention weight matrix, our method achieved competitive results on public IUR datasets.

* Accepted by the 47th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022)

Via

Access Paper or Ask Questions

Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

Mar 28, 2022
Ke Han, Chenyang Si, Yan Huang, Liang Wang, Tieniu Tan

Figure 1 for Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

Figure 2 for Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

Figure 3 for Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

Figure 4 for Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

In this paper, we investigate the generalization problem of person re-identification (re-id), whose major challenge is the distribution shift on an unseen domain. As an important tool of regularizing the distribution, batch normalization (BN) has been widely used in existing methods. However, they neglect that BN is severely biased to the training domain and inevitably suffers the performance drop if directly generalized without being updated. To tackle this issue, we propose Batch Norm Test-time Adaption (BNTA), a novel re-id framework that applies the self-supervised strategy to update BN parameters adaptively. Specifically, BNTA quickly explores the domain-aware information within unlabeled target data before inference, and accordingly modulates the feature distribution normalized by BN to adapt to the target domain. This is accomplished by two designed self-supervised auxiliary tasks, namely part positioning and part nearest neighbor matching, which help the model mine the domain-aware information with respect to the structure and identity of body parts, respectively. To demonstrate the effectiveness of our method, we conduct extensive experiments on three re-id datasets and confirm the superior performance to the state-of-the-art methods.

* accepted by AAAI 2022

Via

Access Paper or Ask Questions

Information Leakage in Embedding Models

Mar 31, 2020
Congzheng Song, Ananth Raghunathan

Figure 1 for Information Leakage in Embedding Models

Figure 2 for Information Leakage in Embedding Models

Figure 3 for Information Leakage in Embedding Models

Figure 4 for Information Leakage in Embedding Models

Embeddings are functions that map raw input data to low-dimensional vector representations, while preserving important semantic information about the inputs. Pre-training embeddings on a large amount of unlabeled data and fine-tuning them for downstream tasks is now a de facto standard in achieving state of the art learning in many domains. We demonstrate that embeddings, in addition to encoding generic semantics, often also present a vector that leaks sensitive information about the input data. We develop three classes of attacks to systematically study information that might be leaked by embeddings. First, embedding vectors can be inverted to partially recover some of the input data. As an example, we show that our attacks on popular sentence embeddings recover between 50\%--70\% of the input words (F1 scores of 0.5--0.7). Second, embeddings may reveal sensitive attributes inherent in inputs and independent of the underlying semantic task at hand. Attributes such as authorship of text can be easily extracted by training an inference model on just a handful of labeled embedding vectors. Third, embedding models leak moderate amount of membership information for infrequent training data inputs. We extensively evaluate our attacks on various state-of-the-art embedding models in the text domain. We also propose and evaluate defenses that can prevent the leakage to some extent at a minor cost in utility.

Via

Access Paper or Ask Questions

EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification

Apr 14, 2022
Abbas Salami, Javier Andreu-Perez, Helge Gillmeister

Figure 1 for EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification

Figure 2 for EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification

Figure 3 for EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification

Figure 4 for EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification

In recent years, neural networks and especially deep architectures have received substantial attention for EEG signal analysis in the field of brain-computer interfaces (BCIs). In this ongoing research area, the end-to-end models are more favoured than traditional approaches requiring signal transformation pre-classification. They can eliminate the need for prior information from experts and the extraction of handcrafted features. However, although several deep learning algorithms have been already proposed in the literature, achieving high accuracies for classifying motor movements or mental tasks, they often face a lack of interpretability and therefore are not quite favoured by the neuroscience community. The reasons behind this issue can be the high number of parameters and the sensitivity of deep neural networks to capture tiny yet unrelated discriminative features. We propose an end-to-end deep learning architecture called EEG-ITNet and a more comprehensible method to visualise the network learned patterns. Using inception modules and causal convolutions with dilation, our model can extract rich spectral, spatial, and temporal information from multi-channel EEG signals with less complexity (in terms of the number of trainable parameters) than other existing end-to-end architectures, such as EEG-Inception and EEG-TCNet. By an exhaustive evaluation on dataset 2a from BCI competition IV and OpenBMI motor imagery dataset, EEG-ITNet shows up to 5.9\% improvement in the classification accuracy in different scenarios with statistical significance compared to its competitors. We also comprehensively explain and support the validity of network illustration from a neuroscientific perspective. We have also made our code open at https://github.com/AbbasSalami/EEG-ITNet

* IEEE Access (2022)

Via

Access Paper or Ask Questions

Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

May 04, 2022
Yukun Yan, Sen Song

Figure 1 for Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

Figure 2 for Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

Figure 3 for Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

Figure 4 for Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

There has been a growing academic interest in the recognition of nested named entities in many domains. We tackle the task with a novel local hypergraph-based method: We first propose start token candidates and generate corresponding queries with their surrounding context, then use a query-based sequence labeling module to form a local hypergraph for each candidate. An end token estimator is used to correct the hypergraphs and get the final predictions. Compared to span-based approaches, our method is free of the high computation cost of span sampling and the risk of losing long entities. Sequential prediction makes it easier to leverage information in word order inside nested structures, and richer representations are built with a local hypergraph. Experiments show that our proposed method outperforms all the previous hypergraph-based and sequence labeling approaches with large margins on all four nested datasets. It achieves a new state-of-the-art F1 score on the ACE 2004 dataset and competitive F1 scores with previous state-of-the-art methods on three other nested NER datasets: ACE 2005, GENIA, and KBP 2017.

Via

Access Paper or Ask Questions

Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction

Apr 01, 2022
Congcong Liu, Yuejiang Li, Jian Zhu, Xiwei Zhao, Changping Peng, Zhangang Lin, Jingping Shao

Figure 1 for Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction

Figure 2 for Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction

Figure 3 for Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction

Figure 4 for Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction

Click-through rate (CTR) Prediction is of great importance in real-world online ads systems. One challenge for the CTR prediction task is to capture the real interest of users from their clicked items, which is inherently biased by presented positions of items, i.e., more front positions tend to obtain higher CTR values. A popular line of existing works focuses on explicitly estimating position bias by result randomization which is expensive and inefficient, or by inverse propensity weighting (IPW) which relies heavily on the quality of the propensity estimation. Another common solution is modeling position as features during offline training and simply adopting fixed value or dropout tricks when serving. However, training-inference inconsistency can lead to sub-optimal performance. Furthermore, post-click information such as position values is informative while less exploited in CTR prediction. This work proposes a simple yet efficient knowledge distillation framework to alleviate the impact of position bias and leverage position information to improve CTR prediction. We demonstrate the performance of our proposed method on a real-world production dataset and online A/B tests, achieving significant improvements over competing baseline models. The proposed method has been deployed in the real world online ads systems, serving main traffic on one of the world's largest e-commercial platforms.

Via

Access Paper or Ask Questions

Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

May 10, 2021
Steven Kleinegesse, Michael U. Gutmann

Figure 1 for Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

Figure 2 for Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

Figure 3 for Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

Figure 4 for Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

We introduce a framework for Bayesian experimental design (BED) with implicit models, where the data-generating distribution is intractable but sampling from it is still possible. In order to find optimal experimental designs for such models, our approach maximises mutual information lower bounds that are parametrised by neural networks. By training a neural network on sampled data, we simultaneously update network parameters and designs using stochastic gradient-ascent. The framework enables experimental design with a variety of prominent lower bounds and can be applied to a wide range of scientific tasks, such as parameter estimation, model discrimination and improving future predictions. Using a set of intractable toy models, we provide a comprehensive empirical comparison of prominent lower bounds applied to the aforementioned tasks. We further validate our framework on a challenging system of stochastic differential equations from epidemiology.

* Under review

Via

Access Paper or Ask Questions