Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hamid R. Rabiee

ARAE: Adversarially Robust Training of Autoencoders Improves Novelty Detection

Mar 12, 2020
Mohammadreza Salehi, Atrin Arya, Barbod Pajoum, Mohammad Otoofi, Amirreza Shaeiri, Mohammad Hossein Rohban, Hamid R. Rabiee

Figure 1 for ARAE: Adversarially Robust Training of Autoencoders Improves Novelty Detection

Figure 2 for ARAE: Adversarially Robust Training of Autoencoders Improves Novelty Detection

Figure 3 for ARAE: Adversarially Robust Training of Autoencoders Improves Novelty Detection

Figure 4 for ARAE: Adversarially Robust Training of Autoencoders Improves Novelty Detection

Autoencoders (AE) have recently been widely employed to approach the novelty detection problem. Trained only on the normal data, the AE is expected to reconstruct the normal data effectively while fail to regenerate the anomalous data, which could be utilized for novelty detection. However, in this paper, it is demonstrated that this does not always hold. AE often generalizes so perfectly that it can also reconstruct the anomalous data well. To address this problem, we propose a novel AE that can learn more semantically meaningful features. Specifically, we exploit the fact that adversarial robustness promotes learning of meaningful features. Therefore, we force the AE to learn such features by penalizing networks with a bottleneck layer that is unstable against adversarial perturbations. We show that despite using a much simpler architecture in comparison to the prior methods, the proposed AE outperforms or is competitive to state-of-the-art on three benchmark datasets.

Via

Access Paper or Ask Questions

ChOracle: A Unified Statistical Framework for Churn Prediction

Sep 15, 2019
Ali Khodadadi, Seyed Abbas Hosseini, Ehsan Pajouheshgar, Farnam Mansouri, Hamid R. Rabiee

Figure 1 for ChOracle: A Unified Statistical Framework for Churn Prediction

Figure 2 for ChOracle: A Unified Statistical Framework for Churn Prediction

Figure 3 for ChOracle: A Unified Statistical Framework for Churn Prediction

Figure 4 for ChOracle: A Unified Statistical Framework for Churn Prediction

User churn is an important issue in online services that threatens the health and profitability of services. Most of the previous works on churn prediction convert the problem into a binary classification task where the users are labeled as churned and non-churned. More recently, some works have tried to convert the user churn prediction problem into the prediction of user return time. In this approach which is more realistic in real world online services, at each time-step the model predicts the user return time instead of predicting a churn label. However, the previous works in this category suffer from lack of generality and require high computational complexity. In this paper, we introduce \emph{ChOracle}, an oracle that predicts the user churn by modeling the user return times to service by utilizing a combination of Temporal Point Processes and Recurrent Neural Networks. Moreover, we incorporate latent variables into the proposed recurrent neural network to model the latent user loyalty to the system. We also develop an efficient approximate variational algorithm for learning parameters of the proposed RNN by using back propagation through time. Finally, we demonstrate the superior performance of ChOracle on a wide variety of real world datasets.

* 12 pages

Via

Access Paper or Ask Questions

News Labeling as Early as Possible: Real or Fake?

Jun 08, 2019
Maryam Ramezani, Mina Rafiei, Soroush Omranpour, Hamid R. Rabiee

Figure 1 for News Labeling as Early as Possible: Real or Fake?

Figure 2 for News Labeling as Early as Possible: Real or Fake?

Figure 3 for News Labeling as Early as Possible: Real or Fake?

Figure 4 for News Labeling as Early as Possible: Real or Fake?

Making disguise between real and fake news propagation through online social networks is an important issue in many applications. The time gap between the news release time and detection of its label is a significant step towards broadcasting the real information and avoiding the fake. Therefore, one of the challenging tasks in this area is to identify fake and real news in early stages of propagation. However, there is a trade-off between minimizing the time gap and maximizing accuracy. Despite recent efforts in detection of fake news, there has been no significant work that explicitly incorporates early detection in its model. In this paper, we focus on accurate early labeling of news, and propose a model by considering earliness both in modeling and prediction. The proposed method utilizes recurrent neural networks with a novel loss function, and a new stopping rule. Given the context of news, we first embed it with a class-specific text representation. Then, we utilize the available public profile of users, and speed of news diffusion, for early labeling of the news. Experiments on real datasets demonstrate the effectiveness of our model both in terms of early labelling and accuracy, compared to the state of the art baseline and models.

Via

Access Paper or Ask Questions

Multi-layered Graph Embedding with Graph Convolutional Networks

Nov 22, 2018
Mahsa Ghorbani, Mahdieh Soleymani Baghshah, Hamid R. Rabiee

Figure 1 for Multi-layered Graph Embedding with Graph Convolutional Networks

Figure 2 for Multi-layered Graph Embedding with Graph Convolutional Networks

Figure 3 for Multi-layered Graph Embedding with Graph Convolutional Networks

Recently, graph embedding emerges as an effective approach for graph analysis tasks such as node classification and link prediction. The goal of network embedding is to find low dimensional representation of graph nodes that preserves the graph structure. Since there might be signals on nodes as features, recent methods like Graph Convolutional Networks (GCNs) try to consider node signals besides the node relations. On the other hand, multi-layered graph analysis has been received much attention. However, the recent methods for node embedding have not been explored in these networks. In this paper, we study the problem of node embedding in multi-layered graphs and propose a deep method that embeds nodes using both relations (connections within and between layers of the graph) and nodes signals. We evaluate our method on node classification tasks. Experimental results demonstrate the superiority of the proposed method to other multi-layered and single-layered competitors and also proves the effect of using cross-layer edges.

Via

Access Paper or Ask Questions

Adversarial Classifier for Imbalanced Problems

Nov 21, 2018
Ehsan Montahaei, Mahsa Ghorbani, Mahdieh Soleymani Baghshah, Hamid R. Rabiee

Figure 1 for Adversarial Classifier for Imbalanced Problems

Figure 2 for Adversarial Classifier for Imbalanced Problems

Figure 3 for Adversarial Classifier for Imbalanced Problems

Figure 4 for Adversarial Classifier for Imbalanced Problems

Adversarial approach has been widely used for data generation in the last few years. However, this approach has not been extensively utilized for classifier training. In this paper, we propose an adversarial framework for classifier training that can also handle imbalanced data. Indeed, a network is trained via an adversarial approach to give weights to samples of the majority class such that the obtained classification problem becomes more challenging for the discriminator and thus boosts its classification capability. In addition to the general imbalanced classification problems, the proposed method can also be used for problems such as graph representation learning in which it is desired to discriminate similar nodes from dissimilar nodes. Experimental results on imbalanced data classification and on the tasks like graph link prediction show the superiority of the proposed method compared to the state-of-the-art methods.

Via

Access Paper or Ask Questions

On Statistical Learning of Simplices: Unmixing Problem Revisited

Oct 18, 2018
Amir Najafi, Saeed Ilchi, Amir H. Saberi, Abolfazl Motahari, Babak H. Khalaj, Hamid R. Rabiee

Figure 1 for On Statistical Learning of Simplices: Unmixing Problem Revisited

Figure 2 for On Statistical Learning of Simplices: Unmixing Problem Revisited

Figure 3 for On Statistical Learning of Simplices: Unmixing Problem Revisited

Figure 4 for On Statistical Learning of Simplices: Unmixing Problem Revisited

Learning of high-dimensional simplices from uniformly-sampled observations, generally known as the "unmixing problem", is a long-studied task in computer science. More recently, a significant interest is focused on this problem from other areas, such as computational biology and remote sensing. In this paper, we have studied the Probably Approximately Correct (PAC)-learnability of simplices with a focus on sample complexity. Our analysis shows that a sufficient sample size for PAC-learning of $K$-simplices is only $O\left(K^2\log K\right)$, yielding a huge improvement over the existing results, i.e. $O\left(K^{22}\right)$. Moreover, a novel continuously-relaxed optimization scheme is proposed which is guaranteed to achieve a PAC-approximation of the simplex, followed by a corresponding scalable algorithm whose performance is extensively tested on synthetic and real-world datasets. Experimental results show that not only being comparable to other existing strategies on noiseless samples, our method is superior to the state-of-the-art in noisy cases. The overall proposed framework is backed with solid theoretical guarantees and provides a rigorous framework for future research in this area.

Via

Access Paper or Ask Questions

Continuous-Time Relationship Prediction in Dynamic Heterogeneous Information Networks

Oct 08, 2018
Sina Sajadmanesh, Sogol Bazargani, Jiawei Zhang, Hamid R. Rabiee

Figure 1 for Continuous-Time Relationship Prediction in Dynamic Heterogeneous Information Networks

Figure 2 for Continuous-Time Relationship Prediction in Dynamic Heterogeneous Information Networks

Figure 3 for Continuous-Time Relationship Prediction in Dynamic Heterogeneous Information Networks

Figure 4 for Continuous-Time Relationship Prediction in Dynamic Heterogeneous Information Networks

Online social networks, World Wide Web, media and technological networks, and other types of so-called information networks are ubiquitous nowadays. These information networks are inherently heterogeneous and dynamic. They are heterogeneous as they consist of multi-typed objects and relations, and they are dynamic as they are constantly evolving over time. One of the challenging issues in such heterogeneous and dynamic environments is to forecast those relationships in the network that will appear in the future. In this paper, we try to solve the problem of continuous-time relationship prediction in dynamic and heterogeneous information networks. This implies predicting the time it takes for a relationship to appear in the future, given its features that have been extracted by considering both heterogeneity and temporal dynamics of the underlying network. To this end, we first introduce a feature extraction framework that combines the power of meta-path-based modeling and recurrent neural networks to effectively extract features suitable for relationship prediction regarding heterogeneity and dynamicity of the networks. Next, we propose a supervised non-parametric approach, called Non-Parametric Generalized Linear Model (NP-GLM), which infers the hidden underlying probability distribution of the relationship building time given its features. We then present a learning algorithm to train NP-GLM and an inference method to answer time-related queries. Extensive experiments conducted on synthetic data and three real-world datasets, namely Delicious, MovieLens, and DBLP, demonstrate the effectiveness of NP-GLM in solving continuous-time relationship prediction problem vis-a-vis competitive baselines

* Submitted to ACM Transactions on Knowledge Discovery from Data. arXiv admin note: text overlap with arXiv:1706.06783

Via

Access Paper or Ask Questions

A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Apr 18, 2018
Seyed Ali Osia, Ali Shahin Shamsabadi, Ali Taheri, Kleomenis Katevas, Sina Sajadmanesh, Hamid R. Rabiee, Nicholas D. Lane, Hamed Haddadi

Figure 1 for A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Figure 2 for A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Figure 3 for A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Figure 4 for A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Deep Neural Networks are increasingly being used in a variety of machine learning applications applied to user data on the cloud. However, this approach introduces a number of privacy and efficiency challenges, as the cloud operator can perform secondary inferences on the available data. Recently, advances in edge processing have paved the way for more efficient, and private, data processing at the source for simple tasks and lighter models, though they remain a challenge for larger, and more complicated models. In this paper, we present a hybrid approach for breaking down large, complex deep models for cooperative, privacy-preserving analytics. We do this by breaking down the popular deep architectures and fine-tune them in a suitable way. We then evaluate the privacy benefits of this approach based on the information exposed to the cloud service. We also assess the local inference cost of different layers on a modern handset for mobile applications. Our evaluations show that by using certain kind of fine-tuning and embedding techniques and at a small processing cost, we can greatly reduce the level of information available to unintended tasks applied to the data features on the cloud, and hence achieving the desired tradeoff between privacy and performance.

Via

Access Paper or Ask Questions

Deep Private-Feature Extraction

Feb 28, 2018
Seyed Ali Osia, Ali Taheri, Ali Shahin Shamsabadi, Kleomenis Katevas, Hamed Haddadi, Hamid R. Rabiee

Figure 1 for Deep Private-Feature Extraction

Figure 2 for Deep Private-Feature Extraction

Figure 3 for Deep Private-Feature Extraction

Figure 4 for Deep Private-Feature Extraction

We present and evaluate Deep Private-Feature Extractor (DPFE), a deep model which is trained and evaluated based on information theoretic constraints. Using the selective exchange of information between a user's device and a service provider, DPFE enables the user to prevent certain sensitive information from being shared with a service provider, while allowing them to extract approved information using their model. We introduce and utilize the log-rank privacy, a novel measure to assess the effectiveness of DPFE in removing sensitive information and compare different models based on their accuracy-privacy tradeoff. We then implement and evaluate the performance of DPFE on smartphones to understand its complexity, resource demands, and efficiency tradeoffs. Our results on benchmark image datasets demonstrate that under moderate resource utilization, DPFE can achieve high accuracy for primary tasks while preserving the privacy of sensitive features.

Via

Access Paper or Ask Questions

Steering Social Activity: A Stochastic Optimal Control Point Of View

Feb 19, 2018
Ali Zarezade, Abir De, Utkarsh Upadhyay, Hamid R. Rabiee, Manuel Gomez-Rodriguez

Figure 1 for Steering Social Activity: A Stochastic Optimal Control Point Of View

Figure 2 for Steering Social Activity: A Stochastic Optimal Control Point Of View

Figure 3 for Steering Social Activity: A Stochastic Optimal Control Point Of View

Figure 4 for Steering Social Activity: A Stochastic Optimal Control Point Of View

User engagement in online social networking depends critically on the level of social activity in the corresponding platform--the number of online actions, such as posts, shares or replies, taken by their users. Can we design data-driven algorithms to increase social activity? At a user level, such algorithms may increase activity by helping users decide when to take an action to be more likely to be noticed by their peers. At a network level, they may increase activity by incentivizing a few influential users to take more actions, which in turn will trigger additional actions by other users. In this paper, we model social activity using the framework of marked temporal point processes, derive an alternate representation of these processes using stochastic differential equations (SDEs) with jumps and, exploiting this alternate representation, develop two efficient online algorithms with provable guarantees to steer social activity both at a user and at a network level. In doing so, we establish a previously unexplored connection between optimal control of jump SDEs and doubly stochastic marked temporal point processes, which is of independent interest. Finally, we experiment both with synthetic and real data gathered from Twitter and show that our algorithms consistently steer social activity more effectively than the state of the art.

* To appear in JMLR 2018. arXiv admin note: substantial text overlap with arXiv:1610.05773, arXiv:1703.02059

Via

Access Paper or Ask Questions