Xuan-Son Vu

Umeå University

Self-adaptive Privacy Concern Detection for User-generated Content

Jun 19, 2018

Xuan-Son Vu, Lili Jiang

Abstract: To protect user privacy in data analysis, a state-of-the-art strategy is differential privacy, in which scientific noise is injected into the real analysis output. The noise masks individuals' sensitive information contained in the dataset. However, determining the amount of noise is a key challenge, since too much noise destroys data utility while too little noise increases privacy risk. Although previous research has designed mechanisms to protect data privacy in different scenarios, most existing studies assume uniform privacy concerns for all individuals. Consequently, applying an equal amount of noise to all individuals leads to insufficient privacy protection for some users while over-protecting others. To address this issue, we propose a self-adaptive approach for privacy concern detection based on user personality. Our experimental studies demonstrate the effectiveness of the approach in providing suitable personalized privacy protection for cold-start users (i.e., users whose privacy-concern information is absent from the training data).

* Proceedings of the 19th International Conference on Computational Linguistics and Intelligent Text Processing, 2018
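
The noise/utility trade-off described in this abstract can be illustrated with the standard Laplace mechanism for a counting query. This is a minimal sketch and not the paper's self-adaptive method: the function names (`laplace_noise`, `private_count`, `mean_abs_error`) and the epsilon values used below are assumptions chosen for illustration only. A smaller epsilon (a stronger privacy concern) gives a larger noise scale and therefore a less accurate answer.

```python
import random


def noise_scale(sensitivity, epsilon):
    """Laplace scale b = sensitivity / epsilon (standard Laplace mechanism)."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return sensitivity / epsilon


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def private_count(true_count, epsilon, rng, sensitivity=1.0):
    """Return a noisy count; a counting query has sensitivity 1."""
    return true_count + laplace_noise(noise_scale(sensitivity, epsilon), rng)


def mean_abs_error(true_count, epsilon, rng, trials=2000):
    """Average absolute error of the noisy count over repeated releases."""
    total = 0.0
    for _ in range(trials):
        total += abs(private_count(true_count, epsilon, rng) - true_count)
    return total / trials


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical privacy budgets: low epsilon = high privacy concern.
    for label, eps in [("high concern", 0.1), ("low concern", 1.0)]:
        err = mean_abs_error(100, eps, rng)
        print(f"{label}: epsilon={eps}, mean abs error={err:.2f}")
```

Because the expected absolute value of Laplace noise equals its scale, the high-concern setting (epsilon 0.1) should show an average error roughly ten times that of the low-concern setting (epsilon 1.0).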

NIHRIO at SemEval-2018 Task 3: A Simple and Accurate Neural Network Model for Irony Detection in Twitter

Apr 08, 2018

Thanh Vu, Dat Quoc Nguyen, Xuan-Son Vu, Dai Quoc Nguyen, Michael Catt, Michael Trenell

Abstract: This paper describes our NIHRIO system for SemEval-2018 Task 3, "Irony detection in English tweets". We propose a simple Multilayer Perceptron neural network architecture with various types of input features, including lexical, syntactic, semantic and polarity features. Our system achieves very high performance in both the binary and multi-class irony detection subtasks. In particular, we rank third using the accuracy metric and fifth using the F1 metric. Our code is available at https://github.com/NIHRIO/IronyDetectionInTwitter

* In proceedings of the 12th International Workshop on Semantic Evaluation, SemEval 2018, to appear (6 pages, 2 figures)

Lexical-semantic resources: yet powerful resources for automatic personality classification

Nov 27, 2017

Xuan-Son Vu, Lucie Flekova, Lili Jiang, Iryna Gurevych

Abstract: In this paper, we aim to reveal the impact of lexical-semantic resources, used in particular for word sense disambiguation and sense-level semantic categorization, on the automatic personality classification task. While stylistic features (e.g., part-of-speech counts) have shown their power in this task, the impact of semantics beyond targeted word lists is relatively unexplored. We propose and extract three types of lexical-semantic features, which capture high-level concepts and emotions, overcoming the lexical gap of word n-grams. Our experimental results are comparable to state-of-the-art methods, while no personality-specific resources are required.

* GWC 2018, the 9th Global WordNet Conference

Mining User/Movie Preferred Features Based on Reviews for Video Recommendation System

Feb 09, 2017

Xuan-Son Vu, Seong-Bae Park

Abstract: In this work, we present an approach for mining user preferences and making recommendations based on reviews. Various studies have worked on the recommendation problem. However, most of them rely on a single aspect of user-generated content, such as user ratings or user feedback, to state user preferences. The problem is that mining a single aspect is insufficient for stating user preferences. For example, collaborative filtering recommendation tries to identify the preference trend of the crowd and then uses that trend to predict the current user's preference. Therefore, there is a gap between real user preferences and the trend of the crowd. Additionally, user preferences can be obtained by mining user reviews, since users often comment on various aspects of products. To solve this problem, we mainly focus on mining product aspects and user aspects inside user reviews to directly state user preferences. We also take Social Network Analysis into account for the cold-start item problem. For the cold-start user problem, a collaborative filtering algorithm is employed in our work. The framework is general enough to be applied to different recommendation domains. In theory, our method should achieve a significant improvement.

* The 2nd Workshop on Future Researches of Computer Science and Engineering, Kyungpook National University, pp. 21-24, 2014

Construction of Vietnamese SentiWordNet by using Vietnamese Dictionary

Dec 27, 2014

Xuan-Son Vu, Seong-Bae Park

Abstract: SentiWordNet is an important lexical resource supporting sentiment analysis in opinion mining applications. In this paper, we propose a novel approach to construct a Vietnamese SentiWordNet (VSWN). SentiWordNet is typically generated from WordNet, in which each synset has numerical scores that indicate its opinion polarities. Many previous studies obtained these scores by applying a machine learning method to WordNet. However, a Vietnamese WordNet was unfortunately not available at the time of writing. Therefore, we propose a method to construct VSWN from a Vietnamese dictionary rather than from WordNet. We show the effectiveness of the proposed method by automatically generating a VSWN with 39,561 synsets. The method is experimentally tested on 266 synsets in terms of positivity and negativity. It attains a competitive result compared with the English SentiWordNet, with differences of 0.066 and 0.052 for the positivity and negativity sets, respectively.

* The 40th Conference of the Korea Information Processing Society, pp. 745-748, April 2014, South Korea
* Accepted on April 9, 2014; best paper award
