Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ee-Peng Lim

ENCONTER: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer

Mar 17, 2021

Lee-Hsun Hsieh, Yang-Yin Lee, Ee-Peng Lim

Figure 1 for ENCONTER: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer

Figure 2 for ENCONTER: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer

Figure 3 for ENCONTER: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer

Figure 4 for ENCONTER: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer

Abstract:Pretrained using large amount of data, autoregressive language models are able to generate high quality sequences. However, these models do not perform well under hard lexical constraints as they lack fine control of content generation process. Progressive insertion-based transformers can overcome the above limitation and efficiently generate a sequence in parallel given some input tokens as constraint. These transformers however may fail to support hard lexical constraints as their generation process is more likely to terminate prematurely. The paper analyses such early termination problems and proposes the Entity-constrained insertion transformer (ENCONTER), a new insertion transformer that addresses the above pitfall without compromising much generation efficiency. We introduce a new training strategy that considers predefined hard lexical constraints (e.g., entities to be included in the generated sequence). Our experiments show that ENCONTER outperforms other baseline models in several performance metrics rendering it more suitable in practical applications. Our code is available at https://github.com/LARC-CMU-SMU/Enconter

* EACL 2021 long paper

Via

Access Paper or Ask Questions

DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

Mar 14, 2021

Zhiqiang Hu, Roy Ka-Wei Lee, Lei Wang, Ee-Peng Lim, Bo Dai

Figure 1 for DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

Figure 2 for DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

Figure 3 for DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

Figure 4 for DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

Abstract:Authorship attribution (AA), which is the task of finding the owner of a given text, is an important and widely studied research topic with many applications. Recent works have shown that deep learning methods could achieve significant accuracy improvement for the AA task. Nevertheless, most of these proposed methods represent user posts using a single type of feature (e.g., word bi-grams) and adopt a text classification approach to address the task. Furthermore, these methods offer very limited explainability of the AA results. In this paper, we address these limitations by proposing DeepStyle, a novel embedding-based framework that learns the representations of users' salient writing styles. We conduct extensive experiments on two real-world datasets from Twitter and Weibo. Our experiment results show that DeepStyle outperforms the state-of-the-art baselines on the AA task.

* Paper accepted for 4th APWeb-WAIM Joint Conference on Web and Big Data

Via

Access Paper or Ask Questions

Transforming Facial Weight of Real Images by Editing Latent Space of StyleGAN

Nov 05, 2020

V N S Rama Krishna Pinnimty, Matt Zhao, Palakorn Achananuparp, Ee-Peng Lim

Figure 1 for Transforming Facial Weight of Real Images by Editing Latent Space of StyleGAN

Figure 2 for Transforming Facial Weight of Real Images by Editing Latent Space of StyleGAN

Figure 3 for Transforming Facial Weight of Real Images by Editing Latent Space of StyleGAN

Figure 4 for Transforming Facial Weight of Real Images by Editing Latent Space of StyleGAN

Abstract:We present an invert-and-edit framework to automatically transform facial weight of an input face image to look thinner or heavier by leveraging semantic facial attributes encoded in the latent space of Generative Adversarial Networks (GANs). Using a pre-trained StyleGAN as the underlying generator, we first employ an optimization-based embedding method to invert the input image into the StyleGAN latent space. Then, we identify the facial-weight attribute direction in the latent space via supervised learning and edit the inverted latent code by moving it positively or negatively along the extracted feature axis. Our framework is empirically shown to produce high-quality and realistic facial-weight transformations without requiring training GANs with a large amount of labeled face images from scratch. Ultimately, our framework can be utilized as part of an intervention to motivate individuals to make healthier food choices by visualizing the future impacts of their behavior on appearance.

Via

Access Paper or Ask Questions

On Predicting Personal Values of Social Media Users using Community-Specific Language Features and Personal Value Correlation

Jul 16, 2020

Amila Silva, Pei-Chi Lo, Ee-Peng Lim

Figure 1 for On Predicting Personal Values of Social Media Users using Community-Specific Language Features and Personal Value Correlation

Figure 2 for On Predicting Personal Values of Social Media Users using Community-Specific Language Features and Personal Value Correlation

Figure 3 for On Predicting Personal Values of Social Media Users using Community-Specific Language Features and Personal Value Correlation

Figure 4 for On Predicting Personal Values of Social Media Users using Community-Specific Language Features and Personal Value Correlation

Abstract:Personal values have significant influence on individuals' behaviors, preferences, and decision making. It is therefore not a surprise that personal values of a person could influence his or her social media content and activities. Instead of getting users to complete personal value questionnaire, researchers have looked into a non-intrusive and highly scalable approach to predict personal values using user-generated social media data. Nevertheless, geographical differences in word usage and profile information are issues to be addressed when designing such prediction models. In this work, we focus on analyzing Singapore users' personal values, and developing effective models to predict their personal values using their Facebook data. These models leverage on word categories in Linguistic Inquiry and Word Count (LIWC) and correlations among personal values. The LIWC word categories are adapted to non-English word use in Singapore. We incorporate the correlations among personal values into our proposed Stack Model consisting of a task-specific layer of base models and a cross-stitch layer model. Through experiments, we show that our proposed model predicts personal values with considerable improvement of accuracy over the previous works. Moreover, we use the stack model to predict the personal values of a large community of Twitter users using their public tweet content and empirically derive several interesting findings about their online behavior consistent with earlier findings in the social science and social media literature.

Via

Access Paper or Ask Questions

Face to Purchase: Predicting Consumer Choices with Structured Facial and Behavioral Traits Embedding

Jul 14, 2020

Zhe Liu, Xianzhi Wang, Lina Yao, Jake An, Lei Bai, Ee-Peng Lim

Figure 1 for Face to Purchase: Predicting Consumer Choices with Structured Facial and Behavioral Traits Embedding

Figure 2 for Face to Purchase: Predicting Consumer Choices with Structured Facial and Behavioral Traits Embedding

Figure 3 for Face to Purchase: Predicting Consumer Choices with Structured Facial and Behavioral Traits Embedding

Figure 4 for Face to Purchase: Predicting Consumer Choices with Structured Facial and Behavioral Traits Embedding

Abstract:Predicting consumers' purchasing behaviors is critical for targeted advertisement and sales promotion in e-commerce. Human faces are an invaluable source of information for gaining insights into consumer personality and behavioral traits. However, consumer's faces are largely unexplored in previous research, and the existing face-related studies focus on high-level features such as personality traits while neglecting the business significance of learning from facial data. We propose to predict consumers' purchases based on their facial features and purchasing histories. We design a semi-supervised model based on a hierarchical embedding network to extract high-level features of consumers and to predict the top-$N$ purchase destinations of a consumer. Our experimental results on a real-world dataset demonstrate the positive effect of incorporating facial information in predicting consumers' purchasing behaviors.

Via

Access Paper or Ask Questions

RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Mar 05, 2020

Helena H. Lee, Ke Shu, Palakorn Achananuparp, Philips Kokoh Prasetyo, Yue Liu, Ee-Peng Lim, Lav R. Varshney

Figure 1 for RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Figure 2 for RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Figure 3 for RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Figure 4 for RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System

Abstract:Interests in the automatic generation of cooking recipes have been growing steadily over the past few years thanks to a large amount of online cooking recipes. We present RecipeGPT, a novel online recipe generation and evaluation system. The system provides two modes of text generations: (1) instruction generation from given recipe title and ingredients; and (2) ingredient generation from recipe title and cooking instructions. Its back-end text generation module comprises a generative pre-trained language model GPT-2 fine-tuned on a large cooking recipe dataset. Moreover, the recipe evaluation module allows the users to conveniently inspect the quality of the generated recipe contents and store the results for future reference. RecipeGPT can be accessed online at https://recipegpt.org/.

* Accepted to WWW 2020. Demo track paper

Via

Access Paper or Ask Questions

JPLink: On Linking Jobs to Vocational Interest Types

Feb 06, 2020

Amila Silva, Pei-Chi Lo, Ee-Peng Lim

Figure 1 for JPLink: On Linking Jobs to Vocational Interest Types

Figure 2 for JPLink: On Linking Jobs to Vocational Interest Types

Figure 3 for JPLink: On Linking Jobs to Vocational Interest Types

Figure 4 for JPLink: On Linking Jobs to Vocational Interest Types

Abstract:Linking job seekers with relevant jobs requires matching based on not only skills, but also personality types. Although the Holland Code also known as RIASEC has frequently been used to group people by their suitability for six different categories of occupations, the RIASEC category labels of individual jobs are often not found in job posts. This is attributed to significant manual efforts required for assigning job posts with RIASEC labels. To cope with assigning massive number of jobs with RIASEC labels, we propose JPLink, a machine learning approach using the text content in job titles and job descriptions. JPLink exploits domain knowledge available in an occupation-specific knowledge base known as O*NET to improve feature representation of job posts. To incorporate relative ranking of RIASEC labels of each job, JPLink proposes a listwise loss function inspired by learning to rank. Both our quantitative and qualitative evaluations show that JPLink outperforms conventional baselines. We conduct an error analysis on JPLink's predictions to show that it can uncover label errors in existing job posts.

Via

Access Paper or Ask Questions

FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

Sep 26, 2019

Doyen Sahoo, Wang Hao, Shu Ke, Wu Xiongwei, Hung Le, Palakorn Achananuparp, Ee-Peng Lim, Steven C. H. Hoi

Figure 1 for FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

Figure 2 for FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

Figure 3 for FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

Figure 4 for FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

Abstract:An important aspect of health monitoring is effective logging of food consumption. This can help management of diet-related diseases like obesity, diabetes, and even cardiovascular diseases. Moreover, food logging can help fitness enthusiasts, and people who wanting to achieve a target weight. However, food-logging is cumbersome, and requires not only taking additional effort to note down the food item consumed regularly, but also sufficient knowledge of the food item consumed (which is difficult due to the availability of a wide variety of cuisines). With increasing reliance on smart devices, we exploit the convenience offered through the use of smart phones and propose a smart-food logging system: FoodAI, which offers state-of-the-art deep-learning based image recognition capabilities. FoodAI has been developed in Singapore and is particularly focused on food items commonly consumed in Singapore. FoodAI models were trained on a corpus of 400,000 food images from 756 different classes. In this paper we present extensive analysis and insights into the development of this system. FoodAI has been deployed as an API service and is one of the components powering Healthy 365, a mobile app developed by Singapore's Heath Promotion Board. We have over 100 registered organizations (universities, companies, start-ups) subscribing to this service and actively receive several API requests a day. FoodAI has made food logging convenient, aiding smart consumption and a healthy lifestyle.

* Published at KDD 2019 (Applied Data Science track). Demo is accessible at https://foodai.org/

Via

Access Paper or Ask Questions

Estimating Glycemic Impact of Cooking Recipes via Online Crowdsourcing and Machine Learning

Sep 17, 2019

Helena Lee, Palakorn Achananuparp, Yue Liu, Ee-Peng Lim, Lav R. Varshney

Figure 1 for Estimating Glycemic Impact of Cooking Recipes via Online Crowdsourcing and Machine Learning

Abstract:Consumption of diets with low glycemic impact is highly recommended for diabetics and pre-diabetics as it helps maintain their blood glucose levels. However, laboratory analysis of dietary glycemic potency is time-consuming and expensive. In this paper, we explore a data-driven approach utilizing online crowdsourcing and machine learning to estimate the glycemic impact of cooking recipes. We show that a commonly used healthiness metric may not always be effective in determining recipes suitable for diabetics, thus emphasizing the importance of the glycemic-impact estimation task. Our best classification model, trained on nutritional and crowdsourced data obtained from Amazon Mechanical Turk (AMT), can accurately identify recipes which are unhealthful for diabetics.

* To appear in the Proceedings of Digital Public Health 2019 as short paper

Via

Access Paper or Ask Questions

Characterizing and Predicting Repeat Food Consumption Behavior for Just-in-Time Interventions

Sep 17, 2019

Yue Liu, Helena Lee, Palakorn Achananuparp, Ee-Peng Lim, Tzu-Ling Cheng, Shou-De Lin

Figure 1 for Characterizing and Predicting Repeat Food Consumption Behavior for Just-in-Time Interventions

Figure 2 for Characterizing and Predicting Repeat Food Consumption Behavior for Just-in-Time Interventions

Figure 3 for Characterizing and Predicting Repeat Food Consumption Behavior for Just-in-Time Interventions

Figure 4 for Characterizing and Predicting Repeat Food Consumption Behavior for Just-in-Time Interventions

Abstract:Human beings are creatures of habit. In their daily life, people tend to repeatedly consume similar types of food items over several days and occasionally switch to consuming different types of items when the consumptions become overly monotonous. However, the novel and repeat consumption behaviors have not been studied in food recommendation research. More importantly, the ability to predict daily eating habits of individuals is crucial to improve the effectiveness of food recommender systems in facilitating healthy lifestyle change. In this study, we analyze the patterns of repeat food consumptions using large-scale consumption data from a popular online fitness community called MyFitnessPal (MFP), conduct an offline evaluation of various state-of-the-art algorithms in predicting the next-day food consumption, and analyze their performance across different demographic groups and contexts. The experiment results show that algorithms incorporating the exploration-and-exploitation and temporal dynamics are more effective in the next-day recommendation task than most state-of-the-art algorithms.

* To appear in the Proceedings of Digital Public Health 2019

Via

Access Paper or Ask Questions