Yushi Yao

Context-aware Sentiment Word Identification: sentiword2vec

Dec 12, 2016

Yushi Yao, Guangjian Li

Abstract: Traditional sentiment analysis often uses a sentiment dictionary to extract sentiment information from text and classify documents. However, emerging informal words and phrases in user-generated content call for context-aware analysis, since such words often carry special meanings in a particular context. Because word vectors perform well at representing inter-word relations, we use sentiment word vectors to identify these special words. Building on the distributed language model word2vec, this paper presents a novel method for representing the sentiment of a word in a particular context; specifically, it identifies words with abnormal sentiment polarity in long answers. Results show that the improved model represents words with special meanings better while still representing special idiomatic patterns well. Finally, we discuss what vector representations mean in the field of sentiment, which may differ from general object-based conditions.

* 15 pages
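
The idea of spotting words whose sentiment in context departs from their dictionary polarity can be illustrated with a toy sketch. This is not the paper's method: the scoring rule (cosine similarity to a positive seed vector minus cosine similarity to a negative seed vector), the function names, and the 2-D vectors are all assumptions made for illustration only.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def polarity(vec, pos_seed, neg_seed):
    """Hypothetical polarity score: positive if vec is closer to pos_seed."""
    return cosine(vec, pos_seed) - cosine(vec, neg_seed)

def abnormal_words(context_vectors, dictionary_polarity, pos_seed, neg_seed):
    """Return words whose in-context polarity sign contradicts the dictionary."""
    flagged = []
    for word, vec in context_vectors.items():
        expected = dictionary_polarity.get(word)
        if expected is None:
            continue  # word is not in the sentiment dictionary
        if polarity(vec, pos_seed, neg_seed) * expected < 0:
            flagged.append(word)
    return flagged

# Toy data (assumed): in this corpus "sick" is used approvingly.
pos_seed = [1.0, 0.0]
neg_seed = [0.0, 1.0]
context_vectors = {"sick": [0.9, 0.1], "great": [0.8, 0.2], "awful": [0.1, 0.9]}
dictionary_polarity = {"sick": -1, "great": 1, "awful": -1}

flagged = abnormal_words(context_vectors, dictionary_polarity, pos_seed, neg_seed)
print(flagged)
```

In practice the vectors would come from a model trained on the target corpus rather than being written by hand.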


Bi-directional LSTM Recurrent Neural Network for Chinese Word Segmentation

Feb 16, 2016

Yushi Yao, Zheng Huang

Abstract: Recurrent neural networks (RNNs) have been broadly applied to natural language processing (NLP) problems. This kind of neural network is designed for modeling sequential data and has proven quite effective in sequential tagging tasks. In this paper, we propose using a bi-directional RNN with long short-term memory (LSTM) units for Chinese word segmentation, a crucial preprocessing task for modeling Chinese sentences and articles. Classical methods focus on designing and combining hand-crafted features from context, whereas a bi-directional LSTM network (BLSTM) needs no prior knowledge or pre-designed features and is adept at keeping contextual information in both directions. Experimental results show that our approach achieves state-of-the-art performance in word segmentation on both traditional Chinese and simplified Chinese datasets.

* 2 figures
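
Treating word segmentation as sequential tagging means assigning one label per character and reading the word boundaries off the labels. The sketch below assumes the common B/M/E/S scheme (begin, middle, end, single-character word); the abstract does not say which tag set the paper uses, and the helper names are hypothetical. It covers only the label encoding and decoding that a tagger such as a BLSTM would be trained to predict, not the network itself.

```python
def words_to_tags(words):
    """Encode a segmented sentence as one B/M/E/S tag per character."""
    tags = []
    for word in words:
        if not word:
            continue  # ignore empty strings
        if len(word) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return tags

def tags_to_words(chars, tags):
    """Decode per-character tags back into words; a word closes on E or S."""
    words, current = [], ""
    for ch, tag in zip(chars, tags):
        current += ch
        if tag in ("E", "S"):
            words.append(current)
            current = ""
    if current:
        words.append(current)  # flush an unterminated trailing word
    return words

words = ["我", "喜欢", "自然语言"]
tags = words_to_tags(words)
print(tags)
print(tags_to_words("".join(words), tags))
```

A model trained this way takes the unsegmented character sequence as input and predicts the tag sequence; decoding then recovers the segmentation.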
