Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kim Anh Nguyen

Attentive Neural Network for Named Entity Recognition in Vietnamese

Oct 31, 2018

Ngan Dong, Kim Anh Nguyen

Figure 1 for Attentive Neural Network for Named Entity Recognition in Vietnamese

Figure 2 for Attentive Neural Network for Named Entity Recognition in Vietnamese

Figure 3 for Attentive Neural Network for Named Entity Recognition in Vietnamese

Figure 4 for Attentive Neural Network for Named Entity Recognition in Vietnamese

Abstract:We propose an attentive neural network for the task of named entity recognition in Vietnamese. The proposed attentive neural model makes use of character-based language models and word embeddings to encode words as vector representations. A neural network architecture of encoder, attention, and decoder layers is then utilized to encode knowledge of input sentences and to label entity tags. The experimental results show that the proposed attentive neural network achieves the state-of-the-art results on the benchmark named entity recognition datasets in Vietnamese in comparison to both hand-crafted features based models and neural models.

Via

Access Paper or Ask Questions

Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

Apr 19, 2018

Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Figure 1 for Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

Figure 2 for Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

Figure 3 for Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

Figure 4 for Introducing two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness

Abstract:We present two novel datasets for the low-resource language Vietnamese to assess models of semantic similarity: ViCon comprises pairs of synonyms and antonyms across word classes, thus offering data to distinguish between similarity and dissimilarity. ViSim-400 provides degrees of similarity across five semantic relations, as rated by human judges. The two datasets are verified through standard co-occurrence and neural network models, showing results comparable to the respective English datasets.

* The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2018)

Via

Access Paper or Ask Questions

Hierarchical Embeddings for Hypernymy Detection and Directionality

Jul 23, 2017

Kim Anh Nguyen, Maximilian Köper, Sabine Schulte im Walde, Ngoc Thang Vu

Figure 1 for Hierarchical Embeddings for Hypernymy Detection and Directionality

Figure 2 for Hierarchical Embeddings for Hypernymy Detection and Directionality

Figure 3 for Hierarchical Embeddings for Hypernymy Detection and Directionality

Figure 4 for Hierarchical Embeddings for Hypernymy Detection and Directionality

Abstract:We present a novel neural model HyperVec to learn hierarchical embeddings for hypernymy detection and directionality. While previous embeddings have shown limitations on prototypical hypernyms, HyperVec represents an unsupervised measure where embeddings are learned in a specific order and capture the hypernym$-$hyponym distributional hierarchy. Moreover, our model is able to generalize over unseen hypernymy pairs, when using only small sets of training data, and by mapping to other languages. Results on benchmark datasets show that HyperVec outperforms both state$-$of$-$the$-$art unsupervised measures and embedding models on hypernymy detection and directionality, and on predicting graded lexical entailment.

* 11 pages, accepted as long paper at EMNLP 2017

Via

Access Paper or Ask Questions

Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

Jan 11, 2017

Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Figure 1 for Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

Figure 2 for Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

Figure 3 for Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

Figure 4 for Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network

Abstract:Distinguishing between antonyms and synonyms is a key task to achieve high performance in NLP systems. While they are notoriously difficult to distinguish by distributional co-occurrence models, pattern-based methods have proven effective to differentiate between the relations. In this paper, we present a novel neural network model AntSynNET that exploits lexico-syntactic patterns from syntactic parse trees. In addition to the lexical and syntactic information, we successfully integrate the distance between the related words along the syntactic path as a new pattern feature. The results from classification experiments show that AntSynNET improves the performance over prior pattern-based methods.

* EACL2017
* EACL 2017, 10 pages

Via

Access Paper or Ask Questions

Neural-based Noise Filtering from Word Embeddings

Oct 06, 2016

Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Figure 1 for Neural-based Noise Filtering from Word Embeddings

Figure 2 for Neural-based Noise Filtering from Word Embeddings

Figure 3 for Neural-based Noise Filtering from Word Embeddings

Figure 4 for Neural-based Noise Filtering from Word Embeddings

Abstract:Word embeddings have been demonstrated to benefit NLP tasks impressively. Yet, there is room for improvement in the vector representations, because current word embeddings typically contain unnecessary information, i.e., noise. We propose two novel models to improve word embeddings by unsupervised learning, in order to yield word denoising embeddings. The word denoising embeddings are obtained by strengthening salient information and weakening noise in the original word embeddings, based on a deep feed-forward neural network filter. Results from benchmark tasks show that the filtered word denoising embeddings outperform the original word embeddings.

* 9 pages, 4 figures, COLING 2016

Via

Access Paper or Ask Questions

Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

May 25, 2016

Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu

Figure 1 for Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

Figure 2 for Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

Figure 3 for Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

Figure 4 for Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

Abstract:We propose a novel vector representation that integrates lexical contrast into distributional vectors and strengthens the most salient features for determining degrees of word similarity. The improved vectors significantly outperform standard models and distinguish antonyms from synonyms with an average precision of 0.66-0.76 across word classes (adjectives, nouns, verbs). Moreover, we integrate the lexical contrast vectors into the objective function of a skip-gram model. The novel embedding outperforms state-of-the-art models on predicting word similarities in SimLex-999, and on distinguishing antonyms from synonyms.

* 6 pages, 4 figures, InProc ACL 2016

Via

Access Paper or Ask Questions