Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jugal Kalita

University of Colorado at Colorado Springs

Building a Chatbot on a Closed Domain using RASA

Aug 12, 2022

Khang Nhut Lam, Nam Nhat Le, Jugal Kalita

Figure 1 for Building a Chatbot on a Closed Domain using RASA

Figure 2 for Building a Chatbot on a Closed Domain using RASA

Figure 3 for Building a Chatbot on a Closed Domain using RASA

Figure 4 for Building a Chatbot on a Closed Domain using RASA

Abstract:In this study, we build a chatbot system in a closed domain with the RASA framework, using several models such as SVM for classifying intents, CRF for extracting entities and LSTM for predicting action. To improve responses from the bot, the kNN algorithm is used to transform false entities extracted into true entities. The knowledge domain of our chatbot is about the College of Information and Communication Technology of Can Tho University, Vietnam. We manually construct a chatbot corpus with 19 intents, 441 sentence patterns of intents, 253 entities and 133 stories. Experiment results show that the bot responds well to relevant questions.

* Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval, pp. 144-148. 2020
* 5 pages

Via

Access Paper or Ask Questions

Creating Lexical Resources for Endangered Languages

Aug 08, 2022

Khang Nhut Lam, Feras Al Tarouti, Jugal Kalita

Figure 1 for Creating Lexical Resources for Endangered Languages

Figure 2 for Creating Lexical Resources for Endangered Languages

Figure 3 for Creating Lexical Resources for Endangered Languages

Figure 4 for Creating Lexical Resources for Endangered Languages

Abstract:This paper examines approaches to generate lexical resources for endangered languages. Our algorithms construct bilingual dictionaries and multilingual thesauruses using public Wordnets and a machine translator (MT). Since our work relies on only one bilingual dictionary between an endangered language and an "intermediate helper" language, it is applicable to languages that lack many existing resources.

* Proceedings of the 2014 Workshop on the Use of Computational Methods in the Study of Endangered Languages, pp. 54-62. 2014
* 9 pages

Via

Access Paper or Ask Questions

Automatically constructing Wordnet synsets

Aug 08, 2022

Khang Nhut Lam, Feras Al Tarouti, Jugal Kalita

Figure 1 for Automatically constructing Wordnet synsets

Figure 2 for Automatically constructing Wordnet synsets

Figure 3 for Automatically constructing Wordnet synsets

Figure 4 for Automatically constructing Wordnet synsets

Abstract:Manually constructing a Wordnet is a difficult task, needing years of experts' time. As a first step to automatically construct full Wordnets, we propose approaches to generate Wordnet synsets for languages both resource-rich and resource-poor, using publicly available Wordnets, a machine translator and/or a single bilingual dictionary. Our algorithms translate synsets of existing Wordnets to a target language T, then apply a ranking method on the translation candidates to find best translations in T. Our approaches are applicable to any language which has at least one existing bilingual dictionary translating from English to it.

* Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 106-111. 2014
* 6 pages

Via

Access Paper or Ask Questions

Creating Reverse Bilingual Dictionaries

Aug 08, 2022

Khang Nhut Lam, Jugal Kalita

Figure 1 for Creating Reverse Bilingual Dictionaries

Figure 2 for Creating Reverse Bilingual Dictionaries

Abstract:Bilingual dictionaries are expensive resources and not many are available when one of the languages is resource-poor. In this paper, we propose algorithms for creation of new reverse bilingual dictionaries from existing bilingual dictionaries in which English is one of the two languages. Our algorithms exploit the similarity between word-concept pairs using the English Wordnet to produce reverse dictionary entries. Since our algorithms rely on available bilingual dictionaries, they are applicable to any bilingual dictionary as long as one of the two languages has Wordnet type lexical ontology.

* Proceedings of the 2013 conference of the North American chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 524-528. 2013
* 5 pages

Via

Access Paper or Ask Questions

Phrase translation using a bilingual dictionary and n-gram data: A case study from Vietnamese to English

Aug 05, 2022

Khang Nhut Lam, Feras Al Tarouti, Jugal Kalita

Figure 1 for Phrase translation using a bilingual dictionary and n-gram data: A case study from Vietnamese to English

Abstract:Past approaches to translate a phrase in a language L1 to a language L2 using a dictionary-based approach require grammar rules to restructure initial translations. This paper introduces a novel method without using any grammar rules to translate a given phrase in L1, which does not exist in the dictionary, to L2. We require at least one L1-L2 bilingual dictionary and n-gram data in L2. The average manual evaluation score of our translations is 4.29/5.00, which implies very high quality.

* In Proceedings of the 11th Workshop on Multiword Expressions, pp. 65-69. 2015
* 5 pages

Via

Access Paper or Ask Questions

Towards Multimodal Vision-Language Models Generating Non-Generic Text

Jul 09, 2022

Wes Robbins, Zanyar Zohourianshahzadi, Jugal Kalita

Figure 1 for Towards Multimodal Vision-Language Models Generating Non-Generic Text

Figure 2 for Towards Multimodal Vision-Language Models Generating Non-Generic Text

Figure 3 for Towards Multimodal Vision-Language Models Generating Non-Generic Text

Figure 4 for Towards Multimodal Vision-Language Models Generating Non-Generic Text

Abstract:Vision-language models can assess visual context in an image and generate descriptive text. While the generated text may be accurate and syntactically correct, it is often overly general. To address this, recent work has used optical character recognition to supplement visual information with text extracted from an image. In this work, we contend that vision-language models can benefit from additional information that can be extracted from an image, but are not used by current models. We modify previous multimodal frameworks to accept relevant information from any number of auxiliary classifiers. In particular, we focus on person names as an additional set of tokens and create a novel image-caption dataset to facilitate captioning with person names. The dataset, Politicians and Athletes in Captions (PAC), consists of captioned images of well-known people in context. By fine-tuning pretrained models with this dataset, we demonstrate a model that can naturally integrate facial recognition tokens into generated text by training on limited data. For the PAC dataset, we provide a discussion on collection and baseline benchmark scores.

* 2021 International Conference on Natural Language Processing

Via

Access Paper or Ask Questions

ZoDIAC: Zoneout Dropout Injection Attention Calculation

Jun 28, 2022

Zanyar Zohourianshahzadi, Jugal Kalita

Figure 1 for ZoDIAC: Zoneout Dropout Injection Attention Calculation

Figure 2 for ZoDIAC: Zoneout Dropout Injection Attention Calculation

Figure 3 for ZoDIAC: Zoneout Dropout Injection Attention Calculation

Figure 4 for ZoDIAC: Zoneout Dropout Injection Attention Calculation

Abstract:Recently the use of self-attention has yielded to state-of-the-art results in vision-language tasks such as image captioning as well as natural language understanding and generation (NLU and NLG) tasks and computer vision tasks such as image classification. This is since self-attention maps the internal interactions among the elements of input source and target sequences. Although self-attention successfully calculates the attention values and maps the relationships among the elements of input source and target sequence, yet there is no mechanism to control the intensity of attention. In real world, when communicating with each other face to face or vocally, we tend to express different visual and linguistic context with various amounts of intensity. Some words might carry (be spoken with) more stress and weight indicating the importance of that word in the context of the whole sentence. Based on this intuition, we propose Zoneout Dropout Injection Attention Calculation (ZoDIAC) in which the intensities of attention values in the elements of the input sequence are calculated with respect to the context of the elements of input sequence. The results of our experiments reveal that employing ZoDIAC leads to better performance in comparison with the self-attention module in the Transformer model. The ultimate goal is to find out if we could modify self-attention module in the Transformer model with a method that is potentially extensible to other models that leverage on self-attention at their core. Our findings suggest that this particular goal deserves further attention and investigation by the research community. The code for ZoDIAC is available on www.github.com/zanyarz/zodiac .

* This work has been submitted to SN-AIRE journal and is currently under review

Via

Access Paper or Ask Questions

Using Random Perturbations to Mitigate Adversarial Attacks on Sentiment Analysis Models

Feb 11, 2022

Abigail Swenor, Jugal Kalita

Figure 1 for Using Random Perturbations to Mitigate Adversarial Attacks on Sentiment Analysis Models

Figure 2 for Using Random Perturbations to Mitigate Adversarial Attacks on Sentiment Analysis Models

Figure 3 for Using Random Perturbations to Mitigate Adversarial Attacks on Sentiment Analysis Models

Figure 4 for Using Random Perturbations to Mitigate Adversarial Attacks on Sentiment Analysis Models

Abstract:Attacks on deep learning models are often difficult to identify and therefore are difficult to protect against. This problem is exacerbated by the use of public datasets that typically are not manually inspected before use. In this paper, we offer a solution to this vulnerability by using, during testing, random perturbations such as spelling correction if necessary, substitution by random synonym, or simply dropping the word. These perturbations are applied to random words in random sentences to defend NLP models against adversarial attacks. Our Random Perturbations Defense and Increased Randomness Defense methods are successful in returning attacked models to similar accuracy of models before attacks. The original accuracy of the model used in this work is 80% for sentiment classification. After undergoing attacks, the accuracy drops to accuracy between 0% and 44%. After applying our defense methods, the accuracy of the model is returned to the original accuracy within statistical significance.

* To be published in the proceedings for the 18th International Conference on Natural Language Processing (ICON 2021)

Via

Access Paper or Ask Questions

Incremental Deep Neural Network Learning using Classification Confidence Thresholding

Jun 21, 2021

Justin Leo, Jugal Kalita

Figure 1 for Incremental Deep Neural Network Learning using Classification Confidence Thresholding

Figure 2 for Incremental Deep Neural Network Learning using Classification Confidence Thresholding

Figure 3 for Incremental Deep Neural Network Learning using Classification Confidence Thresholding

Figure 4 for Incremental Deep Neural Network Learning using Classification Confidence Thresholding

Abstract:Most modern neural networks for classification fail to take into account the concept of the unknown. Trained neural networks are usually tested in an unrealistic scenario with only examples from a closed set of known classes. In an attempt to develop a more realistic model, the concept of working in an open set environment has been introduced. This in turn leads to the concept of incremental learning where a model with its own architecture and initial trained set of data can identify unknown classes during the testing phase and autonomously update itself if evidence of a new class is detected. Some problems that arise in incremental learning are inefficient use of resources to retrain the classifier repeatedly and the decrease of classification accuracy as multiple classes are added over time. This process of instantiating new classes is repeated as many times as necessary, accruing errors. To address these problems, this paper proposes the Classification Confidence Threshold approach to prime neural networks for incremental learning to keep accuracies high by limiting forgetting. A lean method is also used to reduce resources used in the retraining of the neural network. The proposed method is based on the idea that a network is able to incrementally learn a new class even when exposed to a limited number samples associated with the new class. This method can be applied to most existing neural networks with minimal changes to network architecture.

* Accepted to IEEE TNNLS

Via

Access Paper or Ask Questions

Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics

Jun 04, 2021

Thomas Conley, Jack St. Clair, Jugal Kalita

Figure 1 for Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics

Figure 2 for Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics

Figure 3 for Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics

Abstract:Although people have the ability to engage in vapid dialogue without effort, this may not be a uniquely human trait. Since the 1960's researchers have been trying to create agents that can generate artificial conversation. These programs are commonly known as chatbots. With increasing use of neural networks for dialog generation, some conclude that this goal has been achieved. This research joins the quest by creating a dialog generating Recurrent Neural Network (RNN) and by enhancing the ability of this network with auxiliary loss functions and a beam search. Our custom loss functions achieve better cohesion and coherence by including calculations of Maximum Mutual Information (MMI) and entropy. We demonstrate the effectiveness of this system by using a set of custom evaluation metrics inspired by an abundance of previous research and based on tried-and-true principles of Natural Language Processing.

* Proceedings of ICON-2018, Patiala, India. December 2018, pages 143--149

Via

Access Paper or Ask Questions