Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kok Wai Wong

Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis

Mar 15, 2025

Thivya Thogesan, Anupiya Nugaliyadde, Kok Wai Wong

Abstract:Interpretability remains a key difficulty in sentiment analysis with Large Language Models (LLMs), particularly in high-stakes applications where it is crucial to comprehend the rationale behind forecasts. This research addressed this by introducing a technique that applies SHAP (Shapley Additive Explanations) by breaking down LLMs into components such as embedding layer,encoder,decoder and attention layer to provide a layer-by-layer knowledge of sentiment prediction. The approach offers a clearer overview of how model interpret and categorise sentiment by breaking down LLMs into these parts. The method is evaluated using the Stanford Sentiment Treebank (SST-2) dataset, which shows how different sentences affect different layers. The effectiveness of layer-wise SHAP analysis in clarifying sentiment-specific token attributions is demonstrated by experimental evaluations, which provide a notable enhancement over current whole-model explainability techniques. These results highlight how the suggested approach could improve the reliability and transparency of LLM-based sentiment analysis in crucial applications.

Via

Access Paper or Ask Questions

Relevance Judgment Convergence Degree -- A Measure of Inconsistency among Assessors for Information Retrieval

Aug 08, 2022

Dengya Zhu, Shastri L Nimmagadda, Kok Wai Wong, Torsten Reiners

Figure 1 for Relevance Judgment Convergence Degree -- A Measure of Inconsistency among Assessors for Information Retrieval

Figure 2 for Relevance Judgment Convergence Degree -- A Measure of Inconsistency among Assessors for Information Retrieval

Figure 3 for Relevance Judgment Convergence Degree -- A Measure of Inconsistency among Assessors for Information Retrieval

Figure 4 for Relevance Judgment Convergence Degree -- A Measure of Inconsistency among Assessors for Information Retrieval

Abstract:Relevance judgment of human assessors is inherently subjective and dynamic when evaluation datasets are created for Information Retrieval (IR) systems. However, a small group of experts' relevance judgment results are usually taken as ground truth to "objectively" evaluate the performance of the IR systems. Recent trends intend to employ a group of judges, such as outsourcing, to alleviate the potentially biased judgment results stemmed from using only a single expert's judgment. Nevertheless, different judges may have different opinions and may not agree with each other, and the inconsistency in human relevance judgment may affect the IR system evaluation results. In this research, we introduce a Relevance Judgment Convergence Degree (RJCD) to measure the quality of queries in the evaluation datasets. Experimental results reveal a strong correlation coefficient between the proposed RJCD score and the performance differences between the two IR systems.

* To appear on 30th International Conference on Information Systems Development (ISD2022)

Via

Access Paper or Ask Questions

RCNN for Region of Interest Detection in Whole Slide Images

Sep 18, 2020

A Nugaliyadde, Kok Wai Wong, Jeremy Parry, Ferdous Sohel, Hamid Laga, Upeka V. Somaratne, Chris Yeomans, Orchid Foster

Figure 1 for RCNN for Region of Interest Detection in Whole Slide Images

Figure 2 for RCNN for Region of Interest Detection in Whole Slide Images

Figure 3 for RCNN for Region of Interest Detection in Whole Slide Images

Figure 4 for RCNN for Region of Interest Detection in Whole Slide Images

Abstract:Digital pathology has attracted significant attention in recent years. Analysis of Whole Slide Images (WSIs) is challenging because they are very large, i.e., of Giga-pixel resolution. Identifying Regions of Interest (ROIs) is the first step for pathologists to analyse further the regions of diagnostic interest for cancer detection and other anomalies. In this paper, we investigate the use of RCNN, which is a deep machine learning technique, for detecting such ROIs only using a small number of labelled WSIs for training. For experimentation, we used real WSIs from a public hospital pathology service in Western Australia. We used 60 WSIs for training the RCNN model and another 12 WSIs for testing. The model was further tested on a new set of unseen WSIs. The results show that RCNN can be effectively used for ROI detection from WSIs.

* This paper was accepted to the 27th International Conference on Neural Information Processing (ICONIP 2020) and will be published in the Springer CCIS Series

Via

Access Paper or Ask Questions

Predicting Electricity Consumption using Deep Recurrent Neural Networks

Sep 18, 2019

Anupiya Nugaliyadde, Upeka Somaratne, Kok Wai Wong

Figure 1 for Predicting Electricity Consumption using Deep Recurrent Neural Networks

Figure 2 for Predicting Electricity Consumption using Deep Recurrent Neural Networks

Figure 3 for Predicting Electricity Consumption using Deep Recurrent Neural Networks

Figure 4 for Predicting Electricity Consumption using Deep Recurrent Neural Networks

Abstract:Electricity consumption has increased exponentially during the past few decades. This increase is heavily burdening the electricity distributors. Therefore, predicting the future demand for electricity consumption will provide an upper hand to the electricity distributor. Predicting electricity consumption requires many parameters. The paper presents two approaches with one using a Recurrent Neural Network (RNN) and another one using a Long Short Term Memory (LSTM) network, which only considers the previous electricity consumption to predict the future electricity consumption. These models were tested on the publicly available London smart meter dataset. To assess the applicability of the RNN and the LSTM network to predict electricity consumption, they were tested to predict for an individual house and a block of houses for a given time period. The predictions were done for daily, trimester and 13 months, which covers short term, mid-term and long term prediction. Both the RNN and the LSTM network have achieved an average Root Mean Square error of 0.1.

Via

Access Paper or Ask Questions

Language Modeling through Long Term Memory Network

Apr 18, 2019

Anupiya Nugaliyadde, Kok Wai Wong, Ferdous Sohel, Hong Xie

Figure 1 for Language Modeling through Long Term Memory Network

Figure 2 for Language Modeling through Long Term Memory Network

Figure 3 for Language Modeling through Long Term Memory Network

Figure 4 for Language Modeling through Long Term Memory Network

Abstract:Recurrent Neural Networks (RNN), Long Short-Term Memory Networks (LSTM), and Memory Networks which contain memory are popularly used to learn patterns in sequential data. Sequential data has long sequences that hold relationships. RNN can handle long sequences but suffers from the vanishing and exploding gradient problems. While LSTM and other memory networks address this problem, they are not capable of handling long sequences (50 or more data points long sequence patterns). Language modelling requiring learning from longer sequences are affected by the need for more information in memory. This paper introduces Long Term Memory network (LTM), which can tackle the exploding and vanishing gradient problems and handles long sequences without forgetting. LTM is designed to scale data in the memory and gives a higher weight to the input in the sequence. LTM avoid overfitting by scaling the cell state after achieving the optimal results. The LTM is tested on Penn treebank dataset, and Text8 dataset and LTM achieves test perplexities of 83 and 82 respectively. 650 LTM cells achieved a test perplexity of 67 for Penn treebank, and 600 cells achieved a test perplexity of 77 for Text8. LTM achieves state of the art results by only using ten hidden LTM cells for both datasets.

* The paper is accepted to be published in IJCNN 2019

Via

Access Paper or Ask Questions

Enhancing Semantic Word Representations by Embedding Deeper Word Relationships

Jan 22, 2019

Anupiya Nugaliyadde, Kok Wai Wong, Ferdous Sohel, Hong Xie

Figure 1 for Enhancing Semantic Word Representations by Embedding Deeper Word Relationships

Figure 2 for Enhancing Semantic Word Representations by Embedding Deeper Word Relationships

Figure 3 for Enhancing Semantic Word Representations by Embedding Deeper Word Relationships

Figure 4 for Enhancing Semantic Word Representations by Embedding Deeper Word Relationships

Abstract:Word representations are created using analogy context-based statistics and lexical relations on words. Word representations are inputs for the learning models in Natural Language Understanding (NLU) tasks. However, to understand language, knowing only the context is not sufficient. Reading between the lines is a key component of NLU. Embedding deeper word relationships which are not represented in the context enhances the word representation. This paper presents a word embedding which combines an analogy, context-based statistics using Word2Vec, and deeper word relationships using Conceptnet, to create an expanded word representation. In order to fine-tune the word representation, Self-Organizing Map is used to optimize it. The proposed word representation is compared with semantic word representations using Simlex 999. Furthermore, the use of 3D visual representations has shown to be capable of representing the similarity and association between words. The proposed word representation shows a Spearman correlation score of 0.886 and provided the best results when compared to the current state-of-the-art methods, and exceed the human performance of 0.78.

* Accepted for the International Conference on Computer and Automation Engineering (ICCAE) 2019

Via

Access Paper or Ask Questions