Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Asif Ekbal

A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information

Nov 12, 2024

Prashant Kapil, Asif Ekbal

Figure 1 for A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information

Figure 2 for A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information

Figure 3 for A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information

Abstract:Hate speech, offensive language, aggression, racism, sexism, and other abusive language are common phenomena in social media. There is a need for Artificial Intelligence(AI)based intervention which can filter hate content at scale. Most existing hate speech detection solutions have utilized the features by treating each post as an isolated input instance for the classification. This paper addresses this issue by introducing a unique model that improves hate speech identification for the English language by utilising intra-user and inter-user-based information. The experiment is conducted over single-task learning (STL) and multi-task learning (MTL) paradigms that use deep neural networks, such as convolutional neural networks (CNN), gated recurrent unit (GRU), bidirectional encoder representations from the transformer (BERT), and A Lite BERT (ALBERT). We use three benchmark datasets and conclude that combining certain user features with textual features gives significant improvements in macro-F1 and weighted-F1.

* 7 pages, 1 figure, and two tables. Accepted at the 20th International Conference on Natural Language Processing (ICON) 2023. https://aclanthology.org/2023.icon-1.53

Via

Access Paper or Ask Questions

MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures

Oct 20, 2024

Aizan Zafar, Kshitij Mishra, Asif Ekbal

Figure 1 for MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures

Figure 2 for MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures

Figure 3 for MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures

Figure 4 for MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures

Abstract:In Medical question-answering (QA) tasks, the need for effective systems is pivotal in delivering accurate responses to intricate medical queries. However, existing approaches often struggle to grasp the intricate logical structures and relationships inherent in medical contexts, thus limiting their capacity to furnish precise and nuanced answers. In this work, we address this gap by proposing a novel Abstractive QA system MedLogic-AQA that harnesses First Order Logic (FOL) based rules extracted from both context and questions to generate well-grounded answers. Through initial experimentation, we identified six pertinent first-order logical rules, which were then used to train a Logic-Understanding (LU) model capable of generating logical triples for a given context, question, and answer. These logic triples are then integrated into the training of MedLogic-AQA, enabling effective and coherent reasoning during answer generation. This distinctive fusion of logical reasoning with abstractive QA equips our system to produce answers that are logically sound, relevant, and engaging. Evaluation with respect to both automated and human-based demonstrates the robustness of MedLogic-AQA against strong baselines. Through empirical assessments and case studies, we validate the efficacy of MedLogic-AQA in elevating the quality and comprehensiveness of answers in terms of reasoning as well as informativeness

Via

Access Paper or Ask Questions

Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Oct 17, 2024

Dibyanayan Bandyopadhyay, Mohammed Hasanuzzaman, Asif Ekbal

Figure 1 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Figure 2 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Figure 3 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Figure 4 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Abstract:Detecting offensive memes is crucial, yet standard deep neural network systems often remain opaque. Various input attribution-based methods attempt to interpret their behavior, but they face challenges with implicitly offensive memes and non-causal attributions. To address these issues, we propose a framework based on a Structural Causal Model (SCM). In this framework, VisualBERT is trained to predict the class of an input meme based on both meme input and causal concepts, allowing for transparent interpretation. Our qualitative evaluation demonstrates the framework's effectiveness in understanding model behavior, particularly in determining whether the model was right due to the right reason, and in identifying reasons behind misclassification. Additionally, quantitative analysis assesses the significance of proposed modelling choices, such as de-confounding, adversarial learning, and dynamic routing, and compares them with input attribution methods. Surprisingly, we find that input attribution methods do not guarantee causality within our framework, raising questions about their reliability in safety-critical applications. The project page is at: https://newcodevelop.github.io/causality_adventure/

* Accepted at EMNLP Findings 2024

Via

Access Paper or Ask Questions

'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Oct 13, 2024

Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal

Figure 1 for 'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Figure 2 for 'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Figure 3 for 'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Figure 4 for 'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews

Abstract:The integrity of the peer-review process is vital for maintaining scientific rigor and trust within the academic community. With the steady increase in the usage of large language models (LLMs) like ChatGPT in academic writing, there is a growing concern that AI-generated texts could compromise scientific publishing, including peer-reviews. Previous works have focused on generic AI-generated text detection or have presented an approach for estimating the fraction of peer-reviews that can be AI-generated. Our focus here is to solve a real-world problem by assisting the editor or chair in determining whether a review is written by ChatGPT or not. To address this, we introduce the Term Frequency (TF) model, which posits that AI often repeats tokens, and the Review Regeneration (RR) model, which is based on the idea that ChatGPT generates similar outputs upon re-prompting. We stress test these detectors against token attack and paraphrasing. Finally, we propose an effective defensive strategy to reduce the effect of paraphrasing on our models. Our findings suggest both our proposed methods perform better than the other AI text detectors. Our RR model is more robust, although our TF model performs better than the RR model without any attacks. We make our code, dataset, and model public.

* EMNLP Main, 17 pages, 5 figures, 9 tables

Via

Access Paper or Ask Questions

ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos

Oct 13, 2024

Arpan Phukan, Manish Gupta, Asif Ekbal

Figure 1 for ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos

Figure 2 for ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos

Figure 3 for ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos

Figure 4 for ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos

Abstract:Previous studies on question generation from videos have mostly focused on generating questions about common objects and attributes and hence are not entity-centric. In this work, we focus on the generation of entity-centric information-seeking questions from videos. Such a system could be useful for video-based learning, recommending ``People Also Ask'' questions, video-based chatbots, and fact-checking. Our work addresses three key challenges: identifying question-worthy information, linking it to entities, and effectively utilizing multimodal signals. Further, to the best of our knowledge, there does not exist a large-scale dataset for this task. Most video question generation datasets are on TV shows, movies, or human activities or lack entity-centric information-seeking questions. Hence, we contribute a diverse dataset of YouTube videos, VideoQuestions, consisting of 411 videos with 2265 manually annotated questions. We further propose a model architecture combining Transformers, rich context signals (titles, transcripts, captions, embeddings), and a combination of cross-entropy and contrastive loss function to encourage entity-centric question generation. Our best method yields BLEU, ROUGE, CIDEr, and METEOR scores of 71.3, 78.6, 7.31, and 81.9, respectively, demonstrating practical usability. We make the code and dataset publicly available. https://github.com/thePhukan/ECIS-VQG

* Accepted in EMNLP 2024, https://openreview.net/forum?id=CriKOn01dI

Via

Access Paper or Ask Questions

M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought

Oct 11, 2024

Gitanjali Kumari, Kirtan Jain, Asif Ekbal

Abstract:In recent years, there has been a significant rise in the phenomenon of hate against women on social media platforms, particularly through the use of misogynous memes. These memes often target women with subtle and obscure cues, making their detection a challenging task for automated systems. Recently, Large Language Models (LLMs) have shown promising results in reasoning using Chain-of-Thought (CoT) prompting to generate the intermediate reasoning chains as the rationale to facilitate multimodal tasks, but often neglect cultural diversity and key aspects like emotion and contextual knowledge hidden in the visual modalities. To address this gap, we introduce a Multimodal Multi-hop CoT (M3Hop-CoT) framework for Misogynous meme identification, combining a CLIP-based classifier and a multimodal CoT module with entity-object-relationship integration. M3Hop-CoT employs a three-step multimodal prompting principle to induce emotions, target awareness, and contextual knowledge for meme analysis. Our empirical evaluation, including both qualitative and quantitative analysis, validates the efficacy of the M3Hop-CoT framework on the SemEval-2022 Task 5 (MAMI task) dataset, highlighting its strong performance in the macro-F1 score. Furthermore, we evaluate the model's generalizability by evaluating it on various benchmark meme datasets, offering a thorough insight into the effectiveness of our approach across different datasets.

* 34 Pages. Accepted in The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024). Main Conference

Via

Access Paper or Ask Questions

Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Oct 05, 2024

Suryavardan Suresh, Anku Rani, Parth Patwa, Aishwarya Reganti, Vinija Jain, Aman Chadha, Amitava Das, Amit Sheth, Asif Ekbal

Figure 1 for Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Figure 2 for Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Figure 3 for Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Figure 4 for Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Abstract:Researchers have found that fake news spreads much times faster than real news. This is a major problem, especially in today's world where social media is the key source of news for many among the younger population. Fact verification, thus, becomes an important task and many media sites contribute to the cause. Manual fact verification is a tedious task, given the volume of fake news online. The Factify5WQA shared task aims to increase research towards automated fake news detection by providing a dataset with an aspect-based question answering based fact verification method. Each claim and its supporting document is associated with 5W questions that help compare the two information sources. The objective performance measure in the task is done by comparing answers using BLEU score to measure the accuracy of the answers, followed by an accuracy measure of the classification. The task had submissions using custom training setup and pre-trained language-models among others. The best performing team posted an accuracy of 69.56%, which is a near 35% improvement over the baseline.

* Accepted at defactify3@aaai2024

Via

Access Paper or Ask Questions

Can Large Language Models Unlock Novel Scientific Research Ideas?

Sep 10, 2024

Sandeep Kumar, Tirthankar Ghosal, Vinayak Goyal, Asif Ekbal

Abstract:"An idea is nothing more nor less than a new combination of old elements" (Young, J.W.). The widespread adoption of Large Language Models (LLMs) and publicly available ChatGPT have marked a significant turning point in the integration of Artificial Intelligence (AI) into people's everyday lives. This study explores the capability of LLMs in generating novel research ideas based on information from research papers. We conduct a thorough examination of 4 LLMs in five domains (e.g., Chemistry, Computer, Economics, Medical, and Physics). We found that the future research ideas generated by Claude-2 and GPT-4 are more aligned with the author's perspective than GPT-3.5 and Gemini. We also found that Claude-2 generates more diverse future research ideas than GPT-4, GPT-3.5, and Gemini 1.0. We further performed a human evaluation of the novelty, relevancy, and feasibility of the generated future research ideas. This investigation offers insights into the evolving role of LLMs in idea generation, highlighting both its capability and limitations. Our work contributes to the ongoing efforts in evaluating and utilizing language models for generating future research ideas. We make our datasets and codes publicly available.

* 24 pages, 12 figures, 6 tables

Via

Access Paper or Ask Questions

A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Jul 03, 2024

Ramakrishna Appicharla, Baban Gain, Santanu Pal, Asif Ekbal, Pushpak Bhattacharyya

Figure 1 for A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Figure 2 for A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Figure 3 for A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Figure 4 for A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning

Abstract:In document-level neural machine translation (DocNMT), multi-encoder approaches are common in encoding context and source sentences. Recent studies \cite{li-etal-2020-multi-encoder} have shown that the context encoder generates noise and makes the model robust to the choice of context. This paper further investigates this observation by explicitly modelling context encoding through multi-task learning (MTL) to make the model sensitive to the choice of context. We conduct experiments on cascade MTL architecture, which consists of one encoder and two decoders. Generation of the source from the context is considered an auxiliary task, and generation of the target from the source is the main task. We experimented with German--English language pairs on News, TED, and Europarl corpora. Evaluation results show that the proposed MTL approach performs better than concatenation-based and multi-encoder DocNMT models in low-resource settings and is sensitive to the choice of context. However, we observe that the MTL models are failing to generate the source from the context. These observations align with the previous studies, and this might suggest that the available document-level parallel corpora are not context-aware, and a robust sentence-level model can outperform the context-aware models.

* Accepted to EAMT 2024 (poster)

Via

Access Paper or Ask Questions

Universal Adversarial Framework to Improve Adversarial Robustness for Diabetic Retinopathy Detection

Dec 13, 2023

Samrat Mukherjee, Dibyanayan Bandyopadhyay, Baban Gain, Asif Ekbal

Figure 1 for Universal Adversarial Framework to Improve Adversarial Robustness for Diabetic Retinopathy Detection

Figure 2 for Universal Adversarial Framework to Improve Adversarial Robustness for Diabetic Retinopathy Detection

Figure 3 for Universal Adversarial Framework to Improve Adversarial Robustness for Diabetic Retinopathy Detection

Figure 4 for Universal Adversarial Framework to Improve Adversarial Robustness for Diabetic Retinopathy Detection

Abstract:Diabetic Retinopathy (DR) is a prevalent illness associated with Diabetes which, if left untreated, can result in irreversible blindness. Deep Learning based systems are gradually being introduced as automated support for clinical diagnosis. Since healthcare has always been an extremely important domain demanding error-free performance, any adversaries could pose a big threat to the applicability of such systems. In this work, we use Universal Adversarial Perturbations (UAPs) to quantify the vulnerability of Medical Deep Neural Networks (DNNs) for detecting DR. To the best of our knowledge, this is the very first attempt that works on attacking complete fine-grained classification of DR images using various UAPs. Also, as a part of this work, we use UAPs to fine-tune the trained models to defend against adversarial samples. We experiment on several models and observe that the performance of such models towards unseen adversarial attacks gets boosted on average by $3.41$ Cohen-kappa value and maximum by $31.92$ Cohen-kappa value. The performance degradation on normal data upon ensembling the fine-tuned models was found to be statistically insignificant using t-test, highlighting the benefits of UAP-based adversarial fine-tuning.

Via

Access Paper or Ask Questions