Pawan Goyal

Indian Institute of Technology Kharagpur

On The Persona-based Summarization of Domain-Specific Documents

Jun 06, 2024

Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku

Abstract: In an ever-expanding world of domain-specific knowledge, the increasing complexity of consuming and storing information necessitates the generation of summaries from large information repositories. However, every persona of a domain has different requirements of information and hence their summarization. For example, in the healthcare domain, a persona-based (such as Doctor, Nurse, Patient etc.) approach is imperative to deliver targeted medical information efficiently. Persona-based summarization of domain-specific information by humans is a high cognitive load task and is generally not preferred. The summaries generated by two different humans have high variability and do not scale in cost and subject matter expertise as domains and personas grow. Further, AI-generated summaries using generic Large Language Models (LLMs) may not necessarily offer satisfactory accuracy for different domains unless they have been specifically trained on domain-specific data and can also be very expensive to use in day-to-day operations. Our contribution in this paper is two-fold: 1) We present an approach to efficiently fine-tune a domain-specific small foundation LLM using a healthcare corpus and also show that we can effectively evaluate the summarization quality using AI-based critiquing. 2) We further show that AI-based critiquing has good concordance with Human-based critiquing of the summaries. Hence, such AI-based pipelines to generate domain-specific persona-based summaries can be easily scaled to other domains such as legal, enterprise documents, education etc. in a very efficient and cost-effective manner.

* ACL 2024 Findings (Association for Computational Linguistics)

Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling

May 15, 2024

Subhendu Khatuya, Rajdeep Mukherjee, Akash Ghosh, Manjunath Hegde, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

Abstract: We study the problem of automatically annotating relevant numerals (GAAP metrics) occurring in the financial documents with their corresponding XBRL tags. Different from prior works, we investigate the feasibility of solving this extreme classification problem using a generative paradigm through instruction tuning of Large Language Models (LLMs). To this end, we leverage metric metadata information to frame our target outputs while proposing a parameter efficient solution for the task using LoRA. We perform experiments on two recently released financial numeric labeling datasets. Our proposed model, FLAN-FinXC, achieves new state-of-the-art performances on both the datasets, outperforming several strong baselines. We explain the better scores of our proposed model by demonstrating its capability for zero-shot as well as the least frequently occurring tags. Also, even when we fail to predict the XBRL tags correctly, our generated output has substantial overlap with the ground-truth in the majority of the cases.

* This work has been accepted to appear at North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Instruction-Guided Bullet Point Summarization of Long Financial Earnings Call Transcripts

May 03, 2024

Subhendu Khatuya, Koushiki Sinha, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

Abstract: While automatic summarization techniques have made significant advancements, their primary focus has been on summarizing short news articles or documents that have clear structural patterns like scientific articles or government reports. There has not been much exploration into developing efficient methods for summarizing financial documents, which often contain complex facts and figures. Here, we study the problem of bullet point summarization of long Earning Call Transcripts (ECTs) using the recently released ECTSum dataset. We leverage an unsupervised question-based extractive module followed by a parameter efficient instruction-tuned abstractive module to solve this task. Our proposed model FLAN-FinBPS achieves new state-of-the-art performances outperforming the strongest baseline with 14.88% average ROUGE score gain, and is capable of generating factually consistent bullet point summaries that capture the important facts discussed in the ECTs.

* Accepted in SIGIR 2024

SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models

Apr 27, 2024

Manav Nitin Kapadnis, Sohan Patnaik, Abhilash Nandy, Sourjyadip Ray, Pawan Goyal, Debdoot Sheet

Abstract: Radiology Report Generation (R2Gen) demonstrates how Multi-modal Large Language Models (MLLMs) can automate the creation of accurate and coherent radiological reports. Existing methods often hallucinate details in text-based reports that don't accurately reflect the image content. To mitigate this, we introduce a novel strategy, SERPENT-VLM (SElf Refining Radiology RePort GENeraTion using Vision Language Models), which improves the R2Gen task by integrating a self-refining mechanism into the MLLM framework. We employ a unique self-supervised loss that leverages similarity between pooled image representations and the contextual representations of the generated radiological text, alongside the standard Causal Language Modeling objective, to refine image-text representations. This allows the model to scrutinize and align the generated text through dynamic interaction between a given image and the generated text, therefore reducing hallucination and continuously enhancing nuanced report generation. SERPENT-VLM outperforms existing baselines such as LLaVA-Med, BiomedGPT, etc., achieving SoTA performance on the IU X-ray and Radiology Objects in COntext (ROCO) datasets, and also proves to be robust against noisy images. A qualitative case study emphasizes the significant advancements towards more sophisticated MLLM frameworks for R2Gen, opening paths for further research into self-supervised refinement in the medical imaging domain.

* 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024

Order-Based Pre-training Strategies for Procedural Text Understanding

Apr 06, 2024

Abhilash Nandy, Yash Kulkarni, Pawan Goyal, Niloy Ganguly

Abstract: In this paper, we propose sequence-based pretraining methods to enhance procedural understanding in natural language processing. Procedural text, containing sequential instructions to accomplish a task, is difficult to understand due to the changing attributes of entities in the context. We focus on recipes, which are commonly represented as ordered instructions, and use this order as a supervision signal. Our work is one of the first to compare several 'order-as-supervision' transformer pre-training methods, including Permutation Classification, Embedding Regression, and Skip-Clip, and shows that these methods give improved results compared to the baselines and SoTA LLMs on two downstream Entity-Tracking datasets: NPN-Cooking dataset in recipe domain and ProPara dataset in open domain. Our proposed methods address the non-trivial Entity Tracking Task that requires prediction of entity states across procedure steps, which requires understanding the order of steps. These methods show an improvement over the best baseline by 1.6% and 7-9% on NPN-Cooking and ProPara Datasets respectively across metrics.

* 8 pages (Accepted for publication at NAACL 2024 (Main Conference))

Intent Detection and Entity Extraction from BioMedical Literature

Apr 04, 2024

Ankan Mullick, Mukur Gupta, Pawan Goyal

Abstract: Biomedical queries have become increasingly prevalent in web searches, reflecting the growing interest in accessing biomedical literature. Despite recent research on large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable. In this paper, we address this question by conducting a comprehensive empirical evaluation of intent detection and named entity recognition (NER) tasks from biomedical text. We show that Supervised Fine Tuned approaches are still relevant and more effective than general-purpose LLMs. Biomedical transformer models such as PubMedBERT can surpass ChatGPT on the NER task with only 5 supervised examples.

* Accepted to CL4Health LREC-COLING 2024

How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

Mar 30, 2024

Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

Abstract: Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to the absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientific hybrid tabular data, we propose a new dataset, "SciTabQA", consisting of 822 question-answer pairs from scientific tables and their descriptions. With the help of this dataset, we assess the state-of-the-art Tabular QA models based on their ability (i) to use heterogeneous information requiring both structured data (table) and unstructured data (text) and (ii) to perform complex scientific reasoning tasks. In essence, we check the capability of the models to interpret scientific tables and text. Our experiments show that "SciTabQA" is an innovative dataset to study question-answering over scientific heterogeneous data. We benchmark three state-of-the-art Tabular QA models, and find that the best F1 score is only 0.462.

Stability-Certified Learning of Control Systems with Quadratic Nonlinearities

Mar 01, 2024

Igor Pontes Duff, Pawan Goyal, Peter Benner

Abstract: This work primarily focuses on an operator inference methodology aimed at constructing low-dimensional dynamical models based on a priori hypotheses about their structure, often informed by established physics or expert insights. Stability is a fundamental attribute of dynamical systems, yet it is not always assured in models derived through inference. Our main objective is to develop a method that facilitates the inference of quadratic control dynamical systems with inherent stability guarantees. To this aim, we investigate the stability characteristics of control systems with energy-preserving nonlinearities, thereby identifying conditions under which such systems are bounded-input bounded-state stable. These insights are subsequently applied to the learning process, yielding inferred models that are inherently stable by design. The efficacy of our proposed framework is demonstrated through a couple of numerical examples.

* 12 pages, 4 figures

Learning reduced-order Quadratic-Linear models in Process Engineering using Operator Inference

Feb 27, 2024

Ion Victor Gosea, Luisa Peterson, Pawan Goyal, Jens Bremer, Kai Sundmacher, Peter Benner

Abstract: In this work, we address the challenge of efficiently modeling dynamical systems in process engineering. We use reduced-order model learning, specifically operator inference. This is a non-intrusive, data-driven method for learning dynamical systems from time-domain data. The application in our study is carbon dioxide methanation, an important reaction within the Power-to-X framework, to demonstrate its potential. The numerical results show the ability of the reduced-order models constructed with operator inference to provide a reduced yet accurate surrogate solution. This represents an important milestone towards the implementation of fast and reliable digital twin architectures.

* 10 pages, 3 figures

Long Dialog Summarization: An Analysis

Feb 26, 2024

Ankan Mullick, Ayan Kumar Bhowmick, Raghav R, Ravi Kokku, Prasenjit Dey, Pawan Goyal, Niloy Ganguly

Abstract: Dialog summarization has become increasingly important in managing and comprehending large-scale conversations across various domains. This task presents unique challenges in capturing the key points, context, and nuances of multi-turn long conversations for summarization. It is worth noting that the summarization techniques may vary based on specific requirements such as in a shopping-chatbot scenario, the dialog summary helps to learn user preferences, whereas in the case of a customer call center, the summary may involve the problem attributes that a user specified, and the final resolution provided. This work emphasizes the significance of creating coherent and contextually rich summaries for effective communication in various applications. We explore current state-of-the-art approaches for long dialog summarization in different domains and benchmark metrics based evaluations show that one single model does not perform well across various areas for distinct summarization tasks.
