Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Edward A. Fox

When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Jul 02, 2025

William A. Ingram, Bipasha Banerjee, Edward A. Fox

Figure 1 for When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Figure 2 for When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Figure 3 for When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Figure 4 for When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Abstract:Large language models (LLMs) are increasingly used to assign document relevance labels in information retrieval pipelines, especially in domains lacking human-labeled data. However, different models often disagree on borderline cases, raising concerns about how such disagreement affects downstream retrieval. This study examines labeling disagreement between two open-weight LLMs, LLaMA and Qwen, on a corpus of scholarly abstracts related to Sustainable Development Goals (SDGs) 1, 3, and 7. We isolate disagreement subsets and examine their lexical properties, rank-order behavior, and classification predictability. Our results show that model disagreement is systematic, not random: disagreement cases exhibit consistent lexical patterns, produce divergent top-ranked outputs under shared scoring functions, and are distinguishable with AUCs above 0.74 using simple classifiers. These findings suggest that LLM-based filtering introduces structured variability in document retrieval, even under controlled prompting and shared ranking logic. We propose using classification disagreement as an object of analysis in retrieval evaluation, particularly in policy-relevant or thematic search tasks.

* Presented at LLM4Eval Workshop, SIGIR 2025 Padova, Italy, July 17, 2025

Via

Access Paper or Ask Questions

Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals

Nov 26, 2024

William A. Ingram, Bipasha Banerjee, Edward A. Fox

Figure 1 for Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals

Figure 2 for Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals

Figure 3 for Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals

Figure 4 for Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals

Abstract:As research institutions increasingly commit to supporting the United Nations' Sustainable Development Goals (SDGs), there is a pressing need to accurately assess their research output against these goals. Current approaches, primarily reliant on keyword-based Boolean search queries, conflate incidental keyword matches with genuine contributions, reducing retrieval precision and complicating benchmarking efforts. This study investigates the application of autoregressive Large Language Models (LLMs) as evaluation agents to identify relevant scholarly contributions to SDG targets in scholarly publications. Using a dataset of academic abstracts retrieved via SDG-specific keyword queries, we demonstrate that small, locally-hosted LLMs can differentiate semantically relevant contributions to SDG targets from documents retrieved due to incidental keyword matches, addressing the limitations of traditional methods. By leveraging the contextual understanding of LLMs, this approach provides a scalable framework for improving SDG-related research metrics and informing institutional reporting.

Via

Access Paper or Ask Questions

Automating Chapter-Level Classification for Electronic Theses and Dissertations

Nov 26, 2024

Bipasha Banerjee, William A. Ingram, Edward A. Fox

Abstract:Traditional archival practices for describing electronic theses and dissertations (ETDs) rely on broad, high-level metadata schemes that fail to capture the depth, complexity, and interdisciplinary nature of these long scholarly works. The lack of detailed, chapter-level content descriptions impedes researchers' ability to locate specific sections or themes, thereby reducing discoverability and overall accessibility. By providing chapter-level metadata information, we improve the effectiveness of ETDs as research resources. This makes it easier for scholars to navigate them efficiently and extract valuable insights. The absence of such metadata further obstructs interdisciplinary research by obscuring connections across fields, hindering new academic discoveries and collaboration. In this paper, we propose a machine learning and AI-driven solution to automatically categorize ETD chapters. This solution is intended to improve discoverability and promote understanding of chapters. Our approach enriches traditional archival practices by providing context-rich descriptions that facilitate targeted navigation and improved access. We aim to support interdisciplinary research and make ETDs more accessible. By providing chapter-level classification labels and using them to index in our developed prototype system, we make content in ETD chapters more discoverable and usable for a diverse range of scholarly needs. Implementing this AI-enhanced approach allows archives to serve researchers better, enabling efficient access to relevant information and supporting deeper engagement with ETDs. This will increase the impact of ETDs as research tools, foster interdisciplinary exploration, and reinforce the role of archives in scholarly communication within the data-intensive academic landscape.

Via

Access Paper or Ask Questions

Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models

Jun 28, 2024

Nila Masrourisaadat, Nazanin Sedaghatkish, Fatemeh Sarshartehrani, Edward A. Fox

Abstract:Advances in generative models have led to significant interest in image synthesis, demonstrating the ability to generate high-quality images for a diverse range of text prompts. Despite this progress, most studies ignore the presence of bias. In this paper, we examine several text-to-image models not only by qualitatively assessing their performance in generating accurate images of human faces, groups, and specified numbers of objects but also by presenting a social bias analysis. As expected, models with larger capacity generate higher-quality images. However, we also document the inherent gender or social biases these models possess, offering a more complete understanding of their impact and limitations.

* 20 pages, 8 figures

Via

Access Paper or Ask Questions

ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Nov 07, 2023

Muntabir Hasan Choudhury, Lamia Salsabil, William A. Ingram, Edward A. Fox, Jian Wu

Figure 1 for ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Figure 2 for ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Figure 3 for ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Figure 4 for ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Abstract:Electronic theses and dissertations (ETDs) have been proposed, advocated, and generated for more than 25 years. Although ETDs are hosted by commercial or institutional digital library repositories, they are still an understudied type of scholarly big data, partially because they are usually longer than conference proceedings and journals. Segmenting ETDs will allow researchers to study sectional content. Readers can navigate to particular pages of interest, discover, and explore the content buried in these long documents. Most existing frameworks on document page classification are designed for classifying general documents and perform poorly on ETDs. In this paper, we propose ETDPC. Its backbone is a two-stream multimodal model with a cross-attention network to classify ETD pages into 13 categories. To overcome the challenge of imbalanced labeled samples, we augmented data for minority categories and employed a hierarchical classifier. ETDPC outperforms the state-of-the-art models in all categories, achieving an F1 of 0.84 -- 0.96 for 9 out of 13 categories. We also demonstrated its data efficiency. The code and data can be found on GitHub (https://github.com/lamps-lab/ETDMiner/tree/master/etd_segmentation).

* 10 pages, 3 figures, accepted to Innovative Applications of Artificial Intelligence (IAAI-24)

Via

Access Paper or Ask Questions

AI Chatbot for Generating Episodic Future Thinking (EFT) Cue Texts for Health

Nov 06, 2023

Sareh Ahmadi, Edward A. Fox

Abstract:We describe an AI-powered chatbot to aid with health improvement by generating Episodic Future Thinking (EFT) cue texts that should reduce delay discounting. In prior studies, EFT has been shown to address maladaptive health behaviors. Those studies involved participants, working with researchers, vividly imagining future events, and writing a description that they subsequently will frequently review, to ensure a shift from an inclination towards immediate rewards. That should promote behavior change, aiding in health tasks such as treatment adherence and lifestyle modifications. The AI chatbot is designed to guide users in generating personalized EFTs, automating the current labor-intensive interview-based process. This can enhance the efficiency of EFT interventions and make them more accessible, targeting specifically those with limited educational backgrounds or communication challenges. By leveraging AI for EFT intervention, we anticipate broadened access and improved health outcomes across diverse populations

Via

Access Paper or Ask Questions

MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

Mar 30, 2023

Muntabir Hasan Choudhury, Lamia Salsabil, Himarsha R. Jayanetti, Jian Wu, William A. Ingram, Edward A. Fox

Figure 1 for MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

Figure 2 for MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

Figure 3 for MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

Figure 4 for MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

Abstract:Metadata quality is crucial for digital objects to be discovered through digital library interfaces. However, due to various reasons, the metadata of digital objects often exhibits incomplete, inconsistent, and incorrect values. We investigate methods to automatically detect, correct, and canonicalize scholarly metadata, using seven key fields of electronic theses and dissertations (ETDs) as a case study. We propose MetaEnhance, a framework that utilizes state-of-the-art artificial intelligence methods to improve the quality of these fields. To evaluate MetaEnhance, we compiled a metadata quality evaluation benchmark containing 500 ETDs, by combining subsets sampled using multiple criteria. We tested MetaEnhance on this benchmark and found that the proposed methods achieved nearly perfect F1-scores in detecting errors and F1-scores in correcting errors ranging from 0.85 to 1.00 for five of seven fields.

* 7 pages, 3 tables, and 1 figure. Accepted by 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL '23) as a short paper

Via

Access Paper or Ask Questions

Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Jul 01, 2021

Muntabir Hasan Choudhury, Himarsha R. Jayanetti, Jian Wu, William A. Ingram, Edward A. Fox

Figure 1 for Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Figure 2 for Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Figure 3 for Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Figure 4 for Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Abstract:Electronic Theses and Dissertations (ETDs) contain domain knowledge that can be used for many digital library tasks, such as analyzing citation networks and predicting research trends. Automatic metadata extraction is important to build scalable digital library search engines. Most existing methods are designed for born-digital documents, so they often fail to extract metadata from scanned documents such as for ETDs. Traditional sequence tagging methods mainly rely on text-based features. In this paper, we propose a conditional random field (CRF) model that combines text-based and visual features. To verify the robustness of our model, we extended an existing corpus and created a new ground truth corpus consisting of 500 ETD cover pages with human validated metadata. Our experiments show that CRF with visual features outperformed both a heuristic and a CRF model with only text-based features. The proposed model achieved 81.3%-96% F1 measure on seven metadata fields. The data and source code are publicly available on Google Drive (https://tinyurl.com/y8kxzwrp) and a GitHub repository (https://github.com/lamps-lab/ETDMiner/tree/master/etd_crf), respectively.

* 7 pages, 4 figures, 1 table. Accepted by JCDL '21 as a short paper

Via

Access Paper or Ask Questions

ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

Jun 23, 2021

Sampanna Yashwant Kahu, William A. Ingram, Edward A. Fox, Jian Wu

Figure 1 for ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

Figure 2 for ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

Figure 3 for ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

Figure 4 for ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

Abstract:We focus on electronic theses and dissertations (ETDs), aiming to improve access and expand their utility, since more than 6 million are publicly available, and they constitute an important corpus to aid research and education across disciplines. The corpus is growing as new born-digital documents are included, and since millions of older theses and dissertations have been converted to digital form to be disseminated electronically in institutional repositories. In ETDs, as with other scholarly works, figures and tables can communicate a large amount of information in a concise way. Although methods have been proposed for extracting figures and tables from born-digital PDFs, they do not work well with scanned ETDs. Considering this problem, our assessment of state-of-the-art figure extraction systems is that the reason they do not function well on scanned PDFs is that they have only been trained on born-digital documents. To address this limitation, we present ScanBank, a new dataset containing 10 thousand scanned page images, manually labeled by humans as to the presence of the 3.3 thousand figures or tables found therein. We use this dataset to train a deep neural network model based on YOLOv5 to accurately extract figures and tables from scanned ETDs. We pose and answer important research questions aimed at finding better methods for figure extraction from scanned documents. One of those concerns the value for training, of data augmentation techniques applied to born-digital documents which are used to train models better suited for figure extraction from scanned documents. To the best of our knowledge, ScanBank is the first manually annotated dataset for figure and table extraction for scanned ETDs. A YOLOv5-based model, trained on ScanBank, outperforms existing comparable open-source and freely available baseline methods by a considerable margin.

* 16 pages, 3 figures, submitted to ACM/IEEE Joint Conference on Digital Libraries

Via

Access Paper or Ask Questions

Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Dec 22, 2020

Amirsina Torfi, Edward A. Fox, Chandan K. Reddy

Figure 1 for Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Figure 2 for Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Figure 3 for Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Figure 4 for Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Abstract:Deep learning models have demonstrated superior performance in several application problems, such as image classification and speech processing. However, creating a deep learning model using health record data requires addressing certain privacy challenges that bring unique concerns to researchers working in this domain. One effective way to handle such private data issues is to generate realistic synthetic data that can provide practically acceptable data quality and correspondingly the model performance. To tackle this challenge, we develop a differentially private framework for synthetic data generation using R\'enyi differential privacy. Our approach builds on convolutional autoencoders and convolutional generative adversarial networks to preserve some of the critical characteristics of the generated synthetic data. In addition, our model can also capture the temporal information and feature correlations that might be present in the original data. We demonstrate that our model outperforms existing state-of-the-art models under the same privacy budget using several publicly available benchmark medical datasets in both supervised and unsupervised settings.

Via

Access Paper or Ask Questions