Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Agustina Saenz

ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Aug 29, 2024

Oishi Banerjee, Agustina Saenz, Kay Wu, Warren Clements, Adil Zia, Dominic Buensalido, Helen Kavnoudias, Alain S. Abi-Ghanem, Nour El Ghawi, Cibele Luna(+10 more)

Figure 1 for ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Figure 2 for ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Figure 3 for ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Figure 4 for ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Abstract:Given the rapidly expanding capabilities of generative AI models for radiology, there is a need for robust metrics that can accurately measure the quality of AI-generated radiology reports across diverse hospitals. We develop ReXamine-Global, a LLM-powered, multi-site framework that tests metrics across different writing styles and patient populations, exposing gaps in their generalization. First, our method tests whether a metric is undesirably sensitive to reporting style, providing different scores depending on whether AI-generated reports are stylistically similar to ground-truth reports or not. Second, our method measures whether a metric reliably agrees with experts, or whether metric and expert scores of AI-generated report quality diverge for some sites. Using 240 reports from 6 hospitals around the world, we apply ReXamine-Global to 7 established report evaluation metrics and uncover serious gaps in their generalizability. Developers can apply ReXamine-Global when designing new report evaluation metrics, ensuring their robustness across sites. Additionally, our analysis of existing metrics can guide users of those metrics towards evaluation procedures that work reliably at their sites of interest.

Via

Access Paper or Ask Questions

Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Oct 31, 2023

Benjamin Yan, Ruochen Liu, David E. Kuo, Subathra Adithan, Eduardo Pontes Reis, Stephen Kwak, Vasantha Kumar Venugopal, Chloe P. O'Connell, Agustina Saenz, Pranav Rajpurkar(+1 more)

Figure 1 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Figure 2 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Figure 3 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Figure 4 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Abstract:Automatically generated reports from medical images promise to improve the workflow of radiologists. Existing methods consider an image-to-report modeling task by directly generating a fully-fledged report from an image. However, this conflates the content of the report (e.g., findings and their attributes) with its style (e.g., format and choice of words), which can lead to clinically inaccurate reports. To address this, we propose a two-step approach for radiology report generation. First, we extract the content from an image; then, we verbalize the extracted content into a report that matches the style of a specific radiologist. For this, we leverage RadGraph -- a graph representation of reports -- together with large language models (LLMs). In our quantitative evaluations, we find that our approach leads to beneficial performance. Our human evaluation with clinical raters highlights that the AI-generated reports are indistinguishably tailored to the style of individual radiologist despite leveraging only a few examples as context.

* Accepted to Findings of EMNLP 2023

Via

Access Paper or Ask Questions

RadGraph2: Modeling Disease Progression in Radiology Reports via Hierarchical Information Extraction

Aug 09, 2023

Sameer Khanna, Adam Dejl, Kibo Yoon, Quoc Hung Truong, Hanh Duong, Agustina Saenz, Pranav Rajpurkar

Figure 1 for RadGraph2: Modeling Disease Progression in Radiology Reports via Hierarchical Information Extraction

Figure 2 for RadGraph2: Modeling Disease Progression in Radiology Reports via Hierarchical Information Extraction

Figure 3 for RadGraph2: Modeling Disease Progression in Radiology Reports via Hierarchical Information Extraction

Figure 4 for RadGraph2: Modeling Disease Progression in Radiology Reports via Hierarchical Information Extraction

Abstract:We present RadGraph2, a novel dataset for extracting information from radiology reports that focuses on capturing changes in disease state and device placement over time. We introduce a hierarchical schema that organizes entities based on their relationships and show that using this hierarchy during training improves the performance of an information extraction model. Specifically, we propose a modification to the DyGIE++ framework, resulting in our model HGIE, which outperforms previous models in entity and relation extraction tasks. We demonstrate that RadGraph2 enables models to capture a wider variety of findings and perform better at relation extraction compared to those trained on the original RadGraph dataset. Our work provides the foundation for developing automated systems that can track disease progression over time and develop information extraction models that leverage the natural hierarchy of labels in the medical domain.

* Accepted at Machine Learning for Healthcare 2023

Via

Access Paper or Ask Questions