With advances in generative artificial intelligence (AI), it is now possible to produce realistic-looking automated reports for preliminary reads of radiology images. This can expedite clinical workflows, improve accuracy and reduce overall costs. However, it is also well-known that such models often hallucinate, leading to false findings in the generated reports. In this paper, we propose a new method of fact-checking of AI-generated reports using their associated images. Specifically, the developed examiner differentiates real and fake sentences in reports by learning the association between an image and sentences describing real or potentially fake findings. To train such an examiner, we first created a new dataset of fake reports by perturbing the findings in the original ground truth radiology reports associated with images. Text encodings of real and fake sentences drawn from these reports are then paired with image encodings to learn the mapping to real/fake labels. The utility of such an examiner is demonstrated for verifying automatically generated reports by detecting and removing fake sentences. Future generative AI approaches can use the resulting tool to validate their reports leading to a more responsible use of AI in expediting clinical workflows.
As a popular deep learning model, the convolutional neural network (CNN) has produced promising results in analyzing lung nodules and tumors in low-dose CT images. However, this approach still suffers from the lack of labeled data, which is a major challenge for further improvement in the screening and diagnostic performance of CNN. Accurate localization and characterization of nodules provides crucial pathological clues, especially relevant size, attenuation, shape, margins, and growth or stability of lesions, with which the sensitivity and specificity of detection and classification can be increased. To address this challenge, in this paper we develop a soft activation mapping (SAM) to enable fine-grained lesion analysis with a CNN so that it can access rich radiomics features. By combining high-level convolutional features with SAM, we further propose a high-level feature enhancement scheme to localize lesions precisely from multiple CT slices, which helps alleviate overfitting without any additional data augmentation. Experiments on the LIDC-IDRI benchmark dataset indicate that our proposed approach achieves a state-of-the-art predictive performance, reducing the false positive rate. Moreover, the SAM method focuses on irregular margins which are often linked to malignancy.