Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mriganka Nath

ClipTTT: CLIP-Guided Test-Time Training Helps LVLMs See Better

Mar 27, 2026

Mriganka Nath, Anurag Das, Jiahao Xie, Bernt Schiele

Abstract:Large vision-language models (LVLMs) tend to hallucinate, especially when visual inputs are corrupted at test time. We show that such corruptions act as additional distribution shifts, significantly amplifying hallucination rates in real-world applications. To address this, we propose CLIP-guided Test-Time Training (ClipTTT), a method to adapt LVLMs under degraded conditions on the fly with a single test sample. Specifically, we leverage the image-text alignment strength of a pre-trained CLIP model as a stable guidance signal to identify reliable self-supervision targets, enabling rapid adaptation without altering the base LVLMs. Extensive experiments on standard hallucination benchmarks, with 15 common corruptions, demonstrate that ClipTTT effectively mitigates hallucinations and improves descriptive faithfulness under visual corruptions.

* 30 pages, 12 figures

Via

Access Paper or Ask Questions

Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Jan 21, 2021

Mriganka Nath, Subhasish Goswami

Figure 1 for Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Figure 2 for Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Figure 3 for Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Figure 4 for Toxicity Detection in Drug Candidates using Simplified Molecular-Input Line-Entry System

Abstract:The need for analysis of toxicity in new drug candidates and the requirement of doing it fast have asked the consideration of scientists towards the use of artificial intelligence tools to examine toxicity levels and to develop models to a degree where they can be used commercially to measure toxicity levels efficiently in upcoming drugs. Artificial Intelligence based models can be used to predict the toxic nature of a chemical using Quantitative Structure Activity Relationship techniques. Convolutional Neural Network models have demonstrated great outcomes in predicting the qualitative analysis of chemicals in order to determine the toxicity. This paper goes for the study of Simplified Molecular Input Line-Entry System (SMILES) as a parameter to develop Long short term memory (LSTM) based models in order to examine the toxicity of a molecule and the degree to which the need can be fulfilled for practical use alongside its future outlooks for the purpose of real world applications.

* International Journal of Computer Applications 175(21):1-4, September 2020
* 4 Pages, 4 Figures, Published with International Journal of Computer Applications (IJCA)

Via

Access Paper or Ask Questions