Picture for Arya Shah

Arya Shah

SycoPhantasy: Quantifying Sycophancy and Hallucination in Small Open Weight VLMs for Vision-Language Scoring of Fantasy Characters

Add code
Apr 27, 2026
Viaarxiv icon

Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation

Add code
Apr 15, 2026
Viaarxiv icon

GF-Score: Certified Class-Conditional Robustness Evaluation with Fairness Guarantees

Add code
Apr 14, 2026
Viaarxiv icon

Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models

Add code
Apr 12, 2026
Viaarxiv icon

One Instruction Does Not Fit All: How Well Do Embeddings Align Personas and Instructions in Low-Resource Indian Languages?

Add code
Jan 15, 2026
Viaarxiv icon