Picture for Shanmukha Vellamcheti

Shanmukha Vellamcheti

CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs

Add code
Mar 22, 2026
Viaarxiv icon

Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection

Add code
Jun 06, 2025
Figure 1 for Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Figure 2 for Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Figure 3 for Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Figure 4 for Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Viaarxiv icon

A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition

Add code
May 28, 2025
Figure 1 for A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition
Figure 2 for A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition
Figure 3 for A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition
Figure 4 for A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition
Viaarxiv icon