Yogesh S Rawat

Re:Verse -- Can Your VLM Read a Manga?

Aug 11, 2025

DisenQ: Disentangling Q-Former for Activity-Biometrics

Jul 09, 2025

Coarse Attribute Prediction with Task Agnostic Distillation for Real World Clothes Changing ReID

May 19, 2025

A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning

Apr 08, 2025

DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-ID

Mar 28, 2025

STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding

Feb 28, 2025

LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models

Feb 07, 2025

DH-Bench: Probing Depth and Height Perception of Large Visual-Language Models

Aug 21, 2024

AirSketch: Generative Motion to Sketch

Jul 12, 2024

Navigating Hallucinations for Reasoning of Unintentional Activities

Mar 03, 2024