Picture for Manish Kumar Govind

Manish Kumar Govind

UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models

Add code
Feb 23, 2026
Viaarxiv icon

From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities

Add code
Jan 10, 2025
Figure 1 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 2 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 3 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 4 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Viaarxiv icon

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Add code
Jun 27, 2024
Figure 1 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 2 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 3 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 4 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Viaarxiv icon

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Add code
Jun 13, 2024
Viaarxiv icon