Picture for Mohit Bansal

Mohit Bansal

Shammie

CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval

Add code
Jun 06, 2025
Viaarxiv icon

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Add code
Jun 06, 2025
Viaarxiv icon

CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection

Add code
Jun 05, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

SiLVR: A Simple Language-based Video Reasoning Framework

Add code
May 30, 2025
Viaarxiv icon

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Add code
May 28, 2025
Viaarxiv icon

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation

Add code
May 01, 2025
Viaarxiv icon

Anyprefer: An Agentic Framework for Preference Data Synthesis

Add code
Apr 27, 2025
Viaarxiv icon

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Add code
Apr 22, 2025
Viaarxiv icon

CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting

Add code
Apr 21, 2025
Viaarxiv icon