Picture for Mohit Bansal

Mohit Bansal

Shammie

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

Add code
May 29, 2026
Viaarxiv icon

STORM: Internalized Modeling for Spatial-Temporal Reasoning in Video-Language Models

Add code
May 25, 2026
Viaarxiv icon

AVSD: Adaptive-View Self-Distillation by Balancing Consensus and Teacher-Specific Privileged Signals

Add code
May 20, 2026
Viaarxiv icon

MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems

Add code
May 19, 2026
Viaarxiv icon

PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation

Add code
May 14, 2026
Viaarxiv icon

EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Add code
May 11, 2026
Viaarxiv icon

Stabilizing Efficient Reasoning with Step-Level Advantage Selection

Add code
Apr 27, 2026
Viaarxiv icon

MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments

Add code
Apr 15, 2026
Viaarxiv icon

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Add code
Apr 13, 2026
Viaarxiv icon

The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

Add code
Apr 07, 2026
Viaarxiv icon