Picture for Vikas Chandra

Vikas Chandra

VLM3: Vision Language Models Are Native 3D Learners

Add code
May 28, 2026
Viaarxiv icon

MobileMoE: Scaling On-Device Mixture of Experts

Add code
May 26, 2026
Viaarxiv icon

Exploring Audio Hallucination in Egocentric Video Understanding

Add code
Apr 26, 2026
Viaarxiv icon

RPRA: Predicting an LLM-Judge for Efficient but Performant Inference

Add code
Apr 14, 2026
Viaarxiv icon

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Add code
Apr 09, 2026
Viaarxiv icon

Neural Computers

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Universal Perception Encoder

Add code
Mar 23, 2026
Viaarxiv icon

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale

Add code
Mar 16, 2026
Viaarxiv icon

EgoAVU: Egocentric Audio-Visual Understanding

Add code
Feb 05, 2026
Viaarxiv icon