Picture for Yizhou Wang

Yizhou Wang

Distorted or Fabricated? A Survey on Hallucination in Video LLMs

Add code
Apr 14, 2026
Viaarxiv icon

Visually-grounded Humanoid Agents

Add code
Apr 09, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning

Add code
Mar 12, 2026
Viaarxiv icon

Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning

Add code
Mar 10, 2026
Viaarxiv icon

Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks

Add code
Feb 27, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

MTPano: Multi-Task Panoramic Scene Understanding via Label-Free Integration of Dense Prediction Priors

Add code
Feb 05, 2026
Viaarxiv icon

Model Optimization for Multi-Camera 3D Detection and Tracking

Add code
Feb 03, 2026
Viaarxiv icon

Calibration without Ground Truth

Add code
Jan 27, 2026
Viaarxiv icon