Yifan Wang

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Jan 21, 2026, via arXiv

Think3D: Thinking with Space for Spatial Reasoning

Jan 19, 2026, via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026, via arXiv

Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning

Jan 07, 2026, via arXiv

AR-MOT: Autoregressive Multi-object Tracking

Jan 05, 2026, via arXiv

Split4D: Decomposed 4D Scene Reconstruction Without Video Segmentation

Dec 28, 2025, via arXiv

Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models

Dec 28, 2025, via arXiv

Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos

Dec 18, 2025, via arXiv

FAIR: Focused Attention Is All You Need for Generative Recommendation

Dec 17, 2025, via arXiv

Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising

Nov 18, 2025, via arXiv