Yilun Du

Derek

Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale

Jan 06, 2026
Via arXiv

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

Jan 03, 2026
Via arXiv

Flexible Multitask Learning with Factorized Diffusion Policy

Dec 26, 2025
Via arXiv

Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases

Dec 20, 2025
Via arXiv

OPENTOUCH: Bringing Full-Hand Touch to Real-World Interaction

Dec 18, 2025
Via arXiv

Towards a Science of Scaling Agent Systems

Dec 17, 2025
Via arXiv

Large Video Planner Enables Generalizable Robot Control

Dec 17, 2025
Via arXiv

Evaluating Gemini Robotics Policies in a Veo World Simulator

Dec 11, 2025
Via arXiv

Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making

Dec 09, 2025
Via arXiv

Abstract 3D Perception for Spatial Intelligence in Vision-Language Models

Nov 14, 2025
Via arXiv