Ran Xu

MTA-Agent: An Open Recipe for Multimodal Deep Search Agents

Apr 07, 2026

How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

Mar 25, 2026

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Feb 02, 2026

Future Optical Flow Prediction Improves Robot Control & Video Generation

Jan 15, 2026

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

Dec 19, 2025

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Oct 27, 2025

RoboSVG: A Unified Framework for Interactive SVG Generation with Multi-modal Guidance

Oct 26, 2025

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Oct 09, 2025

WALT: Web Agents that Learn Tools

Oct 01, 2025

SCUBA: Salesforce Computer Use Benchmark

Sep 30, 2025