Picture for Peng Xu

Peng Xu

Google

AdsQA: Towards Advertisement Video Understanding

Add code
Sep 10, 2025
Viaarxiv icon

PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching

Add code
Sep 10, 2025
Viaarxiv icon

From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

Add code
Jul 23, 2025
Viaarxiv icon

Whole-Body Constrained Learning for Legged Locomotion via Hierarchical Optimization

Add code
Jun 05, 2025
Viaarxiv icon

VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models

Add code
May 26, 2025
Viaarxiv icon

No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference

Add code
May 26, 2025
Viaarxiv icon

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models

Add code
May 21, 2025
Viaarxiv icon

SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

Add code
May 19, 2025
Viaarxiv icon

LONGER: Scaling Up Long Sequence Modeling in Industrial Recommenders

Add code
May 07, 2025
Viaarxiv icon