Yuping Wang

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Jun 09, 2025, via arXiv

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Jun 04, 2025, via arXiv

Towards Reliable Large Audio Language Model

May 25, 2025, via arXiv

AudioMorphix: Training-free audio editing with diffusion probabilistic models

May 21, 2025, via arXiv

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

May 19, 2025, via arXiv

Generative AI for Autonomous Driving: Frontiers and Opportunities

May 13, 2025, via arXiv

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Apr 11, 2025, via arXiv

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

Mar 31, 2025, via arXiv

Can Large Vision Language Models Read Maps Like a Human?

Mar 18, 2025, via arXiv

FwNet-ECA: Facilitating Window Attention with Global Receptive Fields through Fourier Filtering Operations

Feb 25, 2025, via arXiv