Picture for Lili Qiu

Lili Qiu

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Add code
Apr 16, 2026
Viaarxiv icon

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Add code
Apr 09, 2026
Viaarxiv icon

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Add code
Mar 26, 2026
Viaarxiv icon

SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Add code
Mar 24, 2026
Viaarxiv icon

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Add code
Mar 03, 2026
Viaarxiv icon

MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training

Add code
Oct 21, 2025
Viaarxiv icon

Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps

Add code
Oct 02, 2025
Viaarxiv icon

VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL

Add code
Oct 02, 2025
Viaarxiv icon

$ΔL$ Normalization: Rethink Loss Aggregation in RLVR

Add code
Sep 09, 2025
Viaarxiv icon

MmBack: Clock-free Multi-Sensor Backscatter with Synchronous Acquisition and Multiplexing

Add code
Jul 02, 2025
Viaarxiv icon