Picture for Wenhao Li

Wenhao Li

Action-Aware Generative Sequence Modeling for Short Video Recommendation

Add code
Apr 28, 2026
Viaarxiv icon

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Add code
Apr 20, 2026
Viaarxiv icon

STS-Mixer: Spatio-Temporal-Spectral Mixer for 4D Point Cloud Video Understanding

Add code
Apr 13, 2026
Viaarxiv icon

Visual Prototype Conditioned Focal Region Generation for UAV-Based Object Detection

Add code
Apr 03, 2026
Viaarxiv icon

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization

Add code
Mar 24, 2026
Viaarxiv icon

Large Neighborhood Search meets Iterative Neural Constraint Heuristics

Add code
Mar 21, 2026
Viaarxiv icon

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement

Add code
Mar 21, 2026
Viaarxiv icon

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation

Add code
Feb 16, 2026
Viaarxiv icon

It Takes Two to Tango: A Holistic Simulator for Joint Order Scheduling and Multi-Agent Path Finding in Robotic Warehouses

Add code
Feb 15, 2026
Viaarxiv icon

See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch

Add code
Feb 11, 2026
Viaarxiv icon