Ming Lu

Uni-Synergy: Bridging Understanding and Generation for Personalized Reasoning via Co-operative Reinforcement Learning

May 11, 2026
Via arXiv

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

May 11, 2026
Via arXiv

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression

Mar 13, 2026
Via arXiv

AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning

Feb 07, 2026
Via arXiv

M2A: Multimodal Memory Agent with Dual-Layer Hybrid Memory for Long-Term Personalized Interactions

Feb 07, 2026
Via arXiv

MASC: Metal-Aware Sampling and Correction via Reinforcement Learning for Accelerated MRI

Jan 30, 2026
Via arXiv

Reinforced Rate Control for Neural Video Compression via Inter-Frame Rate-Distortion Awareness

Jan 27, 2026
Via arXiv

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Jan 07, 2026
Via arXiv

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking

Jan 04, 2026
Via arXiv

YODA: Yet Another One-step Diffusion-based Video Compressor

Jan 03, 2026
Via arXiv