Picture for Ming-Ming Cheng

Ming-Ming Cheng

Nankai University

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Add code
Apr 28, 2026
Viaarxiv icon

Amped: Adaptive Multi-stage Non-edge Pruning for Edge Detection

Add code
Mar 29, 2026
Viaarxiv icon

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Add code
Mar 28, 2026
Viaarxiv icon

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Add code
Mar 24, 2026
Viaarxiv icon

Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

Add code
Mar 21, 2026
Viaarxiv icon

Mixture of Style Experts for Diverse Image Stylization

Add code
Mar 17, 2026
Viaarxiv icon

Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining

Add code
Mar 02, 2026
Viaarxiv icon

Test-Time Computing for Referring Multimodal Large Language Models

Add code
Feb 23, 2026
Viaarxiv icon

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

Add code
Feb 13, 2026
Viaarxiv icon

FlowConsist: Make Your Flow Consistent with Real Trajectory

Add code
Feb 06, 2026
Viaarxiv icon