Picture for Yu Cheng

Yu Cheng

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Add code
Jun 04, 2025
Viaarxiv icon

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Add code
May 29, 2025
Viaarxiv icon

Are Unified Vision-Language Models Necessary: Generalization Across Understanding and Generation

Add code
May 29, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Viaarxiv icon

Structured Memory Mechanisms for Stable Context Representation in Large Language Models

Add code
May 28, 2025
Viaarxiv icon

FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation

Add code
May 28, 2025
Viaarxiv icon

Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model

Add code
May 26, 2025
Viaarxiv icon

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Add code
May 26, 2025
Viaarxiv icon

SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect

Add code
May 26, 2025
Viaarxiv icon