Picture for Yu Cheng

Yu Cheng

Are Unified Vision-Language Models Necessary: Generalization Across Understanding and Generation

Add code
May 29, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Figure 1 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 2 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 3 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 4 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Viaarxiv icon

Structured Memory Mechanisms for Stable Context Representation in Large Language Models

Add code
May 28, 2025
Figure 1 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 2 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 3 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 4 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Viaarxiv icon

FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation

Add code
May 28, 2025
Figure 1 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 2 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 3 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 4 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Viaarxiv icon

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Add code
May 26, 2025
Viaarxiv icon

SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect

Add code
May 26, 2025
Viaarxiv icon

Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model

Add code
May 26, 2025
Viaarxiv icon

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards

Add code
May 25, 2025
Viaarxiv icon

Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning

Add code
May 25, 2025
Viaarxiv icon