Picture for Yu Cheng

Yu Cheng

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Add code
May 29, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Figure 1 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 2 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 3 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 4 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Viaarxiv icon

FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation

Add code
May 28, 2025
Figure 1 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 2 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 3 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Figure 4 for FGS-Audio: Fixed-Decoder Framework for Audio Steganography with Adversarial Perturbation Generation
Viaarxiv icon

Structured Memory Mechanisms for Stable Context Representation in Large Language Models

Add code
May 28, 2025
Figure 1 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 2 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 3 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Figure 4 for Structured Memory Mechanisms for Stable Context Representation in Large Language Models
Viaarxiv icon

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect

Add code
May 26, 2025
Viaarxiv icon

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Add code
May 26, 2025
Viaarxiv icon

Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model

Add code
May 26, 2025
Viaarxiv icon

SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards

Add code
May 25, 2025
Viaarxiv icon

Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning

Add code
May 25, 2025
Viaarxiv icon