Yu Cheng

Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism

Oct 30, 2025

Native Hybrid Attention for Efficient Sequence Modeling

Oct 08, 2025

ExGRPO: Learning to Reason from Experience

Oct 02, 2025

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Sep 18, 2025

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Sep 10, 2025

Interleaving Reasoning for Better Text-to-Image Generation

Sep 09, 2025

Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning

Sep 04, 2025

Improved Personalized Headline Generation via Denoising Fake Interests from Implicit Feedback

Aug 10, 2025

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Jul 24, 2025

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Jun 10, 2025