Picture for Yelong Shen

Yelong Shen

Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization

Add code
Sep 30, 2025
Viaarxiv icon

SAS: Simulated Attention Score

Add code
Jul 10, 2025
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Figure 1 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 2 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 3 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 4 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Viaarxiv icon

PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning

Add code
Jun 17, 2025
Viaarxiv icon

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Add code
Jun 10, 2025
Viaarxiv icon

SoK: Are Watermarks in LLMs Ready for Deployment?

Add code
Jun 05, 2025
Figure 1 for SoK: Are Watermarks in LLMs Ready for Deployment?
Figure 2 for SoK: Are Watermarks in LLMs Ready for Deployment?
Figure 3 for SoK: Are Watermarks in LLMs Ready for Deployment?
Figure 4 for SoK: Are Watermarks in LLMs Ready for Deployment?
Viaarxiv icon

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Add code
May 21, 2025
Viaarxiv icon

R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution

Add code
May 20, 2025
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Figure 1 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 2 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 3 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 4 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Viaarxiv icon

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Add code
Apr 29, 2025
Figure 1 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 2 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 3 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 4 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Viaarxiv icon