Yuchen Fan

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

Sep 10, 2025

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

May 28, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Mar 14, 2025

Process Reinforcement through Implicit Rewards

Feb 03, 2025

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Dec 23, 2024

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models

Dec 17, 2024

3D Mesh Editing using Masked LRMs

Dec 11, 2024

Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds

Dec 10, 2024

BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation

Dec 09, 2024