Picture for Xingtai Lv

Xingtai Lv

FlowRL: Matching Reward Distributions for LLM Reasoning

Add code
Sep 18, 2025
Viaarxiv icon

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

Towards a Unified View of Large Language Model Post-Training

Add code
Sep 04, 2025
Figure 1 for Towards a Unified View of Large Language Model Post-Training
Figure 2 for Towards a Unified View of Large Language Model Post-Training
Figure 3 for Towards a Unified View of Large Language Model Post-Training
Figure 4 for Towards a Unified View of Large Language Model Post-Training
Viaarxiv icon

Automating Exploratory Multiomics Research via Language Models

Add code
Jun 09, 2025
Figure 1 for Automating Exploratory Multiomics Research via Language Models
Figure 2 for Automating Exploratory Multiomics Research via Language Models
Figure 3 for Automating Exploratory Multiomics Research via Language Models
Figure 4 for Automating Exploratory Multiomics Research via Language Models
Viaarxiv icon

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Add code
Mar 14, 2025
Figure 1 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 2 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 3 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 4 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Viaarxiv icon

Process Reinforcement through Implicit Rewards

Add code
Feb 03, 2025
Figure 1 for Process Reinforcement through Implicit Rewards
Figure 2 for Process Reinforcement through Implicit Rewards
Figure 3 for Process Reinforcement through Implicit Rewards
Figure 4 for Process Reinforcement through Implicit Rewards
Viaarxiv icon

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Add code
Dec 23, 2024
Viaarxiv icon

How to Synthesize Text Data without Model Collapse?

Add code
Dec 19, 2024
Figure 1 for How to Synthesize Text Data without Model Collapse?
Figure 2 for How to Synthesize Text Data without Model Collapse?
Figure 3 for How to Synthesize Text Data without Model Collapse?
Figure 4 for How to Synthesize Text Data without Model Collapse?
Viaarxiv icon

Automating Exploratory Proteomics Research via Language Models

Add code
Nov 06, 2024
Figure 1 for Automating Exploratory Proteomics Research via Language Models
Figure 2 for Automating Exploratory Proteomics Research via Language Models
Figure 3 for Automating Exploratory Proteomics Research via Language Models
Figure 4 for Automating Exploratory Proteomics Research via Language Models
Viaarxiv icon

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention

Add code
Nov 04, 2024
Figure 1 for Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Figure 2 for Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Figure 3 for Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Figure 4 for Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Viaarxiv icon