Picture for Renren Jin

Renren Jin

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

Add code
Apr 27, 2026
Viaarxiv icon

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Add code
Apr 14, 2026
Viaarxiv icon

SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Add code
Nov 08, 2025
Viaarxiv icon

Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation

Add code
Mar 14, 2025
Viaarxiv icon

ProBench: Benchmarking Large Language Models in Competitive Programming

Add code
Feb 28, 2025
Viaarxiv icon

Large Language Model Safety: A Holistic Survey

Add code
Dec 23, 2024
Figure 1 for Large Language Model Safety: A Holistic Survey
Figure 2 for Large Language Model Safety: A Holistic Survey
Figure 3 for Large Language Model Safety: A Holistic Survey
Figure 4 for Large Language Model Safety: A Holistic Survey
Viaarxiv icon

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Add code
Nov 21, 2024
Figure 1 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 2 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 3 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Figure 4 for Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Viaarxiv icon

Multilingual Large Language Models: A Systematic Survey

Add code
Nov 19, 2024
Viaarxiv icon

FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data

Add code
Aug 13, 2024
Figure 1 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 2 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 3 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Figure 4 for FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Viaarxiv icon