Picture for Linjing Li

Linjing Li

Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?

Add code
May 27, 2025
Viaarxiv icon

Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning

Add code
May 20, 2025
Viaarxiv icon

Beyond the First Error: Process Reward Models for Reflective Mathematical Reasoning

Add code
May 20, 2025
Viaarxiv icon

Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL

Add code
May 16, 2025
Viaarxiv icon

Learning Dynamics in Continual Pre-Training for Large Language Models

Add code
May 12, 2025
Viaarxiv icon

Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation

Add code
Mar 17, 2025
Viaarxiv icon

Learning Strategy Representation for Imitation Learning in Multi-Agent Games

Add code
Sep 28, 2024
Viaarxiv icon

Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons

Add code
Aug 06, 2024
Viaarxiv icon

ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games

Add code
Feb 28, 2024
Viaarxiv icon

LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement

Add code
Dec 13, 2023
Figure 1 for LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Figure 2 for LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Figure 3 for LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Figure 4 for LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Viaarxiv icon