Picture for Jiayu Liu

Jiayu Liu

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Add code
Jan 16, 2026
Viaarxiv icon

Step-DeepResearch Technical Report

Add code
Dec 24, 2025
Viaarxiv icon

Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization

Add code
Aug 13, 2025
Figure 1 for Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization
Figure 2 for Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization
Figure 3 for Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization
Figure 4 for Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization
Viaarxiv icon

Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty

Add code
Aug 12, 2025
Figure 1 for Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty
Figure 2 for Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty
Figure 3 for Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty
Figure 4 for Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty
Viaarxiv icon

Diversity-Enhanced Reasoning for Subjective Questions

Add code
Jul 27, 2025
Figure 1 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 2 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 3 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 4 for Diversity-Enhanced Reasoning for Subjective Questions
Viaarxiv icon

CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective

Add code
Jun 04, 2025
Viaarxiv icon

Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?

Add code
May 30, 2025
Figure 1 for Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
Figure 2 for Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
Figure 3 for Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
Figure 4 for Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
Viaarxiv icon

What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis

Add code
Dec 11, 2024
Figure 1 for What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis
Figure 2 for What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis
Figure 3 for What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis
Figure 4 for What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis
Viaarxiv icon

FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning

Add code
Dec 05, 2024
Figure 1 for FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning
Figure 2 for FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning
Figure 3 for FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning
Figure 4 for FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning
Viaarxiv icon

Locating the Leading Edge of Cultural Change

Add code
Nov 22, 2024
Viaarxiv icon