Picture for Yukun Yan

Yukun Yan

Legal$Δ$: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain

Add code
Aug 17, 2025
Figure 1 for Legal$Δ$: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain
Figure 2 for Legal$Δ$: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain
Figure 3 for Legal$Δ$: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain
Figure 4 for Legal$Δ$: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain
Viaarxiv icon

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization

Add code
Jun 12, 2025
Figure 1 for ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization
Figure 2 for ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization
Figure 3 for ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization
Figure 4 for ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization
Viaarxiv icon

KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs

Add code
Jun 11, 2025
Figure 1 for KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
Figure 2 for KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
Figure 3 for KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
Figure 4 for KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon

ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation

Add code
May 30, 2025
Viaarxiv icon

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling

Add code
May 28, 2025
Viaarxiv icon

Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning

Add code
May 28, 2025
Figure 1 for Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning
Figure 2 for Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning
Figure 3 for Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning
Figure 4 for Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning
Viaarxiv icon

EULER: Enhancing the Reasoning Ability of Large Language Models through Error-Induced Learning

Add code
May 28, 2025
Viaarxiv icon

Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors

Add code
Mar 28, 2025
Viaarxiv icon

Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models

Add code
Feb 26, 2025
Figure 1 for Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Figure 2 for Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Figure 3 for Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Figure 4 for Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Viaarxiv icon