Picture for Xianpei Han

Xianpei Han

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Add code
Jul 27, 2025
Viaarxiv icon

ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search

Add code
Apr 15, 2025
Viaarxiv icon

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Add code
Apr 01, 2025
Viaarxiv icon

Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning

Add code
Apr 01, 2025
Figure 1 for Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning
Figure 2 for Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning
Figure 3 for Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning
Figure 4 for Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning
Viaarxiv icon

Large Language Models Often Say One Thing and Do Another

Add code
Mar 10, 2025
Figure 1 for Large Language Models Often Say One Thing and Do Another
Figure 2 for Large Language Models Often Say One Thing and Do Another
Figure 3 for Large Language Models Often Say One Thing and Do Another
Figure 4 for Large Language Models Often Say One Thing and Do Another
Viaarxiv icon

The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

Add code
Mar 05, 2025
Viaarxiv icon

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Add code
Feb 28, 2025
Viaarxiv icon

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch

Add code
Feb 24, 2025
Figure 1 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 2 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 3 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 4 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Viaarxiv icon

SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency

Add code
Feb 04, 2025
Viaarxiv icon

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Add code
Feb 03, 2025
Figure 1 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 2 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 3 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 4 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Viaarxiv icon