Picture for Xiaonan Li

Xiaonan Li

MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval

Add code
Oct 31, 2025
Viaarxiv icon

Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning

Add code
Oct 30, 2025
Viaarxiv icon

R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning

Add code
May 26, 2025
Figure 1 for R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Figure 2 for R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Figure 3 for R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Figure 4 for R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Viaarxiv icon

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks

Add code
Oct 16, 2024
Figure 1 for Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Figure 2 for Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Figure 3 for Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Figure 4 for Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Viaarxiv icon

Case2Code: Learning Inductive Reasoning with Synthetic Data

Add code
Jul 17, 2024
Figure 1 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 2 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 3 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 4 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Viaarxiv icon

Scaling Laws for Fact Memorization of Large Language Models

Add code
Jun 22, 2024
Viaarxiv icon

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

Add code
Jun 20, 2024
Figure 1 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 2 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 3 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 4 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Viaarxiv icon

Unified Active Retrieval for Retrieval Augmented Generation

Add code
Jun 18, 2024
Viaarxiv icon

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Add code
May 21, 2024
Figure 1 for Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Figure 2 for Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Figure 3 for Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Figure 4 for Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Viaarxiv icon

LLatrieval: LLM-Verified Retrieval for Verifiable Generation

Add code
Nov 14, 2023
Figure 1 for LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Figure 2 for LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Figure 3 for LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Figure 4 for LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Viaarxiv icon