Picture for Xuan Qi

Xuan Qi

DataProphet: Demystifying Supervision Data Generalization in Multimodal LLMs

Add code
Mar 20, 2026
Viaarxiv icon

Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap

Add code
Aug 06, 2025
Figure 1 for Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap
Figure 2 for Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap
Figure 3 for Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap
Figure 4 for Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Add code
Jun 17, 2025
Figure 1 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 2 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 3 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 4 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Figure 1 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 2 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 3 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 4 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

Add code
May 21, 2025
Figure 1 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 2 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 3 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 4 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Viaarxiv icon

A Systematic Survey of Automatic Prompt Optimization Techniques

Add code
Feb 24, 2025
Viaarxiv icon

DebateQA: Evaluating Question Answering on Debatable Knowledge

Add code
Aug 02, 2024
Figure 1 for DebateQA: Evaluating Question Answering on Debatable Knowledge
Figure 2 for DebateQA: Evaluating Question Answering on Debatable Knowledge
Figure 3 for DebateQA: Evaluating Question Answering on Debatable Knowledge
Figure 4 for DebateQA: Evaluating Question Answering on Debatable Knowledge
Viaarxiv icon

Truthful Dataset Valuation by Pointwise Mutual Information

Add code
May 28, 2024
Figure 1 for Truthful Dataset Valuation by Pointwise Mutual Information
Figure 2 for Truthful Dataset Valuation by Pointwise Mutual Information
Figure 3 for Truthful Dataset Valuation by Pointwise Mutual Information
Figure 4 for Truthful Dataset Valuation by Pointwise Mutual Information
Viaarxiv icon