Picture for Xiaoman Pan

Xiaoman Pan

SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans

Add code
Mar 09, 2026
Viaarxiv icon

Stabilizing Reinforcement Learning for Honesty Alignment in Language Models on Deductive Reasoning

Add code
Nov 12, 2025
Viaarxiv icon

PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time

Add code
Jun 06, 2025
Viaarxiv icon

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Add code
Oct 07, 2024
Figure 1 for SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Figure 2 for SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Figure 3 for SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Figure 4 for SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
Viaarxiv icon

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Add code
Oct 03, 2024
Figure 1 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 2 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 3 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 4 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Viaarxiv icon

Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots

Add code
Sep 16, 2024
Figure 1 for Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Figure 2 for Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Figure 3 for Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Figure 4 for Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Viaarxiv icon

Abstraction-of-Thought Makes Language Models Better Reasoners

Add code
Jun 18, 2024
Figure 1 for Abstraction-of-Thought Makes Language Models Better Reasoners
Figure 2 for Abstraction-of-Thought Makes Language Models Better Reasoners
Figure 3 for Abstraction-of-Thought Makes Language Models Better Reasoners
Figure 4 for Abstraction-of-Thought Makes Language Models Better Reasoners
Viaarxiv icon

Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models

Add code
Feb 27, 2024
Viaarxiv icon

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Add code
Feb 25, 2024
Viaarxiv icon

Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention

Add code
Dec 14, 2023
Figure 1 for Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Figure 2 for Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Figure 3 for Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Figure 4 for Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Viaarxiv icon