Qixin Xu

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

May 29, 2026

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

May 13, 2026

CogDoc: Towards Unified thinking in Documents

Dec 14, 2025

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Sep 03, 2025

Process Reinforcement through Implicit Rewards

Feb 03, 2025

Leveraging Frequent Query Substructures to Generate Formal Queries for Complex Question Answering

Aug 29, 2019