Picture for Xiaopeng Li

Xiaopeng Li

GeoRouter: Dynamic Paradigm Routing for Worldwide Image Geolocalization

Add code
Mar 25, 2026
Viaarxiv icon

High-Slip-Ratio Control for Peak Tire-Road Friction Estimation Using Automated Vehicles

Add code
Mar 10, 2026
Viaarxiv icon

To Search or Not to Search: Aligning the Decision Boundary of Deep Search Agents via Causal Intervention

Add code
Feb 03, 2026
Viaarxiv icon

Reward-free Alignment for Conflicting Objectives

Add code
Feb 02, 2026
Viaarxiv icon

Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation

Add code
Jan 29, 2026
Viaarxiv icon

Robustness and Resilience Evaluation of Eco-Driving Strategies at Signalized Intersections

Add code
Jan 19, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Exploring Recommender System Evaluation: A Multi-Modal User Agent Framework for A/B Testing

Add code
Jan 08, 2026
Viaarxiv icon

JPU: Bridging Jailbreak Defense and Unlearning via On-Policy Path Rectification

Add code
Jan 06, 2026
Viaarxiv icon

Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Add code
Dec 21, 2025
Figure 1 for Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
Figure 2 for Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
Figure 3 for Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
Figure 4 for Exploration vs Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
Viaarxiv icon