Picture for Jiaqi Li

Jiaqi Li

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Add code
May 29, 2026
Viaarxiv icon

Xetrieval: Mechanistically Explaining Dense Retrieval

Add code
May 28, 2026
Viaarxiv icon

Less is More: Early Stopping Rollout for On-Policy Distillation

Add code
May 26, 2026
Viaarxiv icon

SimART: A Unified and Open Real-world Multimodal Simulation Platform for 6G Integrated Sensing and Communication

Add code
May 13, 2026
Viaarxiv icon

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Add code
May 12, 2026
Viaarxiv icon

Residual-loss Anomaly Analysis of Physics-Informed Neural Networks: An Inverse Method for Change-point Detection in Nonlinear Dynamical Systems with Regime Switching

Add code
Apr 28, 2026
Viaarxiv icon

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Add code
Apr 27, 2026
Viaarxiv icon

AIT Academy: Cultivating the Complete Agent with a Confucian Three-Domain Curriculum

Add code
Apr 20, 2026
Viaarxiv icon

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Add code
Mar 09, 2026
Viaarxiv icon

AR2-4FV: Anchored Referring and Re-identification for Long-Term Grounding in Fixed-View Videos

Add code
Mar 08, 2026
Viaarxiv icon