Picture for Junshan Zhang

Junshan Zhang

Sherman

SOLAR: Communication-Efficient Model Adaptation via Subspace-Oriented Latent Adapter Reparametrization

Add code
Apr 09, 2026
Viaarxiv icon

Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization

Add code
Apr 08, 2026
Viaarxiv icon

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

Add code
Feb 23, 2026
Viaarxiv icon

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Add code
Feb 02, 2026
Viaarxiv icon

VITA: Vision-to-Action Flow Matching Policy

Add code
Jul 17, 2025
Viaarxiv icon

Ego-centric Learning of Communicative World Models for Autonomous Driving

Add code
Jun 09, 2025
Figure 1 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 2 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 3 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 4 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Viaarxiv icon

IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

Add code
May 15, 2025
Viaarxiv icon

AugFL: Augmenting Federated Learning with Pretrained Models

Add code
Mar 04, 2025
Viaarxiv icon

Heterogeneous Decision Making in Mixed Traffic: Uncertainty-aware Planning and Bounded Rationality

Add code
Feb 25, 2025
Viaarxiv icon

Towards Unraveling and Improving Generalization in World Models

Add code
Dec 31, 2024
Viaarxiv icon