Picture for Lifu Huang

Lifu Huang

UC Davis

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Add code
Jun 15, 2026
Viaarxiv icon

Proxy Reward Internalization and Mechanistic Exploitation: A Learned Precursor to Reward Hacking and Its Generalization

Add code
Jun 08, 2026
Viaarxiv icon

Learning Geometric Representations from Videos for Spatial Intelligent Multimodal Large Language Models

Add code
Jun 04, 2026
Viaarxiv icon

Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization

Add code
Apr 14, 2026
Viaarxiv icon

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Add code
Apr 02, 2026
Viaarxiv icon

Incentivizing Temporal-Awareness in Egocentric Video Understanding Models

Add code
Mar 28, 2026
Viaarxiv icon

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

Add code
Feb 23, 2026
Viaarxiv icon

StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation

Add code
Feb 04, 2026
Viaarxiv icon

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Add code
Feb 02, 2026
Viaarxiv icon

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Add code
Jan 27, 2026
Viaarxiv icon