Zihui Zhao

Reflective Preference Optimization (RPO): Enhancing On-Policy Alignment via Hint-Guided Reflection

Dec 15, 2025
Via arXiv

S2ML: Spatio-Spectral Mutual Learning for Depth Completion

Nov 08, 2025
Via arXiv

Learning Free Token Reduction for Multi-Modal LLM

Jan 29, 2025
Via arXiv