Picture for Shaoan Zhao

Shaoan Zhao

HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment

Add code
Nov 10, 2025
Viaarxiv icon