Picture for Qiang Hui

Qiang Hui

HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment

Add code
Nov 10, 2025
Viaarxiv icon