Zexi Jia

F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

Aug 25, 2025
Via arXiv

Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation

Feb 17, 2025
Via arXiv

Semantic to Structure: Learning Structural Representations for Infringement Detection

Feb 11, 2025
Via arXiv