Picture for Zi Huang

Zi Huang

Generative Recall, Dense Reranking: Learning Multi-View Semantic IDs for Efficient Text-to-Video Retrieval

Add code
Jan 29, 2026
Viaarxiv icon

Integrating Vision-Centric Text Understanding for Conversational Recommender Systems

Add code
Jan 20, 2026
Viaarxiv icon

Hierarchical Refinement of Universal Multimodal Attacks on Vision-Language Models

Add code
Jan 15, 2026
Viaarxiv icon

Distributed Zero-Shot Learning for Visual Recognition

Add code
Nov 11, 2025
Viaarxiv icon

ReaKase-8B: Legal Case Retrieval via Knowledge and Reasoning Representations with LLMs

Add code
Oct 30, 2025
Viaarxiv icon

Does Homophily Help in Robust Test-time Node Classification?

Add code
Oct 25, 2025
Viaarxiv icon

ContextNav: Towards Agentic Multimodal In-Context Learning

Add code
Oct 06, 2025
Figure 1 for ContextNav: Towards Agentic Multimodal In-Context Learning
Figure 2 for ContextNav: Towards Agentic Multimodal In-Context Learning
Figure 3 for ContextNav: Towards Agentic Multimodal In-Context Learning
Figure 4 for ContextNav: Towards Agentic Multimodal In-Context Learning
Viaarxiv icon

ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation

Add code
Aug 27, 2025
Viaarxiv icon

Text Meets Topology: Rethinking Out-of-distribution Detection in Text-Rich Networks

Add code
Aug 25, 2025
Viaarxiv icon

UQLegalAI@COLIEE2025: Advancing Legal Case Retrieval with Large Language Models and Graph Neural Networks

Add code
May 27, 2025
Viaarxiv icon