Picture for Richang Hong

Richang Hong

Learning Speaker-Invariant Visual Features for Lipreading

Add code
Jun 09, 2025
Viaarxiv icon

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

Rebalancing Contrastive Alignment with Learnable Semantic Gaps in Text-Video Retrieval

Add code
May 18, 2025
Viaarxiv icon

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

Add code
May 05, 2025
Viaarxiv icon

Invariance Matters: Empowering Social Recommendation via Graph Invariant Learning

Add code
Apr 14, 2025
Viaarxiv icon

A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions

Add code
Apr 12, 2025
Viaarxiv icon

Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA

Add code
Apr 08, 2025
Viaarxiv icon

Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering

Add code
Apr 06, 2025
Viaarxiv icon

Generalized Kullback-Leibler Divergence Loss

Add code
Mar 11, 2025
Viaarxiv icon

EgoBlind: Towards Egocentric Visual Assistance for the Blind People

Add code
Mar 11, 2025
Viaarxiv icon