Picture for Fang Li

Fang Li

University of Texas at Dallas

EyeMVP: OCT-Informed Fundus Representation Learning via Paired CFP--OCT Pretraining

Add code
Jun 13, 2026
Viaarxiv icon

DriveReward: A Comprehensive Dataset and Generative Vision-Language Reward Model for Autonomous Driving

Add code
Jun 07, 2026
Viaarxiv icon

LiWi: Layering in the Wild

Add code
May 14, 2026
Viaarxiv icon

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Add code
Apr 20, 2026
Viaarxiv icon

Think before Go: Hierarchical Reasoning for Image-goal Navigation

Add code
Apr 19, 2026
Viaarxiv icon

DLink: Distilling Layer-wise and Dominant Knowledge from EEG Foundation Models

Add code
Apr 16, 2026
Viaarxiv icon

Bringing Clustering to MLL: Weakly-Supervised Clustering for Partial Multi-Label Learning

Add code
Apr 10, 2026
Viaarxiv icon

DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale

Add code
Apr 01, 2026
Viaarxiv icon

NS-VLA: Towards Neuro-Symbolic Vision-Language-Action Models

Add code
Mar 10, 2026
Viaarxiv icon

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Add code
Mar 02, 2026
Viaarxiv icon