Yan Li

University of Minnesota

LiWi: Layering in the Wild

May 14, 2026
Via arXiv

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

May 12, 2026
Via arXiv

From Trajectories to Phenotypes: Disease Progression as Structural Priors for Multi-organ Imaging Representation Learning

May 12, 2026
Via arXiv

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

May 11, 2026
Via arXiv

AcademiClaw: When Students Set Challenges for AI Agents

May 04, 2026
Via arXiv

A General Representation-Based Approach to Multi-Source Domain Adaptation

Apr 26, 2026
Via arXiv

ST-$π$: Structured SpatioTemporal VLA for Robotic Manipulation

Apr 20, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding

Apr 13, 2026
Via arXiv

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Mar 26, 2026
Via arXiv