Yan Li

University of Minnesota

AcademiClaw: When Students Set Challenges for AI Agents

May 04, 2026
Via arXiv

A General Representation-Based Approach to Multi-Source Domain Adaptation

Apr 26, 2026
Via arXiv

ST-$π$: Structured SpatioTemporal VLA for Robotic Manipulation

Apr 20, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding

Apr 13, 2026
Via arXiv

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Mar 26, 2026
Via arXiv

AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba

Mar 19, 2026
Via arXiv

Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models

Mar 16, 2026
Via arXiv

EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation

Mar 12, 2026
Via arXiv

EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection

Mar 05, 2026
Via arXiv