Picture for Yi Li

Yi Li

Victor

Conditional Multi-Event Temporal Grounding in Long-Form Video

Add code
Jun 13, 2026
Viaarxiv icon

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Add code
Jun 08, 2026
Viaarxiv icon

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting

Add code
Jun 03, 2026
Viaarxiv icon

An Attribute-Based Measure of Video Complexity

Add code
May 30, 2026
Viaarxiv icon

DarkForest: Less Talk, Higher Accuracy for Multi-Agent LLMs

Add code
May 24, 2026
Viaarxiv icon

ProtoMedAgent: Multimodal Clinical Interpretability via Privacy-Aware Agentic Workflows

Add code
May 13, 2026
Viaarxiv icon

HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Add code
May 11, 2026
Viaarxiv icon

BGG: Bridging the Geometric Gap between Cross-View images by Vision Foundation Model Adaptation for Geo-Localization

Add code
May 11, 2026
Viaarxiv icon

LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models

Add code
May 10, 2026
Viaarxiv icon