Picture for Yong Li

Yong Li

Tsinghua University

Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning

Add code
Apr 17, 2025
Viaarxiv icon

A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science

Add code
Apr 14, 2025
Viaarxiv icon

GeoNav: Empowering MLLMs with Explicit Geospatial Reasoning Abilities for Language-Goal Aerial Navigation

Add code
Apr 13, 2025
Viaarxiv icon

An Evaluation of Cultural Value Alignment in LLM

Add code
Apr 11, 2025
Viaarxiv icon

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Add code
Apr 06, 2025
Viaarxiv icon

CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching

Add code
Mar 28, 2025
Viaarxiv icon

Wan: Open and Advanced Large-Scale Video Generative Models

Add code
Mar 26, 2025
Viaarxiv icon

Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Add code
Mar 14, 2025
Viaarxiv icon

Beyond Overfitting: Doubly Adaptive Dropout for Generalizable AU Detection

Add code
Mar 12, 2025
Viaarxiv icon

Decoupled Doubly Contrastive Learning for Cross Domain Facial Action Unit Detection

Add code
Mar 12, 2025
Viaarxiv icon