Picture for Baining Zhao

Baining Zhao

iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 06, 2026
Viaarxiv icon

A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 05, 2026
Viaarxiv icon

How Far Are Large Multimodal Models from Human-Level Spatial Action? A Benchmark for Goal-Oriented Embodied Navigation in Urban Airspace

Add code
Apr 09, 2026
Viaarxiv icon

Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space

Add code
Dec 26, 2025
Viaarxiv icon

KEVER^2: Knowledge-Enhanced Visual Emotion Reasoning and Retrieval

Add code
May 30, 2025
Viaarxiv icon

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

Add code
May 30, 2025
Viaarxiv icon

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

Add code
May 08, 2025
Viaarxiv icon

Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning

Add code
Apr 17, 2025
Viaarxiv icon

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Add code
Mar 08, 2025
Viaarxiv icon

EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Add code
Oct 12, 2024
Viaarxiv icon