Baining Zhao

KEVER^2: Knowledge-Enhanced Visual Emotion Reasoning and Retrieval

May 30, 2025
Via arXiv

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

May 30, 2025
Via arXiv

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

May 08, 2025
Via arXiv

Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning

Apr 17, 2025
Via arXiv

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Mar 08, 2025
Via arXiv

EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Oct 12, 2024
Via arXiv