Xinlei Chen

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

May 30, 2025
Via arXiv

KEVER^2: Knowledge-Enhanced Visual Emotion Reasoning and Retrieval

May 30, 2025
Via arXiv

Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization

May 28, 2025
Via arXiv

What Can RL Bring to VLA Generalization? An Empirical Study

May 26, 2025
Via arXiv

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

May 18, 2025
Via arXiv

PRE-Mamba: A 4D State Space Model for Ultra-High-Frequent Event Camera Deraining

May 08, 2025
Via arXiv

EDmamba: A Simple yet Effective Event Denoising Method with State Space Model

May 08, 2025
Via arXiv

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

May 08, 2025
Via arXiv

Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning

Apr 17, 2025
Via arXiv

How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM

Apr 08, 2025
Via arXiv