Picture for Tao Jin

Tao Jin

Andrew

Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning

Add code
Apr 15, 2026
Viaarxiv icon

From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning

Add code
Apr 12, 2026
Viaarxiv icon

A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning

Add code
Apr 12, 2026
Viaarxiv icon

ImVideoEdit: Image-learning Video Editing via 2D Spatial Difference Attention Blocks

Add code
Apr 09, 2026
Viaarxiv icon

Goose: Anisotropic Speculation Trees for Training-Free Speculative Decoding

Add code
Apr 02, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Add code
Mar 03, 2026
Viaarxiv icon

Decoupling Stability and Plasticity for Multi-Modal Test-Time Adaptation

Add code
Feb 28, 2026
Viaarxiv icon

WorldEdit: Towards Open-World Image Editing with a Knowledge-Informed Benchmark

Add code
Feb 06, 2026
Viaarxiv icon

HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval

Add code
Jan 22, 2026
Viaarxiv icon