Picture for Xuelong Li

Xuelong Li

Do MLLMs Really See It: Reinforcing Visual Attention in Multimodal LLMs

Add code
Feb 09, 2026
Viaarxiv icon

TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

Add code
Feb 07, 2026
Viaarxiv icon

TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation

Add code
Feb 07, 2026
Viaarxiv icon

Boosting SAM for Cross-Domain Few-Shot Segmentation via Conditional Point Sparsification

Add code
Feb 05, 2026
Viaarxiv icon

Learning Soccer Skills for Humanoid Robots: A Progressive Perception-Action Framework

Add code
Feb 05, 2026
Viaarxiv icon

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

Add code
Feb 05, 2026
Viaarxiv icon

Point2Insert: Video Object Insertion via Sparse Point Guidance

Add code
Feb 04, 2026
Viaarxiv icon

Understanding Degradation with Vision Language Model

Add code
Feb 04, 2026
Viaarxiv icon

HUSKY: Humanoid Skateboarding System via Physics-Aware Whole-Body Control

Add code
Feb 03, 2026
Viaarxiv icon

High-Fidelity Generative Audio Compression at 0.275kbps

Add code
Jan 31, 2026
Viaarxiv icon