Picture for Xuelong Li

Xuelong Li

Ruyi2 Technical Report

Add code
Feb 26, 2026
Viaarxiv icon

Beyond Static Artifacts: A Forensic Benchmark for Video Deepfake Reasoning in Vision Language Models

Add code
Feb 25, 2026
Viaarxiv icon

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

Add code
Feb 22, 2026
Viaarxiv icon

Tele-Omni: a Unified Multimodal Framework for Video Generation and Editing

Add code
Feb 10, 2026
Viaarxiv icon

Do MLLMs Really See It: Reinforcing Visual Attention in Multimodal LLMs

Add code
Feb 09, 2026
Viaarxiv icon

TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

Add code
Feb 07, 2026
Viaarxiv icon

TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation

Add code
Feb 07, 2026
Viaarxiv icon

Boosting SAM for Cross-Domain Few-Shot Segmentation via Conditional Point Sparsification

Add code
Feb 05, 2026
Viaarxiv icon

Learning Soccer Skills for Humanoid Robots: A Progressive Perception-Action Framework

Add code
Feb 05, 2026
Viaarxiv icon

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

Add code
Feb 05, 2026
Viaarxiv icon