Picture for Xuelong Li

Xuelong Li

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security

Add code
Jul 29, 2025
Viaarxiv icon

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Add code
Jul 24, 2025
Viaarxiv icon

Technical Report of TeleChat2, TeleChat2.5 and T1

Add code
Jul 24, 2025
Figure 1 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 2 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 3 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 4 for Technical Report of TeleChat2, TeleChat2.5 and T1
Viaarxiv icon

BoSS: Beyond-Semantic Speech

Add code
Jul 23, 2025
Viaarxiv icon

KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills

Add code
Jun 15, 2025
Viaarxiv icon

AI Flow: Perspectives, Scenarios, and Approaches

Add code
Jun 14, 2025
Viaarxiv icon

MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains

Add code
Jun 12, 2025
Figure 1 for MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains
Figure 2 for MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains
Figure 3 for MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains
Figure 4 for MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains
Viaarxiv icon

Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations

Add code
Jun 09, 2025
Figure 1 for Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Figure 2 for Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Figure 3 for Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Figure 4 for Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Viaarxiv icon

LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

Add code
Jun 09, 2025
Viaarxiv icon

WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code

Add code
Jun 09, 2025
Figure 1 for WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Figure 2 for WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Figure 3 for WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Figure 4 for WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Viaarxiv icon