Picture for Xuelong Li

Xuelong Li

Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression

Add code
Nov 13, 2025
Viaarxiv icon

Exploring the Underwater World Segmentation without Extra Training

Add code
Nov 11, 2025
Viaarxiv icon

ScRPO: From Errors to Insights

Add code
Nov 11, 2025
Viaarxiv icon

Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models

Add code
Oct 22, 2025
Viaarxiv icon

FastUMI-100K: Advancing Data-driven Robotic Manipulation with a Large-scale UMI-style Dataset

Add code
Oct 09, 2025
Viaarxiv icon

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation

Add code
Oct 09, 2025
Figure 1 for Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Figure 2 for Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Figure 3 for Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Figure 4 for Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Viaarxiv icon

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Add code
Aug 27, 2025
Viaarxiv icon

ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection

Add code
Aug 24, 2025
Figure 1 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 2 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 3 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Figure 4 for ERF-BA-TFD+: A Multimodal Model for Audio-Visual Deepfake Detection
Viaarxiv icon

InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild

Add code
Aug 14, 2025
Viaarxiv icon

Safe Semantics, Unsafe Interpretations: Tackling Implicit Reasoning Safety in Large Vision-Language Models

Add code
Aug 12, 2025
Viaarxiv icon