Picture for Xiaohui Li

Xiaohui Li

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Add code
Dec 25, 2025
Viaarxiv icon

MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

Add code
Nov 13, 2025
Viaarxiv icon

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Add code
Oct 14, 2025
Viaarxiv icon

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Add code
Aug 10, 2025
Figure 1 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 2 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 3 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Figure 4 for Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance
Viaarxiv icon

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Add code
Jul 10, 2025
Figure 1 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 2 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 3 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 4 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Viaarxiv icon

APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs

Add code
Jun 09, 2025
Viaarxiv icon

Road Similarity-Based BEV-Satellite Image Matching for UGV Localization

Add code
Apr 23, 2025
Viaarxiv icon

Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving

Add code
Apr 16, 2025
Figure 1 for Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving
Figure 2 for Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving
Figure 3 for Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving
Figure 4 for Self-Supervised Traversability Learning with Online Prototype Adaptation for Off-Road Autonomous Driving
Viaarxiv icon

EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation

Add code
Mar 26, 2025
Viaarxiv icon

Research on the Offshore Marine Communication Environment Based on Satellite Remote Sensing Data

Add code
Feb 19, 2025
Viaarxiv icon