Xu Li

Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, China

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Jun 06, 2025
Via arXiv

Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction

May 30, 2025
Via arXiv

MM-Prompt: Cross-Modal Prompt Tuning for Continual Visual Question Answering

May 26, 2025
Via arXiv

AutoGEEval: A Multimodal and Automated Framework for Geospatial Code Generation on GEE with Large Language Models

May 19, 2025
Via arXiv

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Mar 11, 2025
Via arXiv

NoT: Federated Unlearning via Weight Negation

Mar 07, 2025
Via arXiv

Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning

Feb 19, 2025
Via arXiv

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025
Via arXiv

Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification

Jan 25, 2025
Via arXiv

Global Semantic-Guided Sub-image Feature Weight Allocation in High-Resolution Large Vision-Language Models

Jan 24, 2025
Via arXiv