Picture for Yuheng Lu

Yuheng Lu

Separate First, Fuse Later: Mitigating Cross-Modal Interference in Audio-Visual LLMs Reasoning with Modality-Specific Chain-of-Thought

Add code
May 11, 2026
Viaarxiv icon

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Add code
May 10, 2026
Viaarxiv icon

Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers

Add code
Apr 19, 2026
Viaarxiv icon

TCM-Eval: An Expert-Level Dynamic and Extensible Benchmark for Traditional Chinese Medicine

Add code
Nov 10, 2025
Viaarxiv icon

TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments

Add code
May 23, 2025
Viaarxiv icon

Enhancing Complex Instruction Following for Large Language Models with Mixture-of-Contexts Fine-tuning

Add code
May 17, 2025
Viaarxiv icon

Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models

Add code
Oct 22, 2024
Figure 1 for Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models
Figure 2 for Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models
Figure 3 for Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models
Figure 4 for Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models
Viaarxiv icon

Open-Vocabulary Point-Cloud Object Detection without 3D Annotation

Add code
Apr 03, 2023
Figure 1 for Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Figure 2 for Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Figure 3 for Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Figure 4 for Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Viaarxiv icon

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection

Add code
Mar 14, 2023
Figure 1 for PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Figure 2 for PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Figure 3 for PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Figure 4 for PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Viaarxiv icon

Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning

Add code
Jul 05, 2022
Figure 1 for Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning
Figure 2 for Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning
Figure 3 for Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning
Figure 4 for Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning
Viaarxiv icon