Kaidi Xu

LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks

Jul 27, 2025
Via arXiv

Neural Collapse based Deep Supervised Federated Learning for Signal Detection in OFDM Systems

Jun 24, 2025
Via arXiv

SConU: Selective Conformal Uncertainty in Large Language Models

Apr 19, 2025
Via arXiv

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Mar 14, 2025
Via arXiv

DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation

Mar 13, 2025
Via arXiv

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Mar 13, 2025
Via arXiv

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Feb 20, 2025
Via arXiv

Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs

Feb 11, 2025
Via arXiv

GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing

Feb 10, 2025
Via arXiv

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak

Jan 23, 2025
Via arXiv