Picture for Lu Cheng

Lu Cheng

EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning

Add code
Jan 06, 2026
Viaarxiv icon

QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation

Add code
Dec 22, 2025
Viaarxiv icon

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

Add code
Nov 18, 2025
Figure 1 for MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Figure 2 for MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Figure 3 for MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Figure 4 for MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
Viaarxiv icon

Revisiting NLI: Towards Cost-Effective and Human-Aligned Metrics for Evaluating LLMs in Question Answering

Add code
Nov 10, 2025
Viaarxiv icon

Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining

Add code
Oct 27, 2025
Figure 1 for Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Figure 2 for Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Figure 3 for Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Figure 4 for Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Viaarxiv icon

Credence Calibration Game? Calibrating Large Language Models through Structured Play

Add code
Aug 20, 2025
Viaarxiv icon

CP-Router: An Uncertainty-Aware Router Between LLM and LRM

Add code
May 26, 2025
Viaarxiv icon

Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers

Add code
Mar 04, 2025
Figure 1 for Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Figure 2 for Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Figure 3 for Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Figure 4 for Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
Viaarxiv icon

DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models

Add code
Feb 25, 2025
Figure 1 for DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Figure 2 for DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Figure 3 for DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Figure 4 for DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Viaarxiv icon

Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology

Add code
Feb 24, 2025
Viaarxiv icon