Picture for Qingcheng Zeng

Qingcheng Zeng

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models

Add code
May 26, 2025
Viaarxiv icon

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

Add code
May 24, 2025
Viaarxiv icon

CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs

Add code
May 16, 2025
Viaarxiv icon

Large Language Models Are More Persuasive Than Incentivized Human Persuaders

Add code
May 14, 2025
Viaarxiv icon

Do Reasoning Models Show Better Verbalized Calibration?

Add code
Apr 09, 2025
Viaarxiv icon

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Add code
Mar 13, 2025
Viaarxiv icon

SCORE: Saturated Consensus Relocalization in Semantic Line Maps

Add code
Mar 05, 2025
Viaarxiv icon

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Add code
Feb 22, 2025
Viaarxiv icon

Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt

Add code
Jan 17, 2025
Figure 1 for Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
Figure 2 for Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
Figure 3 for Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
Figure 4 for Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
Viaarxiv icon

Causal Micro-Narratives

Add code
Oct 07, 2024
Viaarxiv icon