Qingcheng Zeng

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Oct 06, 2025
Via arXiv

DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router

Jul 30, 2025
Via arXiv

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models

May 26, 2025
Via arXiv

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

May 24, 2025
Via arXiv

CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs

May 16, 2025
Via arXiv

Large Language Models Are More Persuasive Than Incentivized Human Persuaders

May 14, 2025
Via arXiv

Do Reasoning Models Show Better Verbalized Calibration?

Apr 09, 2025
Via arXiv

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Mar 13, 2025
Via arXiv

SCORE: Saturated Consensus Relocalization in Semantic Line Maps

Mar 05, 2025
Via arXiv

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Feb 22, 2025
Via arXiv