Weiqi Wang

Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty

Aug 12, 2025 (via arXiv)

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Jul 27, 2025 (via arXiv)

Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?

May 30, 2025 (via arXiv)

INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

May 22, 2025 (via arXiv)

EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association

May 21, 2025 (via arXiv)

Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents

May 20, 2025 (via arXiv)

From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

May 19, 2025 (via arXiv)

Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study

May 12, 2025 (via arXiv)

Science Hierarchography: Hierarchical Organization of Science Literature

Apr 18, 2025 (via arXiv)

Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

Apr 14, 2025 (via arXiv)