Picture for Hua Wei

Hua Wei

Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering

Add code
May 19, 2026
Viaarxiv icon

ShadeBench: A Benchmark Dataset for Building Shade Simulation in Sustainable Society

Add code
May 19, 2026
Viaarxiv icon

Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

Add code
May 19, 2026
Viaarxiv icon

LEMON: Learning Executable Multi-Agent Orchestration via Counterfactual Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

When Simulation Lies: A Sim-to-Real Benchmark and Domain-Randomized RL Recipe for Tool-Use Agents

Add code
May 12, 2026
Viaarxiv icon

Are LLM Uncertainty and Correctness Encoded by the Same Features? A Functional Dissociation via Sparse Autoencoders

Add code
Apr 21, 2026
Viaarxiv icon

Every Response Counts: Quantifying Uncertainty of LLM-based Multi-Agent Systems through Tensor Decomposition

Add code
Apr 09, 2026
Viaarxiv icon

LangMARL: Natural Language Multi-Agent Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards

Add code
Feb 25, 2026
Viaarxiv icon

Conformal Feedback Alignment: Quantifying Answer-Level Reliability for Robust LLM Alignment

Add code
Jan 24, 2026
Viaarxiv icon