Picture for Richard Zemel

Richard Zemel

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

Add code
Apr 21, 2026
Viaarxiv icon

Level Up: Defining and Exploiting Transitional Problems for Curriculum Learning

Add code
Mar 14, 2026
Viaarxiv icon

Tell Me What To Learn: Generalizing Neural Memory to be Controllable in Natural Language

Add code
Feb 26, 2026
Viaarxiv icon

Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions

Add code
Feb 15, 2026
Viaarxiv icon

Few-Shot Design Optimization by Exploiting Auxiliary Information

Add code
Feb 12, 2026
Viaarxiv icon

Let the Experts Speak: Improving Survival Prediction & Calibration via Mixture-of-Experts Heads

Add code
Nov 11, 2025
Viaarxiv icon

Confidence Calibration in Vision-Language-Action Models

Add code
Jul 23, 2025
Viaarxiv icon

Guiding LLM Decision-Making with Fairness Reward Models

Add code
Jul 15, 2025
Figure 1 for Guiding LLM Decision-Making with Fairness Reward Models
Figure 2 for Guiding LLM Decision-Making with Fairness Reward Models
Figure 3 for Guiding LLM Decision-Making with Fairness Reward Models
Figure 4 for Guiding LLM Decision-Making with Fairness Reward Models
Viaarxiv icon

Replay Can Provably Increase Forgetting

Add code
Jun 04, 2025
Viaarxiv icon

Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation

Add code
May 27, 2025
Viaarxiv icon