Ali Emami

Reasoning Traces Shape Outputs but Models Won't Say So

Mar 21, 2026

SCOPE: Selective Conformal Optimized Pairwise LLM Judging

Feb 13, 2026

Common to Whom? Regional Cultural Commonsense and LLM Bias in India

Jan 22, 2026

The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts

Oct 23, 2025

Personality Matters: User Traits Predict LLM Preferences in Multi-Turn Collaborative Tasks

Aug 29, 2025

The World According to LLMs: How Geographic Origin Influences LLMs' Entity Deduction Capabilities

Aug 07, 2025

Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition

Apr 30, 2025

TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models

Apr 10, 2025

Fine-Tuned LLMs are "Time Capsules" for Tracking Societal Bias Through Books

Feb 07, 2025

Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts

Feb 05, 2025