Picture for Dan Roth

Dan Roth

Shammie

Is Code Better Than Language for Algorithmic Reasoning

Add code
Jun 14, 2026
Viaarxiv icon

SOMA-SQL: Resolving Multi-Source Ambiguity in NL-to-SQL via Synthetic Log and Execution Probing

Add code
Jun 09, 2026
Viaarxiv icon

Do Image-Text Metrics Respect Semantic Invariances?

Add code
May 23, 2026
Viaarxiv icon

Formalize, Don't Optimize: The Heuristic Trap in LLM-Generated Combinatorial Solvers

Add code
May 12, 2026
Viaarxiv icon

Robust Audio-Text Retrieval via Cross-Modal Attention and Hybrid Loss

Add code
Apr 25, 2026
Viaarxiv icon

When Vision-Language Models Judge Without Seeing: Exposing Informativeness Bias

Add code
Apr 20, 2026
Viaarxiv icon

SPENCE: A Syntactic Probe for Detecting Contamination in NL2SQL Benchmarks

Add code
Apr 20, 2026
Viaarxiv icon

JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents

Add code
Apr 20, 2026
Viaarxiv icon

MT-OSC: Path for LLMs that Get Lost in Multi-Turn Conversation

Add code
Apr 09, 2026
Viaarxiv icon

DiffuMask: Diffusion Language Model for Token-level Prompt Pruning

Add code
Apr 08, 2026
Viaarxiv icon