Pouya Pezeshkpour

Shammie

AutoPyVerifier: Learning Compact Executable Verifiers for Large Language Model Outputs

Apr 24, 2026
Via arXiv

Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications

Apr 16, 2026
Via arXiv

Align then Train: Efficient Retrieval Adapter Learning

Apr 03, 2026
Via arXiv

Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models

Feb 10, 2026
Via arXiv

From Task Solving to Robust Real-World Adaptation in LLM Agents

Feb 02, 2026
Via arXiv

From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models

Nov 14, 2025
Via arXiv

Towards Reliable Benchmarking: A Contamination Free, Controllable Evaluation Framework for Multi-step LLM Function Calling

Sep 30, 2025
Via arXiv

Mixed Signals: Decoding VLMs' Reasoning and Underlying Bias in Vision-Language Conflict

Apr 11, 2025
Via arXiv

Insight-RAG: Enhancing LLMs with Insight-Driven Augmentation

Mar 31, 2025
Via arXiv

Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education

Mar 24, 2025
Via arXiv