Picture for Pouya Pezeshkpour

Pouya Pezeshkpour

Shammie

LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs

Add code
Jun 07, 2024
Viaarxiv icon

A Blueprint Architecture of Compound AI Systems for Enterprise

Add code
Jun 02, 2024
Viaarxiv icon

Multi-Conditional Ranking with Large Language Models

Add code
Mar 30, 2024
Viaarxiv icon

Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

Add code
Feb 02, 2024
Viaarxiv icon

Distilling Large Language Models using Skill-Occupation Graph Context for HR-Related Tasks

Add code
Nov 10, 2023
Viaarxiv icon

Less is More for Long Document Summary Evaluation by LLMs

Add code
Sep 14, 2023
Figure 1 for Less is More for Long Document Summary Evaluation by LLMs
Figure 2 for Less is More for Long Document Summary Evaluation by LLMs
Figure 3 for Less is More for Long Document Summary Evaluation by LLMs
Figure 4 for Less is More for Long Document Summary Evaluation by LLMs
Viaarxiv icon

Rethinking Language Models as Symbolic Knowledge Graphs

Add code
Aug 25, 2023
Figure 1 for Rethinking Language Models as Symbolic Knowledge Graphs
Figure 2 for Rethinking Language Models as Symbolic Knowledge Graphs
Figure 3 for Rethinking Language Models as Symbolic Knowledge Graphs
Figure 4 for Rethinking Language Models as Symbolic Knowledge Graphs
Viaarxiv icon

Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions

Add code
Aug 22, 2023
Figure 1 for Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions
Figure 2 for Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions
Figure 3 for Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions
Figure 4 for Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions
Viaarxiv icon

Measuring and Modifying Factual Knowledge in Large Language Models

Add code
Jun 09, 2023
Figure 1 for Measuring and Modifying Factual Knowledge in Large Language Models
Figure 2 for Measuring and Modifying Factual Knowledge in Large Language Models
Figure 3 for Measuring and Modifying Factual Knowledge in Large Language Models
Figure 4 for Measuring and Modifying Factual Knowledge in Large Language Models
Viaarxiv icon

Quantifying Social Biases Using Templates is Unreliable

Add code
Oct 09, 2022
Figure 1 for Quantifying Social Biases Using Templates is Unreliable
Figure 2 for Quantifying Social Biases Using Templates is Unreliable
Figure 3 for Quantifying Social Biases Using Templates is Unreliable
Figure 4 for Quantifying Social Biases Using Templates is Unreliable
Viaarxiv icon