Abulhair Saparov

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Jul 02, 2024
Via arXiv

LLMs Are Prone to Fallacies in Causal Inference

Jun 18, 2024
Via arXiv

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Apr 15, 2024
Via arXiv

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

Jan 31, 2024
Via arXiv

Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis

Nov 01, 2023
Via arXiv

Personas as a Way to Model Truthfulness in Language Models

Oct 30, 2023
Via arXiv

Retrieval-Augmented Chain-of-Thought in Semi-structured Domains

Oct 22, 2023
Via arXiv

World Models for Math Story Problems

Jun 07, 2023
Via arXiv

Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples

May 24, 2023
Via arXiv

Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought

Oct 03, 2022
Via arXiv