Marwa Abdulhai

Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL

Oct 16, 2025

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

Apr 04, 2025

Virtual Personas for Language Models via an Anthology of Backstories

Jul 09, 2024

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Nov 30, 2023

Moral Foundations of Large Language Models

Oct 23, 2023

Personality Traits in Large Language Models

Jul 01, 2023

Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience

Aug 09, 2022

Context-Specific Representation Abstraction for Deep Option Learning

Sep 20, 2021

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Oct 31, 2020