Picture for Ashwin Kalyan

Ashwin Kalyan

Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

Add code
May 07, 2024
Figure 1 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 2 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 3 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Figure 4 for Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Viaarxiv icon

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Apr 12, 2024
Figure 1 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 2 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 3 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 4 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Viaarxiv icon

GEO: Generative Engine Optimization

Add code
Nov 16, 2023
Viaarxiv icon

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Add code
Nov 08, 2023
Figure 1 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 2 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 3 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 4 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Viaarxiv icon

QualEval: Qualitative Evaluation for Model Improvement

Add code
Nov 06, 2023
Viaarxiv icon

Estimating Numbers without Regression

Add code
Oct 09, 2023
Figure 1 for Estimating Numbers without Regression
Figure 2 for Estimating Numbers without Regression
Figure 3 for Estimating Numbers without Regression
Figure 4 for Estimating Numbers without Regression
Viaarxiv icon

Distraction-free Embeddings for Robust VQA

Add code
Aug 31, 2023
Figure 1 for Distraction-free Embeddings for Robust VQA
Figure 2 for Distraction-free Embeddings for Robust VQA
Figure 3 for Distraction-free Embeddings for Robust VQA
Figure 4 for Distraction-free Embeddings for Robust VQA
Viaarxiv icon

Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations

Add code
Aug 07, 2023
Figure 1 for Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Figure 2 for Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Figure 3 for Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Figure 4 for Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations
Viaarxiv icon

CSTS: Conditional Semantic Textual Similarity

Add code
May 24, 2023
Figure 1 for CSTS: Conditional Semantic Textual Similarity
Figure 2 for CSTS: Conditional Semantic Textual Similarity
Figure 3 for CSTS: Conditional Semantic Textual Similarity
Figure 4 for CSTS: Conditional Semantic Textual Similarity
Viaarxiv icon

Anthropomorphization of AI: Opportunities and Risks

Add code
May 24, 2023
Figure 1 for Anthropomorphization of AI: Opportunities and Risks
Figure 2 for Anthropomorphization of AI: Opportunities and Risks
Figure 3 for Anthropomorphization of AI: Opportunities and Risks
Viaarxiv icon