Picture for Alane Suhr

Alane Suhr

Using Language Models to Disambiguate Lexical Choices in Translation

Add code
Nov 08, 2024
Figure 1 for Using Language Models to Disambiguate Lexical Choices in Translation
Figure 2 for Using Language Models to Disambiguate Lexical Choices in Translation
Figure 3 for Using Language Models to Disambiguate Lexical Choices in Translation
Figure 4 for Using Language Models to Disambiguate Lexical Choices in Translation
Viaarxiv icon

Grounding Language in Multi-Perspective Referential Communication

Add code
Oct 04, 2024
Viaarxiv icon

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Add code
Jun 14, 2024
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Figure 1 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 2 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 3 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 4 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Viaarxiv icon

Autonomous Evaluation and Refinement of Digital Agents

Add code
Apr 10, 2024
Viaarxiv icon

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Add code
Nov 14, 2023
Viaarxiv icon

What's In My Big Data?

Add code
Oct 31, 2023
Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting

Add code
Oct 17, 2023
Figure 1 for Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Figure 2 for Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Figure 3 for Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Figure 4 for Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Viaarxiv icon

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Add code
Jun 02, 2023
Figure 1 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 2 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 3 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 4 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Viaarxiv icon

Minding Language Models' Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker

Add code
Jun 01, 2023
Viaarxiv icon