Picture for Sarath Chandar

Sarath Chandar

EpiK-Eval: Evaluation for Language Models as Epistemic Models

Add code
Oct 23, 2023
Viaarxiv icon

Faithfulness Measurable Masked Language Models

Add code
Oct 11, 2023
Viaarxiv icon

Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

Add code
Aug 20, 2023
Figure 1 for Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Figure 2 for Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Figure 3 for Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Figure 4 for Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Viaarxiv icon

Lookbehind Optimizer: k steps back, 1 step forward

Add code
Jul 31, 2023
Viaarxiv icon

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

Add code
Jul 18, 2023
Viaarxiv icon

Thompson sampling for improved exploration in GFlowNets

Add code
Jun 30, 2023
Viaarxiv icon

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

Add code
May 24, 2023
Viaarxiv icon

Should We Attend More or Less? Modulating Attention for Fairness

Add code
May 22, 2023
Figure 1 for Should We Attend More or Less? Modulating Attention for Fairness
Figure 2 for Should We Attend More or Less? Modulating Attention for Fairness
Figure 3 for Should We Attend More or Less? Modulating Attention for Fairness
Figure 4 for Should We Attend More or Less? Modulating Attention for Fairness
Viaarxiv icon

Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

Add code
Mar 16, 2023
Figure 1 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 2 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 3 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 4 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

Add code
Mar 15, 2023
Viaarxiv icon