Picture for Boyuan Zheng

Boyuan Zheng

Interpretable Robotic Manipulation from Language

Add code
May 27, 2024
Figure 1 for Interpretable Robotic Manipulation from Language
Figure 2 for Interpretable Robotic Manipulation from Language
Figure 3 for Interpretable Robotic Manipulation from Language
Figure 4 for Interpretable Robotic Manipulation from Language
Viaarxiv icon

Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning

Add code
May 20, 2024
Figure 1 for Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning
Figure 2 for Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning
Figure 3 for Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning
Figure 4 for Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning
Viaarxiv icon

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

Add code
Feb 15, 2024
Viaarxiv icon

Dual-View Visual Contextualization for Web Navigation

Add code
Feb 06, 2024
Viaarxiv icon

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

Add code
Jan 23, 2024
Viaarxiv icon

GPT-4V is a Generalist Web Agent, if Grounded

Add code
Jan 03, 2024
Figure 1 for GPT-4V is a Generalist Web Agent, if Grounded
Figure 2 for GPT-4V is a Generalist Web Agent, if Grounded
Figure 3 for GPT-4V is a Generalist Web Agent, if Grounded
Figure 4 for GPT-4V is a Generalist Web Agent, if Grounded
Viaarxiv icon

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Add code
Nov 27, 2023
Figure 1 for MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Figure 2 for MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Figure 3 for MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Figure 4 for MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Viaarxiv icon

Mind2Web: Towards a Generalist Agent for the Web

Add code
Jun 15, 2023
Figure 1 for Mind2Web: Towards a Generalist Agent for the Web
Figure 2 for Mind2Web: Towards a Generalist Agent for the Web
Figure 3 for Mind2Web: Towards a Generalist Agent for the Web
Figure 4 for Mind2Web: Towards a Generalist Agent for the Web
Viaarxiv icon

Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency

Add code
May 18, 2023
Figure 1 for Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Figure 2 for Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Figure 3 for Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Figure 4 for Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Viaarxiv icon

Explaining Imitation Learning through Frames

Add code
Jan 03, 2023
Figure 1 for Explaining Imitation Learning through Frames
Figure 2 for Explaining Imitation Learning through Frames
Figure 3 for Explaining Imitation Learning through Frames
Figure 4 for Explaining Imitation Learning through Frames
Viaarxiv icon