Victor Zhong

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024
Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu

Via arXiv

Policy Improvement using Language Feedback Models

Feb 25, 2024
Victor Zhong, Dipendra Misra, Xingdi Yuan, Marc-Alexandre Côté

Via arXiv

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

Sep 21, 2023
Tianbao Xie, Siheng Zhao, Chen Henry Wu, Yitao Liu, Qian Luo, Victor Zhong, Yanchao Yang, Tao Yu

Via arXiv

When Not to Trust Language Models: Investigating Effectiveness and Limitations of Parametric and Non-Parametric Memories

Dec 20, 2022
Alex Mallen, Akari Asai, Victor Zhong, Rajarshi Das, Hannaneh Hajishirzi, Daniel Khashabi

Via arXiv

RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering

Oct 25, 2022
Victor Zhong, Weijia Shi, Wen-tau Yih, Luke Zettlemoyer

Via arXiv

M2D2: A Massively Multi-domain Language Modeling Dataset

Oct 13, 2022
Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer

Via arXiv

Improving Policy Learning via Language Dynamics Distillation

Sep 30, 2022
Victor Zhong, Jesse Mu, Luke Zettlemoyer, Edward Grefenstette, Tim Rocktäschel

Via arXiv