Alert button
Picture for Yihao Feng

Yihao Feng

Alert button

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Bookmark button
Alert button
Apr 02, 2024
Ruohong Zhang, Liangke Gui, Zhiqing Sun, Yihao Feng, Keyang Xu, Yuanhan Zhang, Di Fu, Chunyuan Li, Alexander Hauptmann, Yonatan Bisk, Yiming Yang

Viaarxiv icon

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability

Add code
Bookmark button
Alert button
Feb 28, 2024
Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong

Viaarxiv icon

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Add code
Bookmark button
Alert button
Feb 26, 2024
Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong

Viaarxiv icon

Text2Data: Low-Resource Data Generation with Textual Control

Add code
Bookmark button
Alert button
Feb 08, 2024
Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese

Viaarxiv icon

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

Add code
Bookmark button
Alert button
Aug 11, 2023
Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

Figure 1 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 2 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 3 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 4 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Viaarxiv icon

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Add code
Bookmark button
Alert button
Aug 04, 2023
Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh Murthy, Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

Figure 1 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 2 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 3 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 4 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Viaarxiv icon

REX: Rapid Exploration and eXploitation for AI Agents

Add code
Bookmark button
Alert button
Jul 18, 2023
Rithesh Murthy, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Le Xue, Weiran Yao, Yihao Feng, Zeyuan Chen, Akash Gokul, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

Figure 1 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 2 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 3 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 4 for REX: Rapid Exploration and eXploitation for AI Agents
Viaarxiv icon

FAMO: Fast Adaptive Multitask Optimization

Add code
Bookmark button
Alert button
Jun 06, 2023
Bo Liu, Yihao Feng, Peter Stone, Qiang Liu

Figure 1 for FAMO: Fast Adaptive Multitask Optimization
Figure 2 for FAMO: Fast Adaptive Multitask Optimization
Figure 3 for FAMO: Fast Adaptive Multitask Optimization
Figure 4 for FAMO: Fast Adaptive Multitask Optimization
Viaarxiv icon

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

Add code
Bookmark button
Alert button
Jun 05, 2023
Bo Liu, Yifeng Zhu, Chongkai Gao, Yihao Feng, Qiang Liu, Yuke Zhu, Peter Stone

Figure 1 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 2 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 3 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 4 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Viaarxiv icon

Preference-grounded Token-level Guidance for Language Model Fine-tuning

Add code
Bookmark button
Alert button
Jun 01, 2023
Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou

Figure 1 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 2 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 3 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 4 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Viaarxiv icon