Picture for Yihao Feng

Yihao Feng

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Viaarxiv icon

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability

Add code
Feb 28, 2024
Viaarxiv icon

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Add code
Feb 26, 2024
Viaarxiv icon

Text2Data: Low-Resource Data Generation with Textual Control

Add code
Feb 08, 2024
Viaarxiv icon

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

Add code
Aug 11, 2023
Figure 1 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 2 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 3 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 4 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Viaarxiv icon

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Add code
Aug 04, 2023
Figure 1 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 2 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 3 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 4 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Viaarxiv icon

REX: Rapid Exploration and eXploitation for AI Agents

Add code
Jul 18, 2023
Figure 1 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 2 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 3 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 4 for REX: Rapid Exploration and eXploitation for AI Agents
Viaarxiv icon

FAMO: Fast Adaptive Multitask Optimization

Add code
Jun 06, 2023
Figure 1 for FAMO: Fast Adaptive Multitask Optimization
Figure 2 for FAMO: Fast Adaptive Multitask Optimization
Figure 3 for FAMO: Fast Adaptive Multitask Optimization
Figure 4 for FAMO: Fast Adaptive Multitask Optimization
Viaarxiv icon

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

Add code
Jun 05, 2023
Figure 1 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 2 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 3 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Figure 4 for LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Viaarxiv icon

Preference-grounded Token-level Guidance for Language Model Fine-tuning

Add code
Jun 01, 2023
Figure 1 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 2 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 3 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Figure 4 for Preference-grounded Token-level Guidance for Language Model Fine-tuning
Viaarxiv icon