Picture for Yihao Feng

Yihao Feng

Longhorn: State Space Models are Amortized Online Learners

Add code
Jul 19, 2024
Viaarxiv icon

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Add code
Jun 26, 2024
Viaarxiv icon

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Figure 1 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 2 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 3 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 4 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Viaarxiv icon

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability

Add code
Feb 28, 2024
Figure 1 for FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
Figure 2 for FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
Figure 3 for FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
Figure 4 for FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
Viaarxiv icon

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Add code
Feb 26, 2024
Figure 1 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 2 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 3 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 4 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Viaarxiv icon

Text2Data: Low-Resource Data Generation with Textual Control

Add code
Feb 08, 2024
Figure 1 for Text2Data: Low-Resource Data Generation with Textual Control
Figure 2 for Text2Data: Low-Resource Data Generation with Textual Control
Figure 3 for Text2Data: Low-Resource Data Generation with Textual Control
Figure 4 for Text2Data: Low-Resource Data Generation with Textual Control
Viaarxiv icon

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

Add code
Aug 11, 2023
Figure 1 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 2 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 3 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Figure 4 for BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Viaarxiv icon

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Add code
Aug 04, 2023
Figure 1 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 2 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 3 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Figure 4 for Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Viaarxiv icon

REX: Rapid Exploration and eXploitation for AI Agents

Add code
Jul 18, 2023
Figure 1 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 2 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 3 for REX: Rapid Exploration and eXploitation for AI Agents
Figure 4 for REX: Rapid Exploration and eXploitation for AI Agents
Viaarxiv icon

FAMO: Fast Adaptive Multitask Optimization

Add code
Jun 06, 2023
Figure 1 for FAMO: Fast Adaptive Multitask Optimization
Figure 2 for FAMO: Fast Adaptive Multitask Optimization
Figure 3 for FAMO: Fast Adaptive Multitask Optimization
Figure 4 for FAMO: Fast Adaptive Multitask Optimization
Viaarxiv icon