Picture for Jianbo Yuan

Jianbo Yuan

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing

Add code
Mar 25, 2024
Figure 1 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 2 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 3 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 4 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Viaarxiv icon

How Can LLM Guide RL? A Value-Based Approach

Add code
Feb 25, 2024
Figure 1 for How Can LLM Guide RL? A Value-Based Approach
Figure 2 for How Can LLM Guide RL? A Value-Based Approach
Figure 3 for How Can LLM Guide RL? A Value-Based Approach
Figure 4 for How Can LLM Guide RL? A Value-Based Approach
Viaarxiv icon

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning

Add code
Jan 18, 2024
Viaarxiv icon

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

Add code
Jan 10, 2024
Viaarxiv icon

InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Add code
Dec 04, 2023
Figure 1 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 2 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 3 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 4 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Viaarxiv icon

Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts

Add code
Dec 03, 2023
Viaarxiv icon

Self-Infilling Code Generation

Add code
Nov 29, 2023
Figure 1 for Self-Infilling Code Generation
Figure 2 for Self-Infilling Code Generation
Figure 3 for Self-Infilling Code Generation
Figure 4 for Self-Infilling Code Generation
Viaarxiv icon

Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis

Add code
Nov 28, 2023
Viaarxiv icon

Let's reward step by step: Step-Level reward model as the Navigators for Reasoning

Add code
Oct 16, 2023
Figure 1 for Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Figure 2 for Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Figure 3 for Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Figure 4 for Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Viaarxiv icon

LoBaSS: Gauging Learnability in Supervised Fine-tuning Data

Add code
Oct 16, 2023
Figure 1 for LoBaSS: Gauging Learnability in Supervised Fine-tuning Data
Figure 2 for LoBaSS: Gauging Learnability in Supervised Fine-tuning Data
Figure 3 for LoBaSS: Gauging Learnability in Supervised Fine-tuning Data
Figure 4 for LoBaSS: Gauging Learnability in Supervised Fine-tuning Data
Viaarxiv icon