Picture for Jianbo Yuan

Jianbo Yuan

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Add code
Feb 17, 2025
Figure 1 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 2 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 3 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 4 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Viaarxiv icon

Unconstrained Model Merging for Enhanced LLM Reasoning

Add code
Oct 17, 2024
Figure 1 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 2 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 3 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 4 for Unconstrained Model Merging for Enhanced LLM Reasoning
Viaarxiv icon

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data

Add code
Oct 01, 2024
Figure 1 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 2 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 3 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 4 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Viaarxiv icon

Law of Vision Representation in MLLMs

Add code
Aug 29, 2024
Figure 1 for Law of Vision Representation in MLLMs
Figure 2 for Law of Vision Representation in MLLMs
Figure 3 for Law of Vision Representation in MLLMs
Figure 4 for Law of Vision Representation in MLLMs
Viaarxiv icon

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing

Add code
Mar 25, 2024
Figure 1 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 2 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 3 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 4 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Viaarxiv icon

How Can LLM Guide RL? A Value-Based Approach

Add code
Feb 25, 2024
Figure 1 for How Can LLM Guide RL? A Value-Based Approach
Figure 2 for How Can LLM Guide RL? A Value-Based Approach
Figure 3 for How Can LLM Guide RL? A Value-Based Approach
Figure 4 for How Can LLM Guide RL? A Value-Based Approach
Viaarxiv icon

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning

Add code
Jan 18, 2024
Figure 1 for Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Figure 2 for Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Figure 3 for Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Figure 4 for Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Viaarxiv icon

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

Add code
Jan 10, 2024
Figure 1 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 2 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 3 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 4 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Viaarxiv icon

InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Add code
Dec 04, 2023
Figure 1 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 2 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 3 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Figure 4 for InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Viaarxiv icon