Picture for Bolin Ding

Bolin Ding

Incentivizing Reasoning from Weak Supervision

Add code
May 26, 2025
Figure 1 for Incentivizing Reasoning from Weak Supervision
Figure 2 for Incentivizing Reasoning from Weak Supervision
Figure 3 for Incentivizing Reasoning from Weak Supervision
Figure 4 for Incentivizing Reasoning from Weak Supervision
Viaarxiv icon

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Enhancing Latent Computation in Transformers with Latent Tokens

Add code
May 19, 2025
Figure 1 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 2 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 3 for Enhancing Latent Computation in Transformers with Latent Tokens
Figure 4 for Enhancing Latent Computation in Transformers with Latent Tokens
Viaarxiv icon

Tree-based Models for Vertical Federated Learning: A Survey

Add code
Apr 03, 2025
Viaarxiv icon

RePO: ReLU-based Preference Optimization

Add code
Mar 10, 2025
Viaarxiv icon

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Add code
Feb 17, 2025
Figure 1 for ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
Figure 2 for ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
Figure 3 for ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
Figure 4 for ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models
Viaarxiv icon

KIMAs: A Configurable Knowledge Integrated Multi-Agent System

Add code
Feb 13, 2025
Viaarxiv icon

Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering

Add code
Jan 14, 2025
Figure 1 for Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering
Figure 2 for Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering
Figure 3 for Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering
Figure 4 for Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering
Viaarxiv icon

HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data

Add code
Dec 23, 2024
Figure 1 for HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data
Figure 2 for HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data
Figure 3 for HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data
Figure 4 for HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data
Viaarxiv icon

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Add code
Nov 29, 2024
Figure 1 for A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
Figure 2 for A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
Figure 3 for A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
Figure 4 for A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
Viaarxiv icon