Picture for Minpeng Liao

Minpeng Liao

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Add code
Oct 06, 2025
Viaarxiv icon

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Add code
Sep 16, 2025
Viaarxiv icon

From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization

Add code
Jul 09, 2025
Figure 1 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 2 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 3 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Figure 4 for From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
Viaarxiv icon

Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture

Add code
Apr 16, 2025
Figure 1 for Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture
Figure 2 for Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture
Figure 3 for Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture
Figure 4 for Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture
Viaarxiv icon

LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline

Add code
Apr 13, 2025
Viaarxiv icon

C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation

Add code
Feb 10, 2025
Figure 1 for C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Figure 2 for C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Figure 3 for C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Figure 4 for C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Viaarxiv icon

Markov Chain of Thought for Efficient Mathematical Reasoning

Add code
Oct 23, 2024
Figure 1 for Markov Chain of Thought for Efficient Mathematical Reasoning
Figure 2 for Markov Chain of Thought for Efficient Mathematical Reasoning
Figure 3 for Markov Chain of Thought for Efficient Mathematical Reasoning
Figure 4 for Markov Chain of Thought for Efficient Mathematical Reasoning
Viaarxiv icon

Step-level Value Preference Optimization for Mathematical Reasoning

Add code
Jun 16, 2024
Figure 1 for Step-level Value Preference Optimization for Mathematical Reasoning
Figure 2 for Step-level Value Preference Optimization for Mathematical Reasoning
Figure 3 for Step-level Value Preference Optimization for Mathematical Reasoning
Figure 4 for Step-level Value Preference Optimization for Mathematical Reasoning
Viaarxiv icon

BLSP-Emo: Towards Empathetic Large Speech-Language Models

Add code
Jun 06, 2024
Viaarxiv icon

BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation

Add code
May 29, 2024
Viaarxiv icon