Picture for Xunliang Cai

Xunliang Cai

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Add code
Nov 25, 2024
Figure 1 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 2 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 3 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 4 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Viaarxiv icon

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Add code
Nov 05, 2024
Viaarxiv icon

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Add code
Nov 01, 2024
Viaarxiv icon

Multi-Programming Language Sandbox for LLMs

Add code
Oct 30, 2024
Figure 1 for Multi-Programming Language Sandbox for LLMs
Figure 2 for Multi-Programming Language Sandbox for LLMs
Figure 3 for Multi-Programming Language Sandbox for LLMs
Figure 4 for Multi-Programming Language Sandbox for LLMs
Viaarxiv icon

FIRP: Faster LLM inference via future intermediate representation prediction

Add code
Oct 27, 2024
Viaarxiv icon

EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference

Add code
Oct 16, 2024
Viaarxiv icon

Length Desensitization in Directed Preference Optimization

Add code
Sep 10, 2024
Figure 1 for Length Desensitization in Directed Preference Optimization
Figure 2 for Length Desensitization in Directed Preference Optimization
Figure 3 for Length Desensitization in Directed Preference Optimization
Figure 4 for Length Desensitization in Directed Preference Optimization
Viaarxiv icon

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Add code
Sep 05, 2024
Viaarxiv icon

S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners

Add code
Sep 03, 2024
Figure 1 for S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Figure 2 for S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Figure 3 for S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Figure 4 for S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners
Viaarxiv icon

ReMamba: Equip Mamba with Effective Long-Sequence Modeling

Add code
Sep 01, 2024
Viaarxiv icon