Liang Tan

Jack

Boosting LLM Reasoning via Spontaneous Self-Correction

Jun 07, 2025

LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Training

May 29, 2025

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Dec 18, 2024

Self-Generated Critiques Boost Reward Modeling for Language Models

Nov 25, 2024

Law of the Weakest Link: Cross Capabilities of Large Language Models

Sep 30, 2024

The Llama 3 Herd of Models

Jul 31, 2024

Audiobox: Unified Audio Generation with Natural Language Prompts

Dec 25, 2023

Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes

May 22, 2023

Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning

Apr 04, 2023

FRAME: Evaluating Simulatability Metrics for Free-Text Rationales

Jul 02, 2022