Picture for Liyu Chen

Liyu Chen

Teaching Language Models to Critique via Reinforcement Learning

Add code
Feb 05, 2025
Figure 1 for Teaching Language Models to Critique via Reinforcement Learning
Figure 2 for Teaching Language Models to Critique via Reinforcement Learning
Figure 3 for Teaching Language Models to Critique via Reinforcement Learning
Figure 4 for Teaching Language Models to Critique via Reinforcement Learning
Viaarxiv icon

TACLR: A Scalable and Efficient Retrieval-based Method for Industrial Product Attribute Value Identification

Add code
Jan 07, 2025
Viaarxiv icon

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Add code
Oct 10, 2024
Figure 1 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 2 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 3 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 4 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Viaarxiv icon

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data

Add code
Oct 01, 2024
Figure 1 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 2 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 3 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 4 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Viaarxiv icon

Effective Diffusion Transformer Architecture for Image Super-Resolution

Add code
Sep 29, 2024
Figure 1 for Effective Diffusion Transformer Architecture for Image Super-Resolution
Figure 2 for Effective Diffusion Transformer Architecture for Image Super-Resolution
Figure 3 for Effective Diffusion Transformer Architecture for Image Super-Resolution
Figure 4 for Effective Diffusion Transformer Architecture for Image Super-Resolution
Viaarxiv icon

Collaboration of Teachers for Semi-supervised Object Detection

Add code
May 22, 2024
Figure 1 for Collaboration of Teachers for Semi-supervised Object Detection
Figure 2 for Collaboration of Teachers for Semi-supervised Object Detection
Figure 3 for Collaboration of Teachers for Semi-supervised Object Detection
Figure 4 for Collaboration of Teachers for Semi-supervised Object Detection
Viaarxiv icon

$\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Add code
Mar 11, 2024
Figure 1 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 2 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 3 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 4 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Viaarxiv icon

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Add code
Oct 04, 2023
Figure 1 for $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Figure 2 for $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Figure 3 for $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Figure 4 for $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Viaarxiv icon

Layered State Discovery for Incremental Autonomous Exploration

Add code
Feb 07, 2023
Figure 1 for Layered State Discovery for Incremental Autonomous Exploration
Figure 2 for Layered State Discovery for Incremental Autonomous Exploration
Viaarxiv icon

Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

Add code
Oct 10, 2022
Figure 1 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Figure 2 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Figure 3 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Viaarxiv icon