Picture for Wenpin Tang

Wenpin Tang

DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
Figure 2 for DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
Figure 3 for DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
Figure 4 for DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
Viaarxiv icon

Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series

Add code
Sep 04, 2025
Figure 1 for Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series
Figure 2 for Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series
Figure 3 for Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series
Figure 4 for Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series
Viaarxiv icon

Fine-Tuning Diffusion Generative Models via Rich Preference Optimization

Add code
Mar 13, 2025
Figure 1 for Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Figure 2 for Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Figure 3 for Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Figure 4 for Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Viaarxiv icon

Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning

Add code
Feb 03, 2025
Figure 1 for Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Figure 2 for Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Figure 3 for Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Figure 4 for Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Viaarxiv icon

Regret of exploratory policy improvement and $q$-learning

Add code
Nov 02, 2024
Figure 1 for Regret of exploratory policy improvement and $q$-learning
Viaarxiv icon

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization

Add code
Oct 05, 2024
Figure 1 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 2 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 3 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 4 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Viaarxiv icon

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Add code
Sep 17, 2024
Figure 1 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 2 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 3 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 4 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Viaarxiv icon

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning

Add code
Sep 12, 2024
Viaarxiv icon

Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions

Add code
May 23, 2024
Figure 1 for Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions
Figure 2 for Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions
Figure 3 for Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions
Figure 4 for Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions
Viaarxiv icon

Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond

Add code
Mar 12, 2024
Viaarxiv icon