Picture for Peizhong Ju

Peizhong Ju

Discrete Flow Matching for Offline-to-Online Reinforcement Learning

Add code
May 12, 2026
Viaarxiv icon

Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels

Add code
May 12, 2026
Viaarxiv icon

An LP-based Sampling Policy for Multi-Armed Bandits with Side-Observations and Stochastic Availability

Add code
Mar 27, 2026
Viaarxiv icon

Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual

Add code
Feb 25, 2026
Viaarxiv icon

Flow Matching for Offline Reinforcement Learning with Discrete Actions

Add code
Feb 05, 2026
Viaarxiv icon

Evaluating Sparse Autoencoders for Monosemantic Representation

Add code
Aug 20, 2025
Viaarxiv icon

FSL-SAGE: Accelerating Federated Split Learning via Smashed Activation Gradient Estimation

Add code
May 29, 2025
Viaarxiv icon

BeST -- A Novel Source Selection Metric for Transfer Learning

Add code
Jan 19, 2025
Figure 1 for BeST -- A Novel Source Selection Metric for Transfer Learning
Figure 2 for BeST -- A Novel Source Selection Metric for Transfer Learning
Figure 3 for BeST -- A Novel Source Selection Metric for Transfer Learning
Figure 4 for BeST -- A Novel Source Selection Metric for Transfer Learning
Viaarxiv icon

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization

Add code
Dec 17, 2024
Viaarxiv icon

How to Find the Exact Pareto Front for Multi-Objective MDPs?

Add code
Oct 21, 2024
Figure 1 for How to Find the Exact Pareto Front for Multi-Objective MDPs?
Figure 2 for How to Find the Exact Pareto Front for Multi-Objective MDPs?
Figure 3 for How to Find the Exact Pareto Front for Multi-Objective MDPs?
Figure 4 for How to Find the Exact Pareto Front for Multi-Objective MDPs?
Viaarxiv icon