Picture for Chengchun Shi

Chengchun Shi

Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic

Add code
Mar 03, 2026
Viaarxiv icon

A Difference-in-Difference Approach to Detecting AI-Generated Images

Add code
Feb 27, 2026
Viaarxiv icon

Designing Time Series Experiments in A/B Testing with Transformer Reinforcement Learning

Add code
Feb 02, 2026
Viaarxiv icon

Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Add code
Jan 29, 2026
Viaarxiv icon

Double Fairness Policy Learning: Integrating Action Fairness and Outcome Fairness in Decision-making

Add code
Jan 27, 2026
Viaarxiv icon

Detecting LLM-Generated Text with Performance Guarantees

Add code
Jan 10, 2026
Viaarxiv icon

ReDiF: Reinforced Distillation for Few Step Diffusion

Add code
Dec 28, 2025
Viaarxiv icon

PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing

Add code
Oct 08, 2025
Viaarxiv icon

Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning

Add code
Jun 07, 2025
Viaarxiv icon

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation

Add code
May 28, 2025
Viaarxiv icon