Picture for Peng Wu

Peng Wu

Diagnosing Training Inference Mismatch in LLM Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

Efficient Matrix Implementation for Rotary Position Embedding

Add code
Apr 10, 2026
Viaarxiv icon

Towards Video Anomaly Detection from Event Streams: A Baseline and Benchmark Datasets

Add code
Mar 26, 2026
Viaarxiv icon

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Add code
Mar 15, 2026
Viaarxiv icon

HiFloat4 Format for Language Model Inference

Add code
Feb 13, 2026
Viaarxiv icon

Talos: Optimizing Top-$K$ Accuracy in Recommender Systems

Add code
Jan 27, 2026
Viaarxiv icon

SliceLens: Fine-Grained and Grounded Error Slice Discovery for Multi-Instance Vision Tasks

Add code
Dec 31, 2025
Viaarxiv icon

Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization

Add code
Dec 29, 2025
Viaarxiv icon

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

Add code
Dec 26, 2025
Viaarxiv icon

GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction

Add code
Dec 18, 2025
Figure 1 for GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction
Figure 2 for GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction
Figure 3 for GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction
Figure 4 for GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction
Viaarxiv icon