Xueqian Wang

No Cache Left Idle: Accelerating diffusion model via Extreme-slimming Caching

Dec 14, 2025

UACER: An Uncertainty-Aware Critic Ensemble Framework for Robust Adversarial Reinforcement Learning

Dec 11, 2025

Count Every Rotation and Every Rotation Counts: Exploring Drone Dynamics via Propeller Sensing

Nov 17, 2025

Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

Oct 24, 2025

A Hybrid Force-Position Strategy for Shape Control of Deformable Linear Objects With Graph Attention Networks

Aug 10, 2025

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

May 30, 2025

Lifelong Safety Alignment for Language Models

May 26, 2025

Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

May 24, 2025

VTire: A Bimodal Visuotactile Tire with High-Resolution Sensing Capability

Apr 27, 2025

Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding

Apr 25, 2025