Zhendong Wang

Audio-Aware Large Language Models as Judges for Speaking Styles

Jun 06, 2025

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Jun 05, 2025

CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation

May 21, 2025

Few-Step Diffusion via Score identity Distillation

May 19, 2025

Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation

May 19, 2025

GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling

Apr 02, 2025

Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation

Mar 10, 2025

DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Mar 03, 2025

SmartEraser: Remove Anything from Images using Masked-Region Guidance

Jan 14, 2025

Enhancing and Accelerating Diffusion-Based Inverse Problem Solving through Measurements Optimization

Dec 05, 2024